BLASTP 2.2.23 [Feb-03-2010]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= P31547
         (217 letters)

Database: ../databases/nrPDB-GO_2019.06.18_sequences.fasta
           36,641 sequences; 10,336,785 total letters

Searching..................................................done


Results from round 1


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

3DHW-A nrPDB                                                      418   e-118
4YMU-D nrPDB                                                       42   2e-04
4XTC-N nrPDB                                                       35   0.033
2RD5-A nrPDB                                                       32   0.24
3RLF-G nrPDB                                                       29   1.4
4KHZ-F nrPDB                                                       28   3.6
4XIG-M nrPDB                                                       27   4.6
4O1J-A nrPDB                                                       27   5.1

>3DHW-A nrPDB
          Length = 217

 Score = 418 bits (1075), Expect = e-118, Method: Compositional matrix adjust.
Identities = 217/217 (100%), Positives = 217/217 (100%) Query: 1 MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIV 60 MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIV Sbjct: 1 MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIV 60 Query: 61 NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLI 120 NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLI Sbjct: 61 NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLI 120 Query: 121 EASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG 180 EASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG Sbjct: 121 EASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG 180 Query: 181 YIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK 217 YIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK Sbjct: 181 YIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK 217 >4YMU-D nrPDB Length = 220 Score = 42.4 bits (98), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 38/172 (22%), Positives = 88/172 (51%), Gaps = 17/172 (9%) Query: 8 LLVRGVWETLAMTFVSGFFGFVIGLPVGVL-LYVTRPGQIIANAKLYRTVSAIVNIFRSI 66 L + G+ TL +TF++ G ++GL + ++ + +P +++A S+ + + R Sbjct: 14 LFISGLIMTLKLTFLAVTIGVLMGLFIALMKMSSIKPIKLVA--------SSYIEVIRGT 65 Query: 67 PFII--LLVW--MIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLIEA 122 P ++ LL++ ++ F I + G+ A L + ++ ++A ++ + + G EA Sbjct: 66 PLLVQLLLIYNGLMQFGMNIPAFTAGVSA----LAINSSAYVAEIIRAGIQAVDPGQNEA 121 Query: 123 SRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQ 174 +R++G T +R V++P+A+ ++ A I ++ SA+ +G L + Sbjct: 122 ARSLGMTHAMAMRYVIIPQAIKNILPALGNEFIVMLKESAIVSVIGFADLTR 173 >4XTC-N nrPDB Length = 305 Score = 34.7 bits (78), Expect = 0.033, Method: Compositional matrix adjust. 
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%) Query: 107 MVENALLEIPTGLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVG 159 ++ N I EA+R GA +QI+ KV +P A P L ATITL+ + Sbjct: 159 LMRNYFESISASFEEAARMDGANDLQILWKVYIPLAKPAL---ATITLLCAIS 208 >2RD5-A nrPDB Length = 298 Score = 32.0 bits (71), Expect = 0.24, Method: Compositional matrix adjust. Identities = 36/115 (31%), Positives = 52/115 (45%), Gaps = 18/115 (15%) Query: 96 LTVGAAPFIARMVENALLEIPTGLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLI 155 L G P I R ++ L IP + R AT M+IV VL+ G VN ++LI Sbjct: 73 LVHGGGPDINRYLKQ--LNIPAEFRDGLRVTDATTMEIVSMVLV-----GKVNKNLVSLI 125 Query: 156 TLVGYSAMGGAVGAGGLGQI--------GYQYGYIGYNATVMNTVLVLLVILVYL 202 G +A+G +G G++ Q G++G A V +VL LV Y+ Sbjct: 126 NAAGATAVG---LSGHDGRLLTARPVPNSAQLGFVGEVARVDPSVLRPLVDYGYI 177 >3RLF-G nrPDB Length = 296 Score = 29.3 bits (64), Expect = 1.4, Method: Compositional matrix adjust. Identities = 15/30 (50%), Positives = 19/30 (63%) Query: 115 IPTGLIEASRAMGATPMQIVRKVLLPEALP 144 I + L EA+ GATP Q R VLLP ++P Sbjct: 184 IDSSLEEAAALDGATPWQAFRLVLLPLSVP 213 >4KHZ-F nrPDB Length = 514 Score = 27.7 bits (60), Expect = 3.6, Method: Compositional matrix adjust. Identities = 15/39 (38%), Positives = 19/39 (48%) Query: 102 PFIARMVENALLEIPTGLIEASRAMGATPMQIVRKVLLP 140 P++ + L IP L EAS GA P Q K+ LP Sbjct: 382 PYMMILCMGLLKAIPDDLYEASAMDGAGPFQNFFKITLP 420 >4XIG-M nrPDB Length = 301 Score = 27.3 bits (59), Expect = 4.6, Method: Compositional matrix adjust. Identities = 21/85 (24%), Positives = 41/85 (48%), Gaps = 15/85 (17%) Query: 111 ALLEIPTGLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAG 170 A++ I L E+++ GAT Q++ ++ LP +P + A + +I L G Sbjct: 184 AIMSINPALYESAQVDGATRWQMITRITLPCIVPTI---AVLLVIRL------------G 228 Query: 171 GLGQIGYQYGYIGYNATVMNTVLVL 195 + ++G++Y + Y T T V+ Sbjct: 229 HILEVGFEYIILLYQPTTYETADVI 253 >4O1J-A nrPDB Length = 255 Score = 27.3 bits (59), Expect = 5.1, Method: Compositional matrix adjust. 
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 151 TITLITLVGYSAMGGAVGAGGLGQIG 176
            +  I L G+SA GGA GA   G+IG
Sbjct: 103 KVKHIVLCGHSACGGAAGALSDGRIG 128


Searching..................................................done


Results from round 2


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

3DHW-A nrPDB                                                      263   3e-71
4YMU-D nrPDB                                                      154   3e-38

Sequences not found previously or not previously below threshold:

3D31-C nrPDB                                                       31   0.31
3RLF-G nrPDB                                                       30   0.58
3MWB-A nrPDB                                                       30   0.92
4KHZ-F nrPDB                                                       30   1.2
3J7Y-4 nrPDB                                                       27   5.0
3QLL-A nrPDB                                                       27   5.3
1LLU-A nrPDB                                                       26   8.8

CONVERGED!
>3DHW-A nrPDB
          Length = 217

 Score = 263 bits (673), Expect = 3e-71, Method: Composition-based stats.
 Identities = 217/217 (100%), Positives = 217/217 (100%)

Query: 1   MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIV 60
           MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIV
Sbjct: 1   MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIV 60

Query: 61  NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLI 120
           NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLI
Sbjct: 61  NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLI 120

Query: 121 EASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG 180
           EASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG
Sbjct: 121 EASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG 180

Query: 181 YIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK 217
           YIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK
Sbjct: 181 YIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK 217

>4YMU-D nrPDB
          Length = 220

 Score = 154 bits (389), Expect = 3e-38, Method: Composition-based stats.
Identities = 38/172 (22%), Positives = 88/172 (51%), Gaps = 17/172 (9%) Query: 8 LLVRGVWETLAMTFVSGFFGFVIGLPVGVL-LYVTRPGQIIANAKLYRTVSAIVNIFRSI 66 L + G+ TL +TF++ G ++GL + ++ + +P +++A S+ + + R Sbjct: 14 LFISGLIMTLKLTFLAVTIGVLMGLFIALMKMSSIKPIKLVA--------SSYIEVIRGT 65 Query: 67 PFII--LLVW--MIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLIEA 122 P ++ LL++ ++ F I + G+ A L + ++ ++A ++ + + G EA Sbjct: 66 PLLVQLLLIYNGLMQFGMNIPAFTAGVSA----LAINSSAYVAEIIRAGIQAVDPGQNEA 121 Query: 123 SRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGLGQ 174 +R++G T +R V++P+A+ ++ A I ++ SA+ +G L + Sbjct: 122 ARSLGMTHAMAMRYVIIPQAIKNILPALGNEFIVMLKESAIVSVIGFADLTR 173 >3D31-C nrPDB Length = 295 Score = 31.5 bits (70), Expect = 0.31, Method: Composition-based stats. Identities = 21/120 (17%), Positives = 44/120 (36%), Gaps = 1/120 (0%) Query: 85 TSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLIEASRAMGATPMQIVRKVLLPEALP 144 L +V + + P++A + L A+R++GA + V LP + Sbjct: 159 FRDALPGIVVAMLFVSMPYLANSAREGFKSVDPRLENAARSLGAPLWKAFFFVTLPLSAR 218 Query: 145 GLVNAATITLITLVGYSAMGGAVGAGGLGQIGYQYG-YIGYNATVMNTVLVLLVILVYLI 203 L+ + +T + + + Y +I Y + + VLL+++ I Sbjct: 219 YLLIGSVMTWARAISEFGAVVILAYYPMVGPTLIYDRFISYGLSASRPIAVLLILVTLSI 278 >3RLF-G nrPDB Length = 296 Score = 30.3 bits (67), Expect = 0.58, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 36/100 (36%) Query: 68 FIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLIEASRAMG 127 + L I + ++ +G ++ I + L EA+ G Sbjct: 137 LVALYALFDRLGEYIPFIGLNTHGGVIFAYLGGIALHVWTIKGYFETIDSSLEEAAALDG 196 Query: 128 ATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAV 167 ATP Q R VLLP ++P L ++ I + + + Sbjct: 197 ATPWQAFRLVLLPLSVPILAVVFILSFIAAITEVPVASLL 236 >3MWB-A nrPDB Length = 313 Score = 30.0 bits (66), Expect = 0.92, Method: Composition-based stats. 
Identities = 21/94 (22%), Positives = 37/94 (39%), Gaps = 4/94 (4%) Query: 76 IPFTRVIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLIEASRAMGATPMQIVR 135 +P + G+S A + L AP+ A + + GL + +G P + R Sbjct: 126 LPNADYVPGSSTAASA--MGLLEDDAPYEAAICAPLIAAEQPGLNVLAEDIGDNPDAVTR 183 Query: 136 KVLL--PEALPGLVNAATITLITLVGYSAMGGAV 167 +L+ P ALP A T++ + G + Sbjct: 184 FILVSRPGALPERTGADKTTVVVPLPEDHPGALM 217 >4KHZ-F nrPDB Length = 514 Score = 29.6 bits (65), Expect = 1.2, Method: Composition-based stats. Identities = 35/184 (19%), Positives = 64/184 (34%), Gaps = 8/184 (4%) Query: 1 MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPV---GVLLYVTRPGQIIANAKLYRTVS 57 + +P + + V V +L F++ G V+ V + +I + +S Sbjct: 273 IQKPFLAIFVWTVVFSLITVFLTVAVGMVLACLVQWEALRGKAVYRVLLILPYAVPSFIS 332 Query: 58 AIV--NIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVPLTVGA---APFIARMVENAL 112 ++ +F I ++ F S A + + V P++ + L Sbjct: 333 ILIFKGLFNQSFGEINMMLSALFGVKPAWFSDPTTARTMLIIVNTWLGYPYMMILCMGLL 392 Query: 113 LEIPTGLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAGGL 172 IP L EAS GA P Q K+ LP + L + + + GG Sbjct: 393 KAIPDDLYEASAMDGAGPFQNFFKITLPLLIKPLTPLMIASFAFNFNNFVLIQLLTNGGP 452 Query: 173 GQIG 176 ++G Sbjct: 453 DRLG 456 >3J7Y-4 nrPDB Length = 103 Score = 27.3 bits (59), Expect = 5.0, Method: Composition-based stats. Identities = 12/42 (28%), Positives = 18/42 (42%) Query: 118 GLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVG 159 G I + + P VR +L P LP L+ A T++ Sbjct: 32 GSIRGAAPVAVEPGAAVRSLLSPGLLPHLLPALGFKNKTVLK 73 >3QLL-A nrPDB Length = 316 Score = 27.3 bits (59), Expect = 5.3, Method: Composition-based stats. Identities = 14/81 (17%), Positives = 34/81 (41%), Gaps = 4/81 (4%) Query: 36 VLLYVTRPGQIIANAKLYRTVSAIVNIFRSIPFIILLVWMIPFTRVIVGTSIGLQAAIVP 95 ++++ ++I + R ++A+ +I + P + L+ F + IG + P Sbjct: 150 LMMFAGSDTRLIGIIESVRGLNAVESIAAATPKLAGLI----FGAADMAADIGAASTWEP 205 Query: 96 LTVGAAPFIARMVENALLEIP 116 L + A ++ N + I Sbjct: 206 LALARARLVSACAMNGIPAID 226 >1LLU-A nrPDB Length = 342 Score = 26.5 bits (57), Expect = 8.8, Method: Composition-based stats. 
 Identities = 18/60 (30%), Positives = 29/60 (48%)

Query: 111 ALLEIPTGLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGYSAMGGAVGAG 170
           A ++I    +E +R +GA+     R+    EA+   +  A   L+T V  SA G A+G
Sbjct: 198 AAIDIDDAKLELARKLGASLTVNARQEDPVEAIQRDIGGAHGVLVTAVSNSAFGQAIGMA 257


  Database: ../databases/nrPDB-GO_2019.06.18_sequences.fasta
    Posted date: Jan 3, 2023 9:05 PM
  Number of letters in database: 10,336,785
  Number of sequences in database: 36,641

Lambda     K      H
   0.327    0.141    0.375

Lambda     K      H
   0.267   0.0435    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,138,825
Number of Sequences: 36641
Number of extensions: 315816
Number of successful extensions: 1141
Number of sequences better than 10.0: 29
Number of HSP's better than 10.0 without gapping: 21
Number of HSP's successfully gapped in prelim test: 35
Number of HSP's that attempted gapping in prelim test: 1107
Number of HSP's gapped (non-prelim): 59
length of query: 217
length of database: 10,336,785
effective HSP length: 93
effective length of query: 124
effective length of database: 6,929,172
effective search space: 859217328
effective search space used: 859217328
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 57 (26.5 bits)