BLASTP 2.2.23 [Feb-03-2010]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= O14368
         (143 letters)

Database: ../databases/nrPDB-GO_2019.06.18_sequences.fasta
           36,641 sequences; 10,336,785 total letters

Searching..................................................done

Results from round 1

                                                      Score     E
Sequences producing significant alignments:           (bits)  Value

3W1Z-A nrPDB                                            291   5e-80
4ZJA-A nrPDB                                             47   2e-06
4I88-A nrPDB                                             45   1e-05
1GME-A nrPDB                                             41   2e-04
4YLC-A nrPDB                                             37   0.002
2AQX-A nrPDB                                             27   2.3
3K85-A nrPDB                                             27   2.4
3IWK-A nrPDB                                             27   2.5
2XKJ-E nrPDB                                             26   6.4
5AZA-A nrPDB                                             26   7.4

>3W1Z-A nrPDB
          Length = 143

Score = 291 bits (746), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 143/143 (100%), Positives = 143/143 (100%)

Query: 1   MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKE 60
           MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKE
Sbjct: 1   MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKE 60

Query: 61  DVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFS 120
           DVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFS
Sbjct: 61  DVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFS 120

Query: 121 NGLLTVTLPKVEKSQTKKQIAIK 143
           NGLLTVTLPKVEKSQTKKQIAIK
Sbjct: 121 NGLLTVTLPKVEKSQTKKQIAIK 143

>4ZJA-A nrPDB
          Length = 147

Score = 47.4 bits (111), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 77/153 (50%), Gaps = 18/153 (11%)

Query: 1   MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGE----LSPSIDVHE-GKDTVSVDVELP 55
           M+L+ P + LFSD + RL +Q+ G+ +P+ D+ + + + V +P
Sbjct: 2   MALRTLSALPVFADSLFSDRFNRIDRLFSQLTGDTPVAATPAYDLQKRDANNYLLTVSVP 61

Query: 56  GVKKEDVQVHYDSGKLTISGEVVNERKNESTEGNQRWSER--RFGSFSRTITIP--AKID 111
           G K+E++++ G L I+G + E+ E W R R F + ++P AK++
Sbjct: 62  GWKEEELEIETVGGNLNITG----KHTEETVEDQTHWIYRGIRKADFQLSFSLPEHAKVN 117

Query: 112 ADRIEANFSNGLLTVTL-PKVEKSQTKKQIAIK 143
           ++E GLL V + ++ +S+ K+IAI+
Sbjct: 118 NAKLE----QGLLLVEIYQEIPESEKPKKIAIE 146

>4I88-A nrPDB
          Length = 147

Score = 45.1 bits (105), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 54/121 (44%), Gaps = 16/121 (13%)

Query: 30  QIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVHYDSGKLTISGEVVNERKNESTEGN 89
           QI G+ I + EG + V LPGV KED+ ++ L I + ES
Sbjct: 36  QISGKGFMPISIIEGDQHIKVIAWLPGVNKEDIILNAVGDTLEIRAKRSPLMITES---- 91

Query: 90  QRWSERRFGS-------FSRTITIPAKIDADRIEANFSNGLLTVTLPKVEKSQTKKQIAI 142
           ER S RTI +PA + + A F NG+L+V LPK E S KK I I
Sbjct: 92  ----ERIIYSEIPEEEEIYRTIKLPATVKEENASAKFENGVLSVILPKAE-SSIKKGINI 146

Query: 143 K 143
           +
Sbjct: 147 E 147

>1GME-A nrPDB
          Length = 151

Score = 41.2 bits (95), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 9   FPPTVNDLFSDFVSYSPRLN---NQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVH 65
           F D F F S P ++ ++ + +D E + +LPGVKKE+V+V
Sbjct: 13  FADLWADPFDTFRSIVPAISGGGSETAAFANARMDWKETPEAHVFKADLPGVKKEEVKVE 72

Query: 66  YDSGKLTISGEVVNERKNESTEGNQRWS--ERRFGSFSRTITIPAKIDADRIEANFSNGL 123
           + G + + V ER E + N +W ER G F R + + ++A NG+
Sbjct: 73  VEDGNVLV---VSGERTKEKEDKNDKWHRVERSSGKFVRRFRLLEDAKVEEVKAGLENGV 129

Query: 124 LTVTLPKVE 132
           LTVT+PK E
Sbjct: 130 LTVTVPKAE 138

>4YLC-A nrPDB
          Length = 124

Score = 37.4 bits (85), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/94 (24%), Positives = 47/94 (50%), Gaps = 6/94 (6%)

Query: 37  PSIDVHEGKDTVSVDVELPGVKKEDVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERR 96
           P ID++E + V +L G K+ + V +L+ E++ + E +++ +R
Sbjct: 30  PPIDMYEEGGELVVVADLAGFNKDKISV-----RLSAQNELIINAEREIQYIGTKYATQR 84

Query: 97  FGSFSRTITIPAKIDAD-RIEANFSNGLLTVTLP 129
           + I +P K+ D ++ A + NG+LT+ +P
Sbjct: 85  PLKIHKVIRLPVKVKRDSQVTAKYENGVLTIRIP 118

>2AQX-A nrPDB
          Length = 289

Score = 27.3 bits (59), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 47  TVSVDVELPGVKKEDVQVHYDSGKLTISGEVVNERKNESTEGNQ 90
           T ++ + G+KKED V+ D K T + E V E E T+GNQ
Sbjct: 153 TATLGFRIEGIKKEDGSVNRDFKK-TKTREQVTEAFREFTKGNQ 195

>3K85-A nrPDB
          Length = 357

Score = 27.3 bits (59), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 60  EDVQVHYDSGKLTISGEVVNERKNESTEGNQ 90
           E V Y +G+ S ++NE+K ++EGNQ
Sbjct: 196 ESSMVLYFTGRSRSSAAIINEQKKNTSEGNQ 226

>3IWK-A nrPDB
          Length = 503

Score = 27.3 bits (59), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 36/76 (47%), Gaps = 11/76 (14%)

Query: 16  LFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVHYDSGKLTISG 75
           LF D P LN +IP ++PS T ++ ++P KEDV + D+ K IS
Sbjct: 10  LFIDGEWRVPILNKRIP-NINPS--------TENIIGDIPAATKEDVDLAVDAAKRAISR 60

Query: 76  EVVNERKNESTEGNQR 91
           + N R + G+ R
Sbjct: 61  K--NGRDWSAASGSLR 74

>2XKJ-E nrPDB
          Length = 767

Score = 25.8 bits (55), Expect = 6.4, Method: Composition-based stats.
Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 3/56 (5%)

Query: 55  PGVKKEDVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKI 110
           P K + Y KL+ E++ +E +G W + GS IT+PA++
Sbjct: 377 PDDPKSFAAMRYTEAKLSAYSELL---LSELGQGTSEWQDNFDGSLKEPITLPARV 429

>5AZA-A nrPDB
          Length = 872

Score = 25.8 bits (55), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 21  VSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELP-GV---KKEDVQVHYDSGKLTISG 75
           ++ P+++ ++ +I + EG++TV V ELP GV K+++Q Y KL I G
Sbjct: 784 AAFEPQMDVFFITKIGENIQLKEGENTVKVRAELPEGVISSYKDELQRKYGD-KLIIRG 841

Searching..................................................done

Results from round 2

                                                      Score     E
Sequences producing significant alignments:           (bits)  Value
Sequences used in model and found again:

3W1Z-A nrPDB                                            200   1e-52
4ZJA-A nrPDB                                            156   3e-39
1GME-A nrPDB                                            128   6e-31
4I88-A nrPDB                                            119   5e-28

Sequences not found previously or not previously below threshold:

4YLC-A nrPDB                                             50   4e-07
4YDZ-A nrPDB                                             47   3e-06
2KLR-A nrPDB                                             43   5e-05
6F2R-T nrPDB                                             41   3e-04
5XYI-T nrPDB                                             28   1.3
3HVD-A nrPDB                                             26   6.2
3TEX-A nrPDB                                             26   7.4

>3W1Z-A nrPDB
          Length = 143

Score = 200 bits (509), Expect = 1e-52, Method: Composition-based stats.
Identities = 143/143 (100%), Positives = 143/143 (100%)

Query: 1   MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKE 60
           MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKE
Sbjct: 1   MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKE 60

Query: 61  DVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFS 120
           DVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFS
Sbjct: 61  DVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFS 120

Query: 121 NGLLTVTLPKVEKSQTKKQIAIK 143
           NGLLTVTLPKVEKSQTKKQIAIK
Sbjct: 121 NGLLTVTLPKVEKSQTKKQIAIK 143

>4ZJA-A nrPDB
          Length = 147

Score = 156 bits (394), Expect = 3e-39, Method: Composition-based stats.
Identities = 39/151 (25%), Positives = 72/151 (47%), Gaps = 14/151 (9%)

Query: 1   MSLQPFFGFPPTVNDLFSDFVSYSPRLNNQIPGE----LSPSIDVHE-GKDTVSVDVELP 55
           M+L+ P + LFSD + RL +Q+ G+ +P+ D+ + + + V +P
Sbjct: 2   MALRTLSALPVFADSLFSDRFNRIDRLFSQLTGDTPVAATPAYDLQKRDANNYLLTVSVP 61

Query: 56  GVKKEDVQVHYDSGKLTISGEVVNERKNESTEGNQRWSER--RFGSFSRTITIPAKIDAD 113
           G K+E++++ G L I+G + E+ E W R R F + ++P +
Sbjct: 62  GWKEEELEIETVGGNLNITG----KHTEETVEDQTHWIYRGIRKADFQLSFSLPEHAKVN 117

Query: 114 RIEANFSNGLLTVTL-PKVEKSQTKKQIAIK 143
           A GLL V + ++ +S+ K+IAI+
Sbjct: 118 N--AKLEQGLLLVEIYQEIPESEKPKKIAIE 146

>1GME-A nrPDB
          Length = 151

Score = 128 bits (322), Expect = 6e-31, Method: Composition-based stats.
Identities = 40/132 (30%), Positives = 61/132 (46%), Gaps = 8/132 (6%)

Query: 6   FFGFPPTVNDLFSDFVSYSPRLN---NQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDV 62
           F F D F F S P ++ ++ + +D E + +LPGVKKE+V
Sbjct: 10  FDPFADLWADPFDTFRSIVPAISGGGSETAAFANARMDWKETPEAHVFKADLPGVKKEEV 69

Query: 63  QVHYDSGKLTISGEVVNERKNESTEGNQRWS--ERRFGSFSRTITIPAKIDADRIEANFS 120
           +V + G + + V ER E + N +W ER G F R + + ++A
Sbjct: 70  KVEVEDGNVLV---VSGERTKEKEDKNDKWHRVERSSGKFVRRFRLLEDAKVEEVKAGLE 126

Query: 121 NGLLTVTLPKVE 132
           NG+LTVT+PK E
Sbjct: 127 NGVLTVTVPKAE 138

>4I88-A nrPDB
          Length = 147

Score = 119 bits (297), Expect = 5e-28, Method: Composition-based stats.
Identities = 42/142 (29%), Positives = 56/142 (39%), Gaps = 16/142 (11%)

Query: 9   FPPTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVHYDS 68
           F + QI G+ I + EG + V LPGV KED+ ++
Sbjct: 15  FKEFFATPMTGTTMIQSSTGIQISGKGFMPISIIEGDQHIKVIAWLPGVNKEDIILNAVG 74

Query: 69  GKLTISGEVVNERKNESTEGNQRWSERRFGS-------FSRTITIPAKIDADRIEANFSN 121
           L I + ES ER S RTI +PA + + A F N
Sbjct: 75  DTLEIRAKRSPLMITES--------ERIIYSEIPEEEEIYRTIKLPATVKEENASAKFEN 126

Query: 122 GLLTVTLPKVEKSQTKKQIAIK 143
           G+L+V LPK E S KK I I+
Sbjct: 127 GVLSVILPKAE-SSIKKGINIE 147

>4YLC-A nrPDB
          Length = 124

Score = 49.9 bits (117), Expect = 4e-07, Method: Composition-based stats.
Identities = 25/109 (22%), Positives = 49/109 (44%), Gaps = 8/109 (7%)

Query: 23  YSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVHYDSGK-LTISGEVVNER 81
           L+ + + P ID++E + V +L G K+ + V + L I+ E
Sbjct: 16  KLDELSREFYESVIPPIDMYEEGGELVVVADLAGFNKDKISVRLSAQNELIINAER---- 71

Query: 82  KNESTEGNQRWSERRFGSFSRTITIPAKIDAD-RIEANFSNGLLTVTLP 129
           E +++ +R + I +P K+ D ++ A + NG+LT+ +P
Sbjct: 72  --EIQYIGTKYATQRPLKIHKVIRLPVKVKRDSQVTAKYENGVLTIRIP 118

>4YDZ-A nrPDB
          Length = 159

Score = 46.8 bits (109), Expect = 3e-06, Method: Composition-based stats.
Identities = 29/144 (20%), Positives = 52/144 (36%), Gaps = 24/144 (16%)

Query: 3   LQPFFGFPPTVN---DLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKK 59
           + F P + ++F + P+ N+ V V +++ K
Sbjct: 15  FRDFEDMMPYWAQRHSMLNNFNNIVPQQLNE----------VENTAQKFCVKLDVAAFKP 64

Query: 60  EDVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANF 119
           E+++V+ + LTI G TE SF+R T+P +D I
Sbjct: 65  EELKVNLEGHVLTIEG-----HHEVKTE-----HGFSKRSFTRQFTLPKDVDLAHIHTVI 114

Query: 120 -SNGLLTVTLPKVEKSQTKKQIAI 142
           G +T+ PK + T + + I
Sbjct: 115 NKEGQMTIDAPKTGSNTTVRALPI 138

>2KLR-A nrPDB
          Length = 175

Score = 42.9 bits (99), Expect = 5e-05, Method: Composition-based stats.
Identities = 34/144 (23%), Positives = 60/144 (41%), Gaps = 16/144 (11%)

Query: 6   FFGFPPTVNDLF------SDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKK 59
           FFG +DLF S F P + ++ KD SV++++
Sbjct: 27  FFGEHLLESDLFPTSTSLSPFYLRPPSFLRAPSWFDTGLSEMRLEKDRFSVNLDVKHFSP 86

Query: 60  EDVQVHYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANF 119
           E+++V + + G+ +E + + R F R IPA +D I ++
Sbjct: 87  EELKVKVLGDVIEVHGK------HEERQDEHGFISR---EFHRKYRIPADVDPLTITSSL 137

Query: 120 -SNGLLTVTLPKVEKSQTKKQIAI 142
           S+G+LTV P+ + S ++ I I
Sbjct: 138 SSDGVLTVNGPRKQVSGPERTIPI 161

>6F2R-T nrPDB
          Length = 161

Score = 40.6 bits (93), Expect = 3e-04, Method: Composition-based stats.
Identities = 25/125 (20%), Positives = 46/125 (36%), Gaps = 17/125 (13%)

Query: 6   FFGFP-PTVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQV 64
           + P PT+ DL + SP +++ E P EGK + +++ ED+ +
Sbjct: 34  LYALPGPTIVDLRKTRAAQSPPVDS--AAETPPR----EGKSHFQILLDVVQFLPEDIII 87

Query: 65  HYDSGKLTISGEVVNERKNESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANF-SNGL 123
           G L I + SF+R +P ++ + A +G+
Sbjct: 88  QTFEGWLLIKAQHGTRMDE---------HGFISRSFTRQYKLPDGVEIKDLSAVLCHDGI 138

Query: 124 LTVTL 128
           L V +
Sbjct: 139 LVVEV 143

>5XYI-T nrPDB
          Length = 148

Score = 28.3 bits (61), Expect = 1.3, Method: Composition-based stats.
Identities = 13/58 (22%), Positives = 23/58 (39%)

Query: 83  NESTEGNQRWSERRFGSFSRTITIPAKIDADRIEANFSNGLLTVTLPKVEKSQTKKQI 140
           + NQ W R + R I + + ++ L VT+PK + ++K I
Sbjct: 50  KQMAPSNQNWFYTRAAAVIRQIYMHHDASLSGLSFHYGANLKAVTMPKHHHNASRKVI 107

>3HVD-A nrPDB
          Length = 548

Score = 26.0 bits (55), Expect = 6.2, Method: Composition-based stats.
Identities = 16/79 (20%), Positives = 31/79 (39%)

Query: 12  TVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVHYDSGKL 71
           T +D +SDF + R++ + E + V ++ + ++ + DS
Sbjct: 61  TASDPYSDFEKVTGRIDKNVSPEARHPLVAAYPIVHVDMENIILSKNEDQSTQNTDSQTR 120

Query: 72  TISGEVVNERKNESTEGNQ 90
           TIS R + S G+
Sbjct: 121 TISKNTSTSRTHTSEPGSN 139

>3TEX-A nrPDB
          Length = 715

Score = 25.6 bits (54), Expect = 7.4, Method: Composition-based stats.
Identities = 16/79 (20%), Positives = 31/79 (39%)

Query: 12  TVNDLFSDFVSYSPRLNNQIPGELSPSIDVHEGKDTVSVDVELPGVKKEDVQVHYDSGKL 71
           T +D +SDF + R++ + E + V ++ + ++ + DS
Sbjct: 228 TASDPYSDFEKVTGRIDKNVSPEARHPLVAAYPIVHVDMENIILSKNEDQSTQNTDSQTR 287

Query: 72  TISGEVVNERKNESTEGNQ 90
           TIS R + S G+
Sbjct: 288 TISKNTSTSRTHTSEPGSN 306

  Database: ../databases/nrPDB-GO_2019.06.18_sequences.fasta
    Posted date: Jan 3, 2023 9:05 PM
  Number of letters in database: 10,336,785
  Number of sequences in database: 36,641

Lambda     K      H
   0.302    0.118    0.275

Lambda     K      H
   0.267   0.0359    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,233,804
Number of Sequences: 36641
Number of extensions: 146185
Number of successful extensions: 318
Number of sequences better than 10.0: 11
Number of HSP's better than 10.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 298
Number of HSP's gapped (non-prelim): 25
length of query: 143
length of database: 10,336,785
effective HSP length: 88
effective length of query: 55
effective length of database: 7,112,377
effective search space: 391180735
effective search space used: 391180735
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.4 bits)
S2: 54 (25.6 bits)