Gene Information |
Locus tag | EcSMS35_2773 |
Symbol | |
ID | 6143753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2851680 |
End bp | 2856377 |
Gene Length | 4698 bp |
Protein Length | 1565 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641617643 |
Product | hypothetical protein |
Protein accession | YP_001744804 |
Protein GI | 170679851 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
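The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2851680
end_bp = 2856377


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 4698 1565
```

The strand field does not affect this calculation: a gene on the minus strand still spans the same interval on the replicon.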
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.426931 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACACAA TACACTTGCG CTGTCTCTTC AGGATGAATC CCCTGGTCTG GTGCCTGTGG GCTGATGTTG CAGCAAAGCT AAGGTCGCTT AAACGCTACT CAGTATTCAC TTTTCAGAGG ATGAAATTTA TGAACAGGAC CAGTCTCCAT TATTGTCGCC GCTCAGTACT TTCCTTATGG ATATCTGCCT TGATATATGC CCCGCCCGGG ATGGCGGCCT TCACTCCTGA TGTTATTGGT GTGGTAAACG ATGAGACTGT AGATGGCAGC CAAAAAGTAG ATGAACGAGG TACAACAAAT AACACTCATA TTATCAACCA TGGCCAGCAG AATGTTTATG GTGGGATATC TAATGAAAGT CTTATTGAAT CTGGTGGATA TCAAGATATA GGACGTCATA ACAATTATGT GGGGCAGGCT AATAATACAA CCATTAACGG GGGCAGACAG ACAATTCATG ACGGGGGTAT TTCCACAGGT ACAATAATCG AGAGTGGCAA TCAGGACGTT TATACAGGTG GTATCAGCAA TGGAACGACA ATTAAGGGGG GTAATTCACA CATAAGTGGG GGGACTGCGA ATGGAACAAT CATTGATGGT GGCAGCCAAC GAGTAACAAC TCAGGGGCAT GTCGACAGTA CAACGATAAA TAAGTCTGGC TCTCAGGACG TAGTACAAGG AAGTCTGGCA ACGAACACAA CCATAAATGG TGGTCGACAG TATGTTGAAC AGAGCACAGT AGAAACAACC ACCATCAAAA ATGGCGGTGA GCAAAGAGTA TATGAGAGCC GTGCGCTGGA CACTACGATT GAAGGCGGAA CTCAGTCTCT GAATAGTAAT TCAACGGCAA AAAATACGCA GATCTATTCT GGTGGTACGC AAATTGTTGA TTACACCAGC ACCTCGGATG TTATTGAAAT TTATTCCGGT GGCGTGCTTG ATGTTAGAGG TGGTATGGCA ACAAATGTTA CTCAGCACGA TGGAGCCGCT TTAAAAGTAA CGACTTACGA TTTGACGGTG AGCGGTACGA ATAGTGAAGG GGCATTCTCC ATCCACAATA ACGTGGCAGA CAATGTGTTG CTGGAAAACG GTGGTCATTT AGACGTATAT GGTTCGGCAA CCAGGACGAT AATTAAAGAT AAAGGAACAA TGTCAGTTTT AACGAATGCT AAAGCTGATG CGACCCGAAT AGATAATGGC GGGGTTATGG ATGTCGCCGG AAACGCGACC AATACCATAA TCAATGGTGG TACACAGAAT ATTAATAATC ATGGTATTGC CACTGGCACC AATATCAACA GTGGAACGCA AAATATCAAG AGCGGCGGGA AAGCTGACAC GACAAATATA TCCTCCGGGA GCCGGCAGAT TGTTGAGAAA GATGGTACGG CAACTGGCAG CAATATTAGC GCTGGAGGCT CGCTGATTGT CTATACCGGC GGTATTGCAC ATGGGGTTAA CCAGGAGACG GGCAGTGCTT TAGTTGCCAA CACGGGCGCA GGGACTGATA TCGAAGGATA CAACAAGCTC TCTCGCTTCA CTATTATCGG AGGAGAGGCT AATTATGTTG TGCTGGAAAA TACCGGCGAA CTGACGGTAG TGGCTAAAAC CTCGGCGAAA AATACTACCA TTGATGCTGG CGGTAAGCTG ATTGTCCAGA AGGAGGCTAA AACAGATAGC ACCAGACTTA ATAATGGCGG CGTTCTGGAG GTTCAGGACG GTGGTGAGGC TAAGCATGTT GAGCAACAAT CCGGCGGCGC ATTAATTGCT TCCACGACCT CCGGAACACT TATCGAAGGA 
ACCAACAGTT ATGGTGATGC TTTCTACATC AGGAATTCAG AAGCTAAAAA TGTAGTGCTG GAAAACGCTG GCTCATTAAC AGTCGTCACT GGTTCCCGGG CAGTTGACAC GATTATTAAT GCCAACGGCA AAATGGATGT TTATGGAAAA GATGTTGGTA CTGTACTTAA TAGTGCGGGT ACCCAAACAA TATATGCCAG TGCCACTTCT GAAAAAGCAA ATATCAAAGG TGGCAAGCAA ACGGTATATG GTTTAGCCAC TGAGGCAAAT ATCGAAAGTG GTGAACAAAT TGTTGATGGT GGGTCAACAG ATAAAACGCA CATCAAAGGC GGCACGCAAA CCGTTCAGAA TTATGGTAAG GCGATCAATA CCGATATCGT CTCTGGCCTA CAACAAATTA TGGCAAACGG GACAGCGGAA GGTTCCATTA TTAATGGCGG TTCACAGGTA GTTAATGAGG GCGGTCTGGC TGAAAACTCG GTACTTAATG ACGGCGGCAC ACTCGATGTG CGGGAGAAAG GCAGCGCTAC GGGGATACAG CAGAGTAGCC AGGGCGCGTT GGTGGCAACC ACCAGAGCGA CGCGGGTCAC AGGGACACGC GCTGATGGCG TCGCGTTCAG CATCGAGCAG GGTGCGGCGA ACAATATCCT GCTGACAAAT GGCGGCGTGT TAACCGTGGA GTCAGACACC TCTTCCGCCA AAACACAGGT CAATGCGGGC GGGCGGGAGA TCGTCAAAAC AAAAGCCACT GCGACAGGCA CGACGCTCAC CGGCGGCGAA CAAATCGTCG AGGGTGTGGC GAATGAGACA ACGATTAACG ACGGCGGAAT ACAAACAGTT TCGGCTAACG GTGAGGCAGT AAAAACAACG ATTAATGAAG GCGGTACGCT GACAGTCAAC GATAATGGCA AAGCGACAGA TATCATCCAG AACAGCGGTG CCGCTCTCCA GACGAGCACG GCTAACGGTA TTGAAATCAG CGGTACTCAC CAGTACGGCA CCTTTTCCAT TGCCGGAAAT TTAGCGACCA ATGCGTTGCT GGAAAATGGC GGTAATTTAT TGGTATTAGC AGGTACCGAA GCCCGTGACT CCACGGTTGG CAAGGGTGGT GCAATGCAAA ACCTGGGTCA GGACTCCGCC ACAAAGGTTA ACTCTGGCGG GCAATATACC CTTGGGCGGT CAAAAGATGA GTTTCAGGCT CTGGCCCGGG CAGAAGATCT CCAGGTCGCT GGCGGGACGG CAATCGTCTA TGCAGGTACG CTGGCGGATG CATCGGTCAG TGGCGCGACA GGAAGCCTGT CGTTAATGAC GCCACGGGAT AATGTTACGC CAGTTAAACT CGAAGGGGCG ATCCGGATTA CCGATAGCGC GACATTAACT ATCGGAAATG GCGTCGATAC GACGCTTACC GACCTGACGG CTGCCAGCCG GGGCAGTGTC TGGCTTAACA GCAATAATTC CTGTGCAGGT ACCAGCAACT GCGAATATAG AGTAAACAGT TTGCTCCTCA ACGACGGTGA TGTTTATTTA TCAGCACAAA CAGCAGCGCC TGCCACAACT AACGGTATCT ACAATACGCT GACAACCAGT GAACTTTCCG GCAGCGGTAA TTTCTACCTG CATACCAACG TTGCAGGCTC CCGGGGCGAT CAACTGGTCG TCAACAACAA CGCCACTGGT AATTTTAAAA TCTTTGTTCA GGATACCGGC GTCAGCCCTC AGTCTGACGA CGCGATGACG CTGGTGAAAA CAGGGGGAGG GGATGCTTCG TTTACGCTGG GTAATACCGG CGGTTTCGTT GATCTTGGGA 
CCTATGAGTA TGTCCTGAAA AGCGACGGCA ACAGCAACTG GAACCTGACC AATGATGTCA AACCCAACCC GGACCCCAAC CCAAATCCGA AGCCGGATCC AAAACCAGAC CCAAAACCGG ATCCGAAACC AGACCCGACT CCCGATCCAA CGCCGACACC CGTCCCGGAG AAACGCATCA CGCCTTCTAC GGCAGCCGTA CTCAATATGG CAGCAACATT ACCGTTGGTC TTTGATGCTG AGCTAAACAG TATTCGCGAG CGGTTGAACA TAATGAAAGC GAGTCCACAC AACAATAATG TCTGGGGGAC GACGTATAAC ACCCGTAATA ATGTCACCAC CGATGCGGGT GCCGGGTTTG AGCAGACGCT GACCGGAATG ACGGTGGGGA TCGATAGCCG TAATGATATT CCTGAGGGGA TTGCGACGCT GGGCGCTTTT ATGGGTTATT CCCATTCACA TATCGGTTTT GATCGTGGAG GACATGGCAG TGTGGGCAGT TATTCTCTGG GGGGCTATGC CAGCTGGGAA CATGAAAGTG GTTTCTATCT GGACGGTATC GTGAAGCTGA ACCGCTTTGA AAGTAACGTA GCCGGTAAAA TGAGCAGCGG TGGAGCCGCC AACGGCAGTT ATCGCAGCAA CGGGCTGGGC GGTCACATTG AAACCGGGAT GCGATTTACC GATGGTAACT GGAACCTGAC GCCGTATGCA TCGTTAACGG GGTTCACCGC TGATAACCCC GAATATCATT TATCCAATGG CATGGAATCG AAATCAGTCG ATACCCGCAG TATATATCGT GAACTGGGCG CAACGCTGAG TTACAACATG CGTCTGGGGA ACGGCATGGA AGTTGAGCCG TGGCTGAAGG CGGCTGTGCG CAAAGAATTT GTCGATGATA ACCGGGTGAA AGTGAATAAT GACGGTAATT TTGTCAATGA TTTGTCGGGC AGACGTGGAA TATACCAGGC AGGTATTAAA GCCTCATTCA GCAGTTCGTT AAGCGGGAAT CTCGGGGTGG GGTATAGCCA TGGTGCCGGT GTGGAATCCC CGTGGAACGC AGTGGCTGGT GTGAACTGGT CGTTCTGA
|
Protein sequence | MNTIHLRCLF RMNPLVWCLW ADVAAKLRSL KRYSVFTFQR MKFMNRTSLH YCRRSVLSLW ISALIYAPPG MAAFTPDVIG VVNDETVDGS QKVDERGTTN NTHIINHGQQ NVYGGISNES LIESGGYQDI GRHNNYVGQA NNTTINGGRQ TIHDGGISTG TIIESGNQDV YTGGISNGTT IKGGNSHISG GTANGTIIDG GSQRVTTQGH VDSTTINKSG SQDVVQGSLA TNTTINGGRQ YVEQSTVETT TIKNGGEQRV YESRALDTTI EGGTQSLNSN STAKNTQIYS GGTQIVDYTS TSDVIEIYSG GVLDVRGGMA TNVTQHDGAA LKVTTYDLTV SGTNSEGAFS IHNNVADNVL LENGGHLDVY GSATRTIIKD KGTMSVLTNA KADATRIDNG GVMDVAGNAT NTIINGGTQN INNHGIATGT NINSGTQNIK SGGKADTTNI SSGSRQIVEK DGTATGSNIS AGGSLIVYTG GIAHGVNQET GSALVANTGA GTDIEGYNKL SRFTIIGGEA NYVVLENTGE LTVVAKTSAK NTTIDAGGKL IVQKEAKTDS TRLNNGGVLE VQDGGEAKHV EQQSGGALIA STTSGTLIEG TNSYGDAFYI RNSEAKNVVL ENAGSLTVVT GSRAVDTIIN ANGKMDVYGK DVGTVLNSAG TQTIYASATS EKANIKGGKQ TVYGLATEAN IESGEQIVDG GSTDKTHIKG GTQTVQNYGK AINTDIVSGL QQIMANGTAE GSIINGGSQV VNEGGLAENS VLNDGGTLDV REKGSATGIQ QSSQGALVAT TRATRVTGTR ADGVAFSIEQ GAANNILLTN GGVLTVESDT SSAKTQVNAG GREIVKTKAT ATGTTLTGGE QIVEGVANET TINDGGIQTV SANGEAVKTT INEGGTLTVN DNGKATDIIQ NSGAALQTST ANGIEISGTH QYGTFSIAGN LATNALLENG GNLLVLAGTE ARDSTVGKGG AMQNLGQDSA TKVNSGGQYT LGRSKDEFQA LARAEDLQVA GGTAIVYAGT LADASVSGAT GSLSLMTPRD NVTPVKLEGA IRITDSATLT IGNGVDTTLT DLTAASRGSV WLNSNNSCAG TSNCEYRVNS LLLNDGDVYL SAQTAAPATT NGIYNTLTTS ELSGSGNFYL HTNVAGSRGD QLVVNNNATG NFKIFVQDTG VSPQSDDAMT LVKTGGGDAS FTLGNTGGFV DLGTYEYVLK SDGNSNWNLT NDVKPNPDPN PNPKPDPKPD PKPDPKPDPT PDPTPTPVPE KRITPSTAAV LNMAATLPLV FDAELNSIRE RLNIMKASPH NNNVWGTTYN TRNNVTTDAG AGFEQTLTGM TVGIDSRNDI PEGIATLGAF MGYSHSHIGF DRGGHGSVGS YSLGGYASWE HESGFYLDGI VKLNRFESNV AGKMSSGGAA NGSYRSNGLG GHIETGMRFT DGNWNLTPYA SLTGFTADNP EYHLSNGMES KSVDTRSIYR ELGATLSYNM RLGNGMEVEP WLKAAVRKEF VDDNRVKVNN DGNFVNDLSG RRGIYQAGIK ASFSSSLSGN LGVGYSHGAG VESPWNAVAG VNWSF
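The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine when it initiates translation. The sketch below illustrates this on the first and last few codons of the sequence above. The function name `translate_cds` is hypothetical, and the start-codon set is the one listed for NCBI table 11 as far as I am aware; treat it as an assumption to verify against the NCBI genetic code tables.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumed start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    A recognised start codon in the first position is read as M.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAACACAA TACACTTGCG CTGTCTCTTC"))  # MNTIHLRCLF
# Last 12 bases, ending in the TGA stop codon.
print(translate_cds("TGGTCGTTCTGA"))  # WSF
```

The sequence in the record is already given in the coding orientation, so no reverse complement is needed even though the gene lies on the minus strand.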