Gene EcSMS35_2773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2773 
Symbol 
ID6143753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2851680 
End bp2856377 
Gene Length4698 bp 
Protein Length1565 aa 
Translation table11 
GC content50% 
IMG OID641617643 
Producthypothetical protein 
Protein accessionYP_001744804 
Protein GI170679851 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.426931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACAA TACACTTGCG CTGTCTCTTC AGGATGAATC CCCTGGTCTG GTGCCTGTGG 
GCTGATGTTG CAGCAAAGCT AAGGTCGCTT AAACGCTACT CAGTATTCAC TTTTCAGAGG
ATGAAATTTA TGAACAGGAC CAGTCTCCAT TATTGTCGCC GCTCAGTACT TTCCTTATGG
ATATCTGCCT TGATATATGC CCCGCCCGGG ATGGCGGCCT TCACTCCTGA TGTTATTGGT
GTGGTAAACG ATGAGACTGT AGATGGCAGC CAAAAAGTAG ATGAACGAGG TACAACAAAT
AACACTCATA TTATCAACCA TGGCCAGCAG AATGTTTATG GTGGGATATC TAATGAAAGT
CTTATTGAAT CTGGTGGATA TCAAGATATA GGACGTCATA ACAATTATGT GGGGCAGGCT
AATAATACAA CCATTAACGG GGGCAGACAG ACAATTCATG ACGGGGGTAT TTCCACAGGT
ACAATAATCG AGAGTGGCAA TCAGGACGTT TATACAGGTG GTATCAGCAA TGGAACGACA
ATTAAGGGGG GTAATTCACA CATAAGTGGG GGGACTGCGA ATGGAACAAT CATTGATGGT
GGCAGCCAAC GAGTAACAAC TCAGGGGCAT GTCGACAGTA CAACGATAAA TAAGTCTGGC
TCTCAGGACG TAGTACAAGG AAGTCTGGCA ACGAACACAA CCATAAATGG TGGTCGACAG
TATGTTGAAC AGAGCACAGT AGAAACAACC ACCATCAAAA ATGGCGGTGA GCAAAGAGTA
TATGAGAGCC GTGCGCTGGA CACTACGATT GAAGGCGGAA CTCAGTCTCT GAATAGTAAT
TCAACGGCAA AAAATACGCA GATCTATTCT GGTGGTACGC AAATTGTTGA TTACACCAGC
ACCTCGGATG TTATTGAAAT TTATTCCGGT GGCGTGCTTG ATGTTAGAGG TGGTATGGCA
ACAAATGTTA CTCAGCACGA TGGAGCCGCT TTAAAAGTAA CGACTTACGA TTTGACGGTG
AGCGGTACGA ATAGTGAAGG GGCATTCTCC ATCCACAATA ACGTGGCAGA CAATGTGTTG
CTGGAAAACG GTGGTCATTT AGACGTATAT GGTTCGGCAA CCAGGACGAT AATTAAAGAT
AAAGGAACAA TGTCAGTTTT AACGAATGCT AAAGCTGATG CGACCCGAAT AGATAATGGC
GGGGTTATGG ATGTCGCCGG AAACGCGACC AATACCATAA TCAATGGTGG TACACAGAAT
ATTAATAATC ATGGTATTGC CACTGGCACC AATATCAACA GTGGAACGCA AAATATCAAG
AGCGGCGGGA AAGCTGACAC GACAAATATA TCCTCCGGGA GCCGGCAGAT TGTTGAGAAA
GATGGTACGG CAACTGGCAG CAATATTAGC GCTGGAGGCT CGCTGATTGT CTATACCGGC
GGTATTGCAC ATGGGGTTAA CCAGGAGACG GGCAGTGCTT TAGTTGCCAA CACGGGCGCA
GGGACTGATA TCGAAGGATA CAACAAGCTC TCTCGCTTCA CTATTATCGG AGGAGAGGCT
AATTATGTTG TGCTGGAAAA TACCGGCGAA CTGACGGTAG TGGCTAAAAC CTCGGCGAAA
AATACTACCA TTGATGCTGG CGGTAAGCTG ATTGTCCAGA AGGAGGCTAA AACAGATAGC
ACCAGACTTA ATAATGGCGG CGTTCTGGAG GTTCAGGACG GTGGTGAGGC TAAGCATGTT
GAGCAACAAT CCGGCGGCGC ATTAATTGCT TCCACGACCT CCGGAACACT TATCGAAGGA
ACCAACAGTT ATGGTGATGC TTTCTACATC AGGAATTCAG AAGCTAAAAA TGTAGTGCTG
GAAAACGCTG GCTCATTAAC AGTCGTCACT GGTTCCCGGG CAGTTGACAC GATTATTAAT
GCCAACGGCA AAATGGATGT TTATGGAAAA GATGTTGGTA CTGTACTTAA TAGTGCGGGT
ACCCAAACAA TATATGCCAG TGCCACTTCT GAAAAAGCAA ATATCAAAGG TGGCAAGCAA
ACGGTATATG GTTTAGCCAC TGAGGCAAAT ATCGAAAGTG GTGAACAAAT TGTTGATGGT
GGGTCAACAG ATAAAACGCA CATCAAAGGC GGCACGCAAA CCGTTCAGAA TTATGGTAAG
GCGATCAATA CCGATATCGT CTCTGGCCTA CAACAAATTA TGGCAAACGG GACAGCGGAA
GGTTCCATTA TTAATGGCGG TTCACAGGTA GTTAATGAGG GCGGTCTGGC TGAAAACTCG
GTACTTAATG ACGGCGGCAC ACTCGATGTG CGGGAGAAAG GCAGCGCTAC GGGGATACAG
CAGAGTAGCC AGGGCGCGTT GGTGGCAACC ACCAGAGCGA CGCGGGTCAC AGGGACACGC
GCTGATGGCG TCGCGTTCAG CATCGAGCAG GGTGCGGCGA ACAATATCCT GCTGACAAAT
GGCGGCGTGT TAACCGTGGA GTCAGACACC TCTTCCGCCA AAACACAGGT CAATGCGGGC
GGGCGGGAGA TCGTCAAAAC AAAAGCCACT GCGACAGGCA CGACGCTCAC CGGCGGCGAA
CAAATCGTCG AGGGTGTGGC GAATGAGACA ACGATTAACG ACGGCGGAAT ACAAACAGTT
TCGGCTAACG GTGAGGCAGT AAAAACAACG ATTAATGAAG GCGGTACGCT GACAGTCAAC
GATAATGGCA AAGCGACAGA TATCATCCAG AACAGCGGTG CCGCTCTCCA GACGAGCACG
GCTAACGGTA TTGAAATCAG CGGTACTCAC CAGTACGGCA CCTTTTCCAT TGCCGGAAAT
TTAGCGACCA ATGCGTTGCT GGAAAATGGC GGTAATTTAT TGGTATTAGC AGGTACCGAA
GCCCGTGACT CCACGGTTGG CAAGGGTGGT GCAATGCAAA ACCTGGGTCA GGACTCCGCC
ACAAAGGTTA ACTCTGGCGG GCAATATACC CTTGGGCGGT CAAAAGATGA GTTTCAGGCT
CTGGCCCGGG CAGAAGATCT CCAGGTCGCT GGCGGGACGG CAATCGTCTA TGCAGGTACG
CTGGCGGATG CATCGGTCAG TGGCGCGACA GGAAGCCTGT CGTTAATGAC GCCACGGGAT
AATGTTACGC CAGTTAAACT CGAAGGGGCG ATCCGGATTA CCGATAGCGC GACATTAACT
ATCGGAAATG GCGTCGATAC GACGCTTACC GACCTGACGG CTGCCAGCCG GGGCAGTGTC
TGGCTTAACA GCAATAATTC CTGTGCAGGT ACCAGCAACT GCGAATATAG AGTAAACAGT
TTGCTCCTCA ACGACGGTGA TGTTTATTTA TCAGCACAAA CAGCAGCGCC TGCCACAACT
AACGGTATCT ACAATACGCT GACAACCAGT GAACTTTCCG GCAGCGGTAA TTTCTACCTG
CATACCAACG TTGCAGGCTC CCGGGGCGAT CAACTGGTCG TCAACAACAA CGCCACTGGT
AATTTTAAAA TCTTTGTTCA GGATACCGGC GTCAGCCCTC AGTCTGACGA CGCGATGACG
CTGGTGAAAA CAGGGGGAGG GGATGCTTCG TTTACGCTGG GTAATACCGG CGGTTTCGTT
GATCTTGGGA CCTATGAGTA TGTCCTGAAA AGCGACGGCA ACAGCAACTG GAACCTGACC
AATGATGTCA AACCCAACCC GGACCCCAAC CCAAATCCGA AGCCGGATCC AAAACCAGAC
CCAAAACCGG ATCCGAAACC AGACCCGACT CCCGATCCAA CGCCGACACC CGTCCCGGAG
AAACGCATCA CGCCTTCTAC GGCAGCCGTA CTCAATATGG CAGCAACATT ACCGTTGGTC
TTTGATGCTG AGCTAAACAG TATTCGCGAG CGGTTGAACA TAATGAAAGC GAGTCCACAC
AACAATAATG TCTGGGGGAC GACGTATAAC ACCCGTAATA ATGTCACCAC CGATGCGGGT
GCCGGGTTTG AGCAGACGCT GACCGGAATG ACGGTGGGGA TCGATAGCCG TAATGATATT
CCTGAGGGGA TTGCGACGCT GGGCGCTTTT ATGGGTTATT CCCATTCACA TATCGGTTTT
GATCGTGGAG GACATGGCAG TGTGGGCAGT TATTCTCTGG GGGGCTATGC CAGCTGGGAA
CATGAAAGTG GTTTCTATCT GGACGGTATC GTGAAGCTGA ACCGCTTTGA AAGTAACGTA
GCCGGTAAAA TGAGCAGCGG TGGAGCCGCC AACGGCAGTT ATCGCAGCAA CGGGCTGGGC
GGTCACATTG AAACCGGGAT GCGATTTACC GATGGTAACT GGAACCTGAC GCCGTATGCA
TCGTTAACGG GGTTCACCGC TGATAACCCC GAATATCATT TATCCAATGG CATGGAATCG
AAATCAGTCG ATACCCGCAG TATATATCGT GAACTGGGCG CAACGCTGAG TTACAACATG
CGTCTGGGGA ACGGCATGGA AGTTGAGCCG TGGCTGAAGG CGGCTGTGCG CAAAGAATTT
GTCGATGATA ACCGGGTGAA AGTGAATAAT GACGGTAATT TTGTCAATGA TTTGTCGGGC
AGACGTGGAA TATACCAGGC AGGTATTAAA GCCTCATTCA GCAGTTCGTT AAGCGGGAAT
CTCGGGGTGG GGTATAGCCA TGGTGCCGGT GTGGAATCCC CGTGGAACGC AGTGGCTGGT
GTGAACTGGT CGTTCTGA
 
Protein sequence
MNTIHLRCLF RMNPLVWCLW ADVAAKLRSL KRYSVFTFQR MKFMNRTSLH YCRRSVLSLW 
ISALIYAPPG MAAFTPDVIG VVNDETVDGS QKVDERGTTN NTHIINHGQQ NVYGGISNES
LIESGGYQDI GRHNNYVGQA NNTTINGGRQ TIHDGGISTG TIIESGNQDV YTGGISNGTT
IKGGNSHISG GTANGTIIDG GSQRVTTQGH VDSTTINKSG SQDVVQGSLA TNTTINGGRQ
YVEQSTVETT TIKNGGEQRV YESRALDTTI EGGTQSLNSN STAKNTQIYS GGTQIVDYTS
TSDVIEIYSG GVLDVRGGMA TNVTQHDGAA LKVTTYDLTV SGTNSEGAFS IHNNVADNVL
LENGGHLDVY GSATRTIIKD KGTMSVLTNA KADATRIDNG GVMDVAGNAT NTIINGGTQN
INNHGIATGT NINSGTQNIK SGGKADTTNI SSGSRQIVEK DGTATGSNIS AGGSLIVYTG
GIAHGVNQET GSALVANTGA GTDIEGYNKL SRFTIIGGEA NYVVLENTGE LTVVAKTSAK
NTTIDAGGKL IVQKEAKTDS TRLNNGGVLE VQDGGEAKHV EQQSGGALIA STTSGTLIEG
TNSYGDAFYI RNSEAKNVVL ENAGSLTVVT GSRAVDTIIN ANGKMDVYGK DVGTVLNSAG
TQTIYASATS EKANIKGGKQ TVYGLATEAN IESGEQIVDG GSTDKTHIKG GTQTVQNYGK
AINTDIVSGL QQIMANGTAE GSIINGGSQV VNEGGLAENS VLNDGGTLDV REKGSATGIQ
QSSQGALVAT TRATRVTGTR ADGVAFSIEQ GAANNILLTN GGVLTVESDT SSAKTQVNAG
GREIVKTKAT ATGTTLTGGE QIVEGVANET TINDGGIQTV SANGEAVKTT INEGGTLTVN
DNGKATDIIQ NSGAALQTST ANGIEISGTH QYGTFSIAGN LATNALLENG GNLLVLAGTE
ARDSTVGKGG AMQNLGQDSA TKVNSGGQYT LGRSKDEFQA LARAEDLQVA GGTAIVYAGT
LADASVSGAT GSLSLMTPRD NVTPVKLEGA IRITDSATLT IGNGVDTTLT DLTAASRGSV
WLNSNNSCAG TSNCEYRVNS LLLNDGDVYL SAQTAAPATT NGIYNTLTTS ELSGSGNFYL
HTNVAGSRGD QLVVNNNATG NFKIFVQDTG VSPQSDDAMT LVKTGGGDAS FTLGNTGGFV
DLGTYEYVLK SDGNSNWNLT NDVKPNPDPN PNPKPDPKPD PKPDPKPDPT PDPTPTPVPE
KRITPSTAAV LNMAATLPLV FDAELNSIRE RLNIMKASPH NNNVWGTTYN TRNNVTTDAG
AGFEQTLTGM TVGIDSRNDI PEGIATLGAF MGYSHSHIGF DRGGHGSVGS YSLGGYASWE
HESGFYLDGI VKLNRFESNV AGKMSSGGAA NGSYRSNGLG GHIETGMRFT DGNWNLTPYA
SLTGFTADNP EYHLSNGMES KSVDTRSIYR ELGATLSYNM RLGNGMEVEP WLKAAVRKEF
VDDNRVKVNN DGNFVNDLSG RRGIYQAGIK ASFSSSLSGN LGVGYSHGAG VESPWNAVAG
VNWSF