Gene EcSMS35_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3921 
Symbol 
ID6145561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3993788 
End bp3997597 
Gene Length3810 bp 
Protein Length1269 aa 
Translation table11 
GC content45% 
IMG OID641618747 
Productouter membrane autotransporter 
Protein accessionYP_001745886 
Protein GI170682879 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.728778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTTG TATATACTCT AAAGTTAAAT AAAAAAAGAG AGTTGGTTGT AGTATCTGAA 
CTTTCTGGTG GGGTTAAAAA ATCAGCTCGA AATAAGCTAT TAAAATCTGT TGTTGTTTTA
ATGACTACTT TAGCTACTCA ACTCTATTCA CCATTAATTC AGGCGTCAAT TGTGGGAATG
GATATCCCAT ATCAGACCTA TCGTGATTTC GCTGAAAATA AAGGAGCGTT CTCAGTCGGA
GCGCTGGATA TTCCTTTGTA TAAGAAAGAC GGGACGTTGT ACTCCACACT GAATAAGGCG
CCAATGATAG ATTTTAGCGC TGTTGATAGT GGACAGACAG TAGCTACGTT AATTTCGCCA
CAATATATTG TAAGCGTAAA GCATAATACT GGCTATAAGA ATGTTCGGTT TGGCTACAGA
GATGATAGTT CTTATATTCT TGTCGATCGT AATAATAGTT CTGTAGATTT CCATACTCCG
CGTTTAAATA AAATTGTAAC CGAAGTGGTT CCGGCTGATA TAACCGATGC CGGTACTGCC
AACGGAACTT ATCAAAATCA AGACCGTTTT CCGATCTTTT ACCGTGTTGG TACTGGTACG
CAATATGTGA AAGACCGTAA TGGGAAATTA ACTCAACTTG CTGGCGGTTA CGCATATAGA
ACGGGCGGGA CCGTTGGTAA ACCAACATCA TCAAACAAGA GAATAGTGTC CAACCCTGGT
AATACCTATT CTGCGGCTAA CGGCCCTATG CCTTCCTATG GAATCCCCGG TGATAGTGGC
TCTCCCCTTT TTGCCTGGGA TACGCAACGT AATAAATGGG TGTTGGTTGC AGTACTCAAT
TCCTATGCGG GTAATGCAGG AAAAACCAAC TGGTTTACCG TTATTCCAGT GAATGAAGTG
AGTGCCAATA TTGAGGCTGA TACTGACGCG CCAGTGACGC CAACGAGTAC AACCGAAAAT
ATTAACTGGA CTTACGATAT TTCCACGGGT ACCGGTAAGC TGACCCAGGG AACGGATGCC
TGGGAGATGC ATGGTCGTGA CACTGGAAGT AGTGCTGTTT CATTTAATCA TGGAAAGGAT
TTGTCCTTCG AAAACACCGG CACAGTGGTA TTAAAGGATA TTGTCAATCA GGGAGCGGGT
ACGCTCACAT TTAACGGTGA TTACATAGTA AAACCGGACG CTGATCAGAC TTGGGTAGGG
GGCGGTATTA TTGTCAATGG TGATCATACC GTGAACTGGC AAGTTAACGG TGTTAAGGGC
GATAGTATGC ATAAGCTAGG TACCGGGACG TTAAACATTT CCGGCACTGG CATCAATCCC
GGAACGCTCA GTGTGGGTGA TGGAACAGTT GTGCTCGCCC AAAAACCAGA TAGCAACGGT
CAGGTTCAGG CGTTCGAGTC CGCCAGTATT GTTAGTGGTA GGCCAACTCT GGTACTGAGC
GATAGTCAGC AAATGAACCC GGATAATATC AAATGGGGGT ATCGTGGCGG CAAACTCGAT
ATTAACGGTA ATGATTTAAC CTTTCATGCG TTAAATGCAG CTGATGAAGG AGCGATATTA
ACCAATAGCG GTTCGCTTGC GACAACTAGC CTGGATTTTA ATTCAACAGA CACCACGAAG
CCGGTGACGA CGATGTTCCA TGGTTTTTTC ACTGGCAATG TGAATGTTAA AAATAACGCA
ACATCCAATG TGAATAATAC ATTCGTGGTT GATGGCGGAA TTAATACGCC AGCGGGTAGT
ATGACGCAGC AGGGGGGGCG CCTCTTTTTT CAGGGACATC CGGTTATCCA TGCTGTGAGC
ACTCAGTCAG TCGCCAATAA ATTAAAGGCG CTGGGCGATG ATTCCGTGCT CACCCAACCT
GTCTCTTTCA CGCAAAGTGA CTGGCAAACA CGGCAATTTA ATCTGAAGTC GCTGGATCTC
AACAATGCGG CATTTTACCT TGCTCGTAAT GCAGGATTAA TCACAACAAT CAATGCCAAT
AATTCCACTG TTACTTTGGG TAGTGAAGAT CTTTATATAG ATACCAATGA TGGTAATGGC
GTAAAAACGA CACCTGTGGA AGGTCAATCA GTTGCGACTG CATCAGAAGA TCAAAGCCAC
TTTACGGGGA ACGTTAATCT GACAAATGGC TCAGCCCTGC GGGTCAATGA GAACTTCAGT
GGTGGAATTA TCAGTAGCAA TAGTAGTGTA ACTATCTCTT CGACTAACGC GAATCTGACC
GAGAGCAGCA TGTTTACCCA CTCTGTAATC AAACTCTCTG ATAATGCTCA ACTGACAAGC
ACTGCCGGTT TGCAATCGGA TGGTACTATC GAGTTCGGAA ATGGGGCAAA ACTCTCATTG
TTAGGGGAAT CATCGTCAAC CTTTACGCCT TTTTCAGCAA CGGCCTGGAA TCTCAAGGGG
ACCGGATCCT CGCTGAATAT TGGTTCTGGC ACCAACGTCA ATGGCGACAT TAATGCATGG
TCTGATACCA ACATTAACTT TGGGAATAGC GGAAAACAAA GTACGTCATC AGGCATTTTG
TATACAGGCG ATATCTACGC ACCAGAAGCT AATGTCAGCA TTGATAACAC TTCATGGACA
TTGAACAAAA CCTCATTGCT GGGCAATCTG ACACTCAAAA ACAGTCAGCT TATTATGTCT
ACAGATGGCA AGACCAGCAG CGGCATCAAG GTTGTTGATA CGTTTAGTGG AGAAAACAAT
ATTCTGTATG TGAAACCGAC CCGGTCGCTT AGTGAAATGT CGGTCAGCAA TATTCCGCTT
ATTACCGCCA AAAACGTAAC CAACAATACC CGGGTATTTA AAACGGTGAC CCAACAAACA
GGCTTTCACT CAATGACTCC AAAGATTGAG GTGGTTAATG TCGATGGGAC TACACAGTGG
CGTCTGAAAG GATTCGATGT TCAGAGTGAT AGTACAGCGC TTAAAGAAGG TCAGCGTTTG
ATGAACACTA ATATTAAAAA CTTTCTGACT GAGGTTAATA ACTTAAACCG TCGTATGGGT
GACCTGCGCG ATACAAAAGG TGAAACCGGC GCATGGGCTC GGTTGATGAA CTCTTCTGGC
TCAGGTTATG GCGGTTTTTC CGATCGCCAC GTACATCTGC AGGTTGGTGC TGACAGAAAG
CATCATTTTG AGGGGGGAGA TCTTTTTACA GGCGTCATGA TGACCGTTAC CGATAGTAAA
GCCAGCGCTG AAAGTTATCA GGGGAAAACC CGTTCCGTAG GTGGTGGACT GTACGCATCG
ACATTGTTTG ATTCAGGCGT GTATGTTGAC GTAATCGGTA AATATGTCCA TCACAGCAAC
GACTATTTAC TGCAAACGAT GGGATTAAAG GCAGACGATA CTGCTCACTC ATGGTATTTA
GGCGCAGAAA CGGGTTGGCG CTATCAGTGG AAGCCTGATG TCTTTATTGA ACCGCAGGCC
GAACTGGTTT ACGGGACGTT GTCAGGCAAT ACCTTTAACT GGCAATACAA CGGTATGGAT
GTCAGCATGG AGCGTAAAAA GGCAAAACCC TTGATTGGCC GTACTGGTGT GGAGTTTGGA
AAAACACTGG ATGGCCGCGA CTGGCAGGTT ACAGCGAAGG CTGGTTTAAG TTATCAATTT
GACCTGCGTA ATACTGGTGC TACCACGTAC CGTGATTTTG CTGGTGAGTC CACCGTGTAC
AACGGTAAAG ATGGTCGTAT GTTAGCCAAT ATTGGGATTG ATACACGCAT TAAAGACAAT
ACCCGTATCG GCCTGACCGT TGAGAAATCA GCATTTGGTA AGTACAACGT CGATAATGCG
ATCAATGCTA ATATCCGCTA TACATTCTAA
 
Protein sequence
MNFVYTLKLN KKRELVVVSE LSGGVKKSAR NKLLKSVVVL MTTLATQLYS PLIQASIVGM 
DIPYQTYRDF AENKGAFSVG ALDIPLYKKD GTLYSTLNKA PMIDFSAVDS GQTVATLISP
QYIVSVKHNT GYKNVRFGYR DDSSYILVDR NNSSVDFHTP RLNKIVTEVV PADITDAGTA
NGTYQNQDRF PIFYRVGTGT QYVKDRNGKL TQLAGGYAYR TGGTVGKPTS SNKRIVSNPG
NTYSAANGPM PSYGIPGDSG SPLFAWDTQR NKWVLVAVLN SYAGNAGKTN WFTVIPVNEV
SANIEADTDA PVTPTSTTEN INWTYDISTG TGKLTQGTDA WEMHGRDTGS SAVSFNHGKD
LSFENTGTVV LKDIVNQGAG TLTFNGDYIV KPDADQTWVG GGIIVNGDHT VNWQVNGVKG
DSMHKLGTGT LNISGTGINP GTLSVGDGTV VLAQKPDSNG QVQAFESASI VSGRPTLVLS
DSQQMNPDNI KWGYRGGKLD INGNDLTFHA LNAADEGAIL TNSGSLATTS LDFNSTDTTK
PVTTMFHGFF TGNVNVKNNA TSNVNNTFVV DGGINTPAGS MTQQGGRLFF QGHPVIHAVS
TQSVANKLKA LGDDSVLTQP VSFTQSDWQT RQFNLKSLDL NNAAFYLARN AGLITTINAN
NSTVTLGSED LYIDTNDGNG VKTTPVEGQS VATASEDQSH FTGNVNLTNG SALRVNENFS
GGIISSNSSV TISSTNANLT ESSMFTHSVI KLSDNAQLTS TAGLQSDGTI EFGNGAKLSL
LGESSSTFTP FSATAWNLKG TGSSLNIGSG TNVNGDINAW SDTNINFGNS GKQSTSSGIL
YTGDIYAPEA NVSIDNTSWT LNKTSLLGNL TLKNSQLIMS TDGKTSSGIK VVDTFSGENN
ILYVKPTRSL SEMSVSNIPL ITAKNVTNNT RVFKTVTQQT GFHSMTPKIE VVNVDGTTQW
RLKGFDVQSD STALKEGQRL MNTNIKNFLT EVNNLNRRMG DLRDTKGETG AWARLMNSSG
SGYGGFSDRH VHLQVGADRK HHFEGGDLFT GVMMTVTDSK ASAESYQGKT RSVGGGLYAS
TLFDSGVYVD VIGKYVHHSN DYLLQTMGLK ADDTAHSWYL GAETGWRYQW KPDVFIEPQA
ELVYGTLSGN TFNWQYNGMD VSMERKKAKP LIGRTGVEFG KTLDGRDWQV TAKAGLSYQF
DLRNTGATTY RDFAGESTVY NGKDGRMLAN IGIDTRIKDN TRIGLTVEKS AFGKYNVDNA
INANIRYTF