Gene Information |
Locus tag | EcSMS35_3921 |
Symbol | |
ID | 6145561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3993788 |
End bp | 3997597 |
Gene Length | 3810 bp |
Protein Length | 1269 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641618747 |
Product | outer membrane autotransporter |
Protein accession | YP_001745886 |
Protein GI | 170682879 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
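The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon. The function names are hypothetical.

```python
# Consistency check for the coordinate fields of this record.
# Assumptions (not stated in the record itself): coordinates are
# 1-based and inclusive, and the gene length includes the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based interval."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of this length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3993788, 3997597)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (3810 bp) and Protein Length (1269 aa) fields.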
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.728778 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATTTTG TATATACTCT AAAGTTAAAT AAAAAAAGAG AGTTGGTTGT AGTATCTGAA CTTTCTGGTG GGGTTAAAAA ATCAGCTCGA AATAAGCTAT TAAAATCTGT TGTTGTTTTA ATGACTACTT TAGCTACTCA ACTCTATTCA CCATTAATTC AGGCGTCAAT TGTGGGAATG GATATCCCAT ATCAGACCTA TCGTGATTTC GCTGAAAATA AAGGAGCGTT CTCAGTCGGA GCGCTGGATA TTCCTTTGTA TAAGAAAGAC GGGACGTTGT ACTCCACACT GAATAAGGCG CCAATGATAG ATTTTAGCGC TGTTGATAGT GGACAGACAG TAGCTACGTT AATTTCGCCA CAATATATTG TAAGCGTAAA GCATAATACT GGCTATAAGA ATGTTCGGTT TGGCTACAGA GATGATAGTT CTTATATTCT TGTCGATCGT AATAATAGTT CTGTAGATTT CCATACTCCG CGTTTAAATA AAATTGTAAC CGAAGTGGTT CCGGCTGATA TAACCGATGC CGGTACTGCC AACGGAACTT ATCAAAATCA AGACCGTTTT CCGATCTTTT ACCGTGTTGG TACTGGTACG CAATATGTGA AAGACCGTAA TGGGAAATTA ACTCAACTTG CTGGCGGTTA CGCATATAGA ACGGGCGGGA CCGTTGGTAA ACCAACATCA TCAAACAAGA GAATAGTGTC CAACCCTGGT AATACCTATT CTGCGGCTAA CGGCCCTATG CCTTCCTATG GAATCCCCGG TGATAGTGGC TCTCCCCTTT TTGCCTGGGA TACGCAACGT AATAAATGGG TGTTGGTTGC AGTACTCAAT TCCTATGCGG GTAATGCAGG AAAAACCAAC TGGTTTACCG TTATTCCAGT GAATGAAGTG AGTGCCAATA TTGAGGCTGA TACTGACGCG CCAGTGACGC CAACGAGTAC AACCGAAAAT ATTAACTGGA CTTACGATAT TTCCACGGGT ACCGGTAAGC TGACCCAGGG AACGGATGCC TGGGAGATGC ATGGTCGTGA CACTGGAAGT AGTGCTGTTT CATTTAATCA TGGAAAGGAT TTGTCCTTCG AAAACACCGG CACAGTGGTA TTAAAGGATA TTGTCAATCA GGGAGCGGGT ACGCTCACAT TTAACGGTGA TTACATAGTA AAACCGGACG CTGATCAGAC TTGGGTAGGG GGCGGTATTA TTGTCAATGG TGATCATACC GTGAACTGGC AAGTTAACGG TGTTAAGGGC GATAGTATGC ATAAGCTAGG TACCGGGACG TTAAACATTT CCGGCACTGG CATCAATCCC GGAACGCTCA GTGTGGGTGA TGGAACAGTT GTGCTCGCCC AAAAACCAGA TAGCAACGGT CAGGTTCAGG CGTTCGAGTC CGCCAGTATT GTTAGTGGTA GGCCAACTCT GGTACTGAGC GATAGTCAGC AAATGAACCC GGATAATATC AAATGGGGGT ATCGTGGCGG CAAACTCGAT ATTAACGGTA ATGATTTAAC CTTTCATGCG TTAAATGCAG CTGATGAAGG AGCGATATTA ACCAATAGCG GTTCGCTTGC GACAACTAGC CTGGATTTTA ATTCAACAGA CACCACGAAG CCGGTGACGA CGATGTTCCA TGGTTTTTTC ACTGGCAATG TGAATGTTAA AAATAACGCA ACATCCAATG TGAATAATAC ATTCGTGGTT GATGGCGGAA TTAATACGCC AGCGGGTAGT ATGACGCAGC AGGGGGGGCG CCTCTTTTTT CAGGGACATC CGGTTATCCA TGCTGTGAGC 
ACTCAGTCAG TCGCCAATAA ATTAAAGGCG CTGGGCGATG ATTCCGTGCT CACCCAACCT GTCTCTTTCA CGCAAAGTGA CTGGCAAACA CGGCAATTTA ATCTGAAGTC GCTGGATCTC AACAATGCGG CATTTTACCT TGCTCGTAAT GCAGGATTAA TCACAACAAT CAATGCCAAT AATTCCACTG TTACTTTGGG TAGTGAAGAT CTTTATATAG ATACCAATGA TGGTAATGGC GTAAAAACGA CACCTGTGGA AGGTCAATCA GTTGCGACTG CATCAGAAGA TCAAAGCCAC TTTACGGGGA ACGTTAATCT GACAAATGGC TCAGCCCTGC GGGTCAATGA GAACTTCAGT GGTGGAATTA TCAGTAGCAA TAGTAGTGTA ACTATCTCTT CGACTAACGC GAATCTGACC GAGAGCAGCA TGTTTACCCA CTCTGTAATC AAACTCTCTG ATAATGCTCA ACTGACAAGC ACTGCCGGTT TGCAATCGGA TGGTACTATC GAGTTCGGAA ATGGGGCAAA ACTCTCATTG TTAGGGGAAT CATCGTCAAC CTTTACGCCT TTTTCAGCAA CGGCCTGGAA TCTCAAGGGG ACCGGATCCT CGCTGAATAT TGGTTCTGGC ACCAACGTCA ATGGCGACAT TAATGCATGG TCTGATACCA ACATTAACTT TGGGAATAGC GGAAAACAAA GTACGTCATC AGGCATTTTG TATACAGGCG ATATCTACGC ACCAGAAGCT AATGTCAGCA TTGATAACAC TTCATGGACA TTGAACAAAA CCTCATTGCT GGGCAATCTG ACACTCAAAA ACAGTCAGCT TATTATGTCT ACAGATGGCA AGACCAGCAG CGGCATCAAG GTTGTTGATA CGTTTAGTGG AGAAAACAAT ATTCTGTATG TGAAACCGAC CCGGTCGCTT AGTGAAATGT CGGTCAGCAA TATTCCGCTT ATTACCGCCA AAAACGTAAC CAACAATACC CGGGTATTTA AAACGGTGAC CCAACAAACA GGCTTTCACT CAATGACTCC AAAGATTGAG GTGGTTAATG TCGATGGGAC TACACAGTGG CGTCTGAAAG GATTCGATGT TCAGAGTGAT AGTACAGCGC TTAAAGAAGG TCAGCGTTTG ATGAACACTA ATATTAAAAA CTTTCTGACT GAGGTTAATA ACTTAAACCG TCGTATGGGT GACCTGCGCG ATACAAAAGG TGAAACCGGC GCATGGGCTC GGTTGATGAA CTCTTCTGGC TCAGGTTATG GCGGTTTTTC CGATCGCCAC GTACATCTGC AGGTTGGTGC TGACAGAAAG CATCATTTTG AGGGGGGAGA TCTTTTTACA GGCGTCATGA TGACCGTTAC CGATAGTAAA GCCAGCGCTG AAAGTTATCA GGGGAAAACC CGTTCCGTAG GTGGTGGACT GTACGCATCG ACATTGTTTG ATTCAGGCGT GTATGTTGAC GTAATCGGTA AATATGTCCA TCACAGCAAC GACTATTTAC TGCAAACGAT GGGATTAAAG GCAGACGATA CTGCTCACTC ATGGTATTTA GGCGCAGAAA CGGGTTGGCG CTATCAGTGG AAGCCTGATG TCTTTATTGA ACCGCAGGCC GAACTGGTTT ACGGGACGTT GTCAGGCAAT ACCTTTAACT GGCAATACAA CGGTATGGAT GTCAGCATGG AGCGTAAAAA GGCAAAACCC TTGATTGGCC GTACTGGTGT GGAGTTTGGA AAAACACTGG ATGGCCGCGA CTGGCAGGTT ACAGCGAAGG CTGGTTTAAG TTATCAATTT GACCTGCGTA 
ATACTGGTGC TACCACGTAC CGTGATTTTG CTGGTGAGTC CACCGTGTAC AACGGTAAAG ATGGTCGTAT GTTAGCCAAT ATTGGGATTG ATACACGCAT TAAAGACAAT ACCCGTATCG GCCTGACCGT TGAGAAATCA GCATTTGGTA AGTACAACGT CGATAATGCG ATCAATGCTA ATATCCGCTA TACATTCTAA
Protein sequence | MNFVYTLKLN KKRELVVVSE LSGGVKKSAR NKLLKSVVVL MTTLATQLYS PLIQASIVGM DIPYQTYRDF AENKGAFSVG ALDIPLYKKD GTLYSTLNKA PMIDFSAVDS GQTVATLISP QYIVSVKHNT GYKNVRFGYR DDSSYILVDR NNSSVDFHTP RLNKIVTEVV PADITDAGTA NGTYQNQDRF PIFYRVGTGT QYVKDRNGKL TQLAGGYAYR TGGTVGKPTS SNKRIVSNPG NTYSAANGPM PSYGIPGDSG SPLFAWDTQR NKWVLVAVLN SYAGNAGKTN WFTVIPVNEV SANIEADTDA PVTPTSTTEN INWTYDISTG TGKLTQGTDA WEMHGRDTGS SAVSFNHGKD LSFENTGTVV LKDIVNQGAG TLTFNGDYIV KPDADQTWVG GGIIVNGDHT VNWQVNGVKG DSMHKLGTGT LNISGTGINP GTLSVGDGTV VLAQKPDSNG QVQAFESASI VSGRPTLVLS DSQQMNPDNI KWGYRGGKLD INGNDLTFHA LNAADEGAIL TNSGSLATTS LDFNSTDTTK PVTTMFHGFF TGNVNVKNNA TSNVNNTFVV DGGINTPAGS MTQQGGRLFF QGHPVIHAVS TQSVANKLKA LGDDSVLTQP VSFTQSDWQT RQFNLKSLDL NNAAFYLARN AGLITTINAN NSTVTLGSED LYIDTNDGNG VKTTPVEGQS VATASEDQSH FTGNVNLTNG SALRVNENFS GGIISSNSSV TISSTNANLT ESSMFTHSVI KLSDNAQLTS TAGLQSDGTI EFGNGAKLSL LGESSSTFTP FSATAWNLKG TGSSLNIGSG TNVNGDINAW SDTNINFGNS GKQSTSSGIL YTGDIYAPEA NVSIDNTSWT LNKTSLLGNL TLKNSQLIMS TDGKTSSGIK VVDTFSGENN ILYVKPTRSL SEMSVSNIPL ITAKNVTNNT RVFKTVTQQT GFHSMTPKIE VVNVDGTTQW RLKGFDVQSD STALKEGQRL MNTNIKNFLT EVNNLNRRMG DLRDTKGETG AWARLMNSSG SGYGGFSDRH VHLQVGADRK HHFEGGDLFT GVMMTVTDSK ASAESYQGKT RSVGGGLYAS TLFDSGVYVD VIGKYVHHSN DYLLQTMGLK ADDTAHSWYL GAETGWRYQW KPDVFIEPQA ELVYGTLSGN TFNWQYNGMD VSMERKKAKP LIGRTGVEFG KTLDGRDWQV TAKAGLSYQF DLRNTGATTY RDFAGESTVY NGKDGRMLAN IGIDTRIKDN TRIGLTVEKS AFGKYNVDNA INANIRYTF
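The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The helper name `translate` is hypothetical, and only the first 30 and last 12 nucleotides of the gene sequence are used.

```python
# Spot-check: translate the first and last codons of the gene sequence.
# Codon table in NCBI ordering (bases T, C, A, G); the amino acid
# assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-nt blocks and the final 12 nt of the gene sequence.
first_30 = "ATGAATTTTG TATATACTCT AAAGTTAAAT"
last_12 = "TATACATTCTAA"

print(translate(first_30))
print(translate(last_12))
```

The first call reproduces the opening residues of the protein sequence, and the second ends in a stop codon, consistent with the sequence being shown in coding orientation even though the gene lies on the minus strand.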