Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0402 |
Symbol | |
ID | 6146526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 414664 |
End bp | 416808 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641615298 |
Product | outer membrane autotransporter |
Protein accession | YP_001742505 |
Protein GI | 170683055 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGGAAA GTTCTGGTTA TGGTCATTTC GGTAACTCTG GTGAGCCGAG TGATTATGCT GGTCCGGGTG ACGTTGCATT GTCTTTCACT GACAGTACTT CAGACTATGC AATGAAGAAC AATGTATATT TCAGCAATTC TACTCTGGTG GGTGATGTTG CGTTTACCAG CACCTGGAAT GCTAATTTTG ATCCAAGTGG TCATGATTCT AACGGTGACG GCGTAAAAGA TACCAATGCT GGTTGGGTTG ATGATAGCCT GAACGTTGAT GAGCTGAATA TTACTCTTGA TAATGGAAGC AAGTGGGTTG GACAGGCTAC ATTCAATGCT GAAACCATTT CTCCAGACAC AATGTATGAT GTTGCAACTA ACAGTTTGAC CCCTGGTGGA ACAGCTGAAG CTAATGGTTG GAATCGCATC ATCGATAATA AAGTATTCCA GAGTGGTGTA TTTAACGTAG CGTTGAACAA CGGTTCTGAA TGGGATACTA CAGGTCGTTC CGTCGTTGAT ACCTTGACAG TTAATAATGC TTCTCAGGTT AATGTTTCGG AATCTAAATT AACTTCAGAT ACTATCGATT TAACTAACGG TTCTTCGCTG AACATTGGTG AAGATGGTTA CGTTGATACC GATCATCTGA CTATTAACTC CTACAGTACT GTTGCGTTGA CCGAATCTAC AGGTTGGGGT GCTGATTACA ACCTGTACGC CAATACCATC ACCGTAACTA ACGGTGGTGT ATTGGATGTG AACGTTGATC AGTTCGATAC TGAAGCTTTC CGTACTGACA AACTGGAACT GACCAGCGGC AACATTGCTG ACAATAATGG TAATGTAGTA TCCGGTGTAT TCGATATCCA CAGCAGCGAT TATGTTCTGA ACGCTGACTT GGTGAACGAC CGTACCTGGG ATACTTCCAA GTCTAACTAC GGCTACGGTA TTGTTGCTAT GAACTCTGAT GGTCATCTGA CTATCAATGG TAACGGCGAC ATGAACAACG GCGACGAACT GGATAACAGC TCTGTAGACA ACGTTGTTGC TGCAACCGGT AACTATAAAG TTCGTATCGA CAACGCAACT GGCGCTGGCG CAATCGCTGA TTACAAAGAT AAAGAAATTA TCTACGTAAA CGACGTCAAC AGCAACGCGA CCTTCTCTGC TGCTAACAAA GCTGACCTGG GTGCATACAC CTATCAGGCT GAACAGCGTG GTAACACCGT TGTTCTGCAA CAGATGGAGC TGACCGACTA CGCTAACATG GCGCTGAGCA TCCCGTCTGC GAACACCAAT ATCTGGAACC TGGAACAAGA CACCGTTGGT ACTCGTCTGA CCAACTCTCG TCATGGCCTG GCTGATAACG GCGGCGCATG GGTAAGCTAC TTCGGTGGTA ACTTCAACGG CGACAACGGC ACCATCAACT ATGATCAGGA TGTTAACGGC ATCATGGTCG GTGTTGATAC CAAAATTGAC GGTAACAACG CTAAGTGGAT CGTCGGTGCG GCTGCAGGCT TCGCTAAAGG TGACATGAAT GACCGTTCTG GTCAGGTGGA TCAAGACAGC CAGACTGCCT ACATCTACTC TTCTGCTCAC TTCGCGAACA ACGTCTTTGT TGATGGTAGC TTGAGCTACT CTCACTTCAA CAACGACCTG TCTGCAACCA TGAGCAACGG TACTTATGTT GATGGTAGCA CCAACTCCGA CGCTTGGGGC TTCGGTTTGA AAGCCGGTTA CGACTTCAAA CTGGGTGATG CTGGTTACGT GACTCCTTAC GGCAGCGTTT CTGGTCTGTT CCAGTCTGGT GATGACTACC AGCTGAGCAA CAACATGAAA GTTGACGGTC AGTCTTACGA CAGCATGCGT TATGAACTGG GTGTAGATGC AGGTTATACC TTCACCTACA GCGAAGACCA GGCTCTGACT CCGTACTTCA AACTGGCTTA CGTTTACGAC GACTCTAACA ACCATAACGA TGTGAACGGT GATTCCATCG ATAACGGTAC TGAAGGGTCT GCGGTACGTG TTGGTCTGGG TACTCAGTTC AGCTTCACCA AGAACTTCAG CGCCTATACC GATGCTAACT ACCTCGGTGG TGGTGACGTA GATCAAGACT GGTCCGCGAA CGTGGGTGTT AAATATACCT GGTAA
|
Protein sequence | MLESSGYGHF GNSGEPSDYA GPGDVALSFT DSTSDYAMKN NVYFSNSTLV GDVAFTSTWN ANFDPSGHDS NGDGVKDTNA GWVDDSLNVD ELNITLDNGS KWVGQATFNA ETISPDTMYD VATNSLTPGG TAEANGWNRI IDNKVFQSGV FNVALNNGSE WDTTGRSVVD TLTVNNASQV NVSESKLTSD TIDLTNGSSL NIGEDGYVDT DHLTINSYST VALTESTGWG ADYNLYANTI TVTNGGVLDV NVDQFDTEAF RTDKLELTSG NIADNNGNVV SGVFDIHSSD YVLNADLVND RTWDTSKSNY GYGIVAMNSD GHLTINGNGD MNNGDELDNS SVDNVVAATG NYKVRIDNAT GAGAIADYKD KEIIYVNDVN SNATFSAANK ADLGAYTYQA EQRGNTVVLQ QMELTDYANM ALSIPSANTN IWNLEQDTVG TRLTNSRHGL ADNGGAWVSY FGGNFNGDNG TINYDQDVNG IMVGVDTKID GNNAKWIVGA AAGFAKGDMN DRSGQVDQDS QTAYIYSSAH FANNVFVDGS LSYSHFNNDL SATMSNGTYV DGSTNSDAWG FGLKAGYDFK LGDAGYVTPY GSVSGLFQSG DDYQLSNNMK VDGQSYDSMR YELGVDAGYT FTYSEDQALT PYFKLAYVYD DSNNHNDVNG DSIDNGTEGS AVRVGLGTQF SFTKNFSAYT DANYLGGGDV DQDWSANVGV KYTW
|
| |