Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3839 |
Symbol | |
ID | 6143599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3910238 |
End bp | 3913711 |
Gene Length | 3474 bp |
Protein Length | 1157 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641618665 |
Product | cellulose synthase subunit BcsC |
Protein accession | YP_001745805 |
Protein GI | 170680722 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.503197 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAT TCACACTAAA CATATTCACG CTTTCCCTCG GTCTGGCCGT CATGCCGATG GTCGAGGCAG CACCAACCGC TCAGCAACAG TTGCTGGAGC AAGTTCGGTT AGGCGAAGCG ACCCATCGTG AAGACCTGGT GCAACAGTCG TTGTATCGTC TGGAACTTAT TGATCCGAAT AACCCGGACG TCGTTGCCGC CCGTTTCCGT TCTCTGTTAC GTCAGGGCGA TATTGATGGC GCGCAAAAAC AGCTCGATCG GCTGTCGCAG TTAGCGCCGA GTTCAAATGC GTATAAATCG TCGCGGACCA CAATGTTGCT TTCCACGCCG GATGGTCGTC AGGCACTGCA ACAGGCAAGA TTGCAGGCGA CTACTGGTCA TGCAGAAGAA GCTGTGGCGA GTTACAACAA ACTGTTCAAC GGTGCGCCGC CGGAAGGTGA CATTGCCGTC GAGTACTGGA GTACGGTGGC GAAAATTCCG GCTCGCCGTG GCGAAGCGAT TAATCAGCTA AAACGCATTA ATGCGGATAC GCCGGGCAAT ACGGGCCTGC AAAACAATCT GGCGCTATTG CTGTTTAGTA GCGATCGCCG TGACGAAGGT TTTGCCGTCC TGGAACAGAT GGCGAAATCG AACGCCGGGC GCGAAGGGGC CTCTAAAATC TGGTACGGGC AGATTAAAGA CATGCCCGTC AGCGATGCCA GTGTGTCGGC ACTGAAAAAA TATCTCTCGA TCTTTAGCGA TGGCGATAGC GTGGCGGCTG CACAATCGCA ACTGGCAGAA CAGCAAAAAC AGCTGGCCGA CCCTGCTTTC CGCGCTCGTG CGCAAGGTTT AGCGGCGGTG GACTCTGGTA TGGCGGGTAA AGCCATTCCC GAACTACAAC AGGCGGTGCG GGCAAACCCG AAAGACAGTG AGGCTCTGGG GGCGCTGGGC CAGGCGTATT CTCAGAAAGG CGATCGCGCC AATGCGGTGG CGAATCTGGA AAAAGCCCTC GCACTGGACC CGCACAGCAG CAACAACGAC AAATGGAACA GTCTGCTGAA AGTAAACCGC TACTGGCTGG CAATCCAGCA GGGCGATGCT GCGCTGAAAG CCAATAATCC TGACCGGGCA GAACGCCTGT TCCAGCAGGC GCGTAATGTC GATAACACCG ACAGCTATGC AGTGCTTGGG CTGGGCGATG TGGCGATGGC GCGCAAAGAT TACCCCGCCG CTGAACGCTA TTATCAGCAG ACCCTGCGTA TGGACAGCGG CAACACTAAC GCCGTGCGCG GGCTGGCAAA TATTTACCGC CAGCAATCGC CAGAAAAAGC TGAAGCGTTT ATCGCCTCGC TCTCTGCCAG TCAGCGGCGT AGCATTGATG ATATCGAACG CAGCCTACAA AACGACCGTC TGGCACAGCA GGCAGAGGCA CTGGAAAACC AGGGCAAATG GGGGCAGGCG GCAGCACTTC AGCGGCAACG ACTGGCGCTG GATCCCGGCA GCGTATGGAT TACTTACCGA CTTTCGCAGG ATCTCTGGCA GGCCGGACAA CGCAGCCAGG CTGATACATT AATGCGCAAT CTGGCACAGC AGAAGCCGAA CGACCCGGAG CAGGTTTACG CTTACGGGCT GTATCTCTCT GGTCATGACC AGGACAGAGC GGCGCTGGCG CATATCAACA GCCTGCCGCG CGCGCAGTGG AACAGCAATA TTCAGGAACT GGTTAATCGC CTGCAAAGCG ATCAGGTGCT GGAAACCGCT AATCGCCTGC GAGAAAACGG CAAAGAAGCT GAAGCGGAAG CGATGCTGCG CCAGCAACCA CCTTCCACGC GCATAGACCT CACGCTGGCT GACTGGGCGC AACAACGACG TGATTACACC GCCGCTCGCG CTGCATATCA GAATGTCCTG ACGCGGGAGC CAACTAACGC CGATGCCATT CTTGGTCTGA CGGAAGTGGA TATTGCTGCC GGTGACAAAG CGGCGGCACG TAGCCAGCTG GCGAAACTGC CCGCCACTGA TAACGCCTCG CTGAACACTC AGCGGCGCGT GGCGCTGGCA CAGGCGCAGC TTGGCGATAC CGCAGCGGCG CAGCAGACGT TTAATAAGTT GATCCCGCAG GCAAAATCTC AGCCACCGTC GATGGAAAGC GCGATGGTGC TGCGCGACGG TGCGAAGTTT GAAGCGCAGG CGGGCGATCC AACGCAGGCG CTGGAAACCT ACAAAGACGC GATGGTCGCA TCCGGTGTGA CCACGACGCG TCCGCAGGAT AACGACACCT TTACCCAGCT GACCCGTAAC GACGAGAAAG ATGACTGGCT GAAACGCGGC GTGCGCAGCG ATGCAGCGGA CCTCTATCGC CAGCAGGATC TTAACGTCAC CCTCGAGCAC GATTACTGGG GTTCGAGCGG CACCGGTGGT TACTCCGATC TGAAAGCGCA CACCACTATG TTGCAGGTGG ATGCGCCGTA TTCTGACGGG CGGATGTTCT TTCGCAGTGA TTTCGTCAAT ATGAACGTCG GCAGTTTCTC CACTAATGCC GATGGTAAAT GGGATGACAA CTGGGGCACC TGTACATTAC AGGATTGTAG CGGCAACCGC AGCCAGTCGG ATTCCGGTGC CAGCGTGGCG GTCGGCTGGC GAAATGACGT CTGGAGCTGG GATATCGGTA CCACGCCGAT GGGCTTCAAC GTGGTGGATG TGGTTGGCGG CATCAGTTAC AGCGATGATA TCGGGCCGCT GGGTTACACC GTTAACGCCC ATCGTCGGCC CATCTCCAGT TCTTTGCTGG CCTTTGGTGG GCAAAAAGAC TCTCCGAGCA ATACCGGGAA AAAATGGGGC GGCGTGCGTG CCGACGGCGT GGGGCTAAGT CTGAGTTACG ATAAAGGTGA AGCAAACGGC GTCTGGGCAT CGCTTAGCGG TGATCAGTTA ACTGGTAAAA ATGTCGAAGA TAACTGGCGC GTGCGCTGGA TGACGGGCTA TTACTATAAG GTCATCAACC AGAATAATCG CCGTGTGACC ATTGGTCTTA ACAATATGAT CTGGCATTAC GACAAAGACC TGAGTGGCTA CTCACTTGGT CAGGGCGGTT ACTACAGCCC GCAGGAATAC CTGTCGTTTG CCATACCGGT GATGTGGCGG GAGCGGACGG AAAACTGGTC GTGGGAGCTG GGGGCGTCTG GCTCGTGGTC GCATTCACGC ACCAAAACCA TGCCGCGTTA TCCGCTGATG AATCTGATCC CGACCGACTG GCAGGAAGAA GCTGCGCGGC AATCCAACGA TGGCGGCAGC AGTCAGGGCT TTGGCTACAC GGCGCGGGCA TTACTTGAAC GACGTGTTAC CTCCAACTGG TTTGTCGGCA CGGCAATTGA TATCCAGCAG GCAAAAGATT ACGCACCCAG CCATTTCCTG CTCTACGTAC GTTATTCCGC CGCCGGATGG CAGGGTGACA TGGATTTACC GCCGCAGCCG CTGATACCTT ACGCCGACTG GTAA
|
Protein sequence | MRKFTLNIFT LSLGLAVMPM VEAAPTAQQQ LLEQVRLGEA THREDLVQQS LYRLELIDPN NPDVVAARFR SLLRQGDIDG AQKQLDRLSQ LAPSSNAYKS SRTTMLLSTP DGRQALQQAR LQATTGHAEE AVASYNKLFN GAPPEGDIAV EYWSTVAKIP ARRGEAINQL KRINADTPGN TGLQNNLALL LFSSDRRDEG FAVLEQMAKS NAGREGASKI WYGQIKDMPV SDASVSALKK YLSIFSDGDS VAAAQSQLAE QQKQLADPAF RARAQGLAAV DSGMAGKAIP ELQQAVRANP KDSEALGALG QAYSQKGDRA NAVANLEKAL ALDPHSSNND KWNSLLKVNR YWLAIQQGDA ALKANNPDRA ERLFQQARNV DNTDSYAVLG LGDVAMARKD YPAAERYYQQ TLRMDSGNTN AVRGLANIYR QQSPEKAEAF IASLSASQRR SIDDIERSLQ NDRLAQQAEA LENQGKWGQA AALQRQRLAL DPGSVWITYR LSQDLWQAGQ RSQADTLMRN LAQQKPNDPE QVYAYGLYLS GHDQDRAALA HINSLPRAQW NSNIQELVNR LQSDQVLETA NRLRENGKEA EAEAMLRQQP PSTRIDLTLA DWAQQRRDYT AARAAYQNVL TREPTNADAI LGLTEVDIAA GDKAAARSQL AKLPATDNAS LNTQRRVALA QAQLGDTAAA QQTFNKLIPQ AKSQPPSMES AMVLRDGAKF EAQAGDPTQA LETYKDAMVA SGVTTTRPQD NDTFTQLTRN DEKDDWLKRG VRSDAADLYR QQDLNVTLEH DYWGSSGTGG YSDLKAHTTM LQVDAPYSDG RMFFRSDFVN MNVGSFSTNA DGKWDDNWGT CTLQDCSGNR SQSDSGASVA VGWRNDVWSW DIGTTPMGFN VVDVVGGISY SDDIGPLGYT VNAHRRPISS SLLAFGGQKD SPSNTGKKWG GVRADGVGLS LSYDKGEANG VWASLSGDQL TGKNVEDNWR VRWMTGYYYK VINQNNRRVT IGLNNMIWHY DKDLSGYSLG QGGYYSPQEY LSFAIPVMWR ERTENWSWEL GASGSWSHSR TKTMPRYPLM NLIPTDWQEE AARQSNDGGS SQGFGYTARA LLERRVTSNW FVGTAIDIQQ AKDYAPSHFL LYVRYSAAGW QGDMDLPPQP LIPYADW
|
| |