Gene EcDH1_0183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0183 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp194590 
End bp198063 
Gene Length3474 bp 
Protein Length1157 aa 
Translation table11 
GC content57% 
IMG OID 
Productcellulose synthase operon C domain protein 
Protein accessionACX37877 
Protein GI260447455 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAAT TCACACTAAA CATATTCACG CTTTCCCTCG GTCTGGCCGT CATGCCGATG 
GTCGAGGCAG CACCAACCGC TCAGCAACAG TTGCTGGAGC AAGTTCGGTT AGGCGAAGCG
ACCCATCGTG AAGATCTGGT GCAACAGTCG TTATATCGGC TGGAACTTAT TGATCCGAAT
AACCCGGACG TCGTTGCCGC CCGTTTCCGT TCTTTGTTAC GTCAGGGCGA TATTGATGGC
GCGCAAAAAC AGCTCGATCG GCTGTCGCAG TTAGCGCCGA GTTCAAATGC GTATAAATCG
TCGCGGACTA CGATGCTACT TTCCACGCCG GATGGTCGTC AGGCACTGCA ACAGGCACGA
TTGCAGGCGA CGACCGGTCA TGCAGAAGAA GCTGTGGCGA GTTACAACAA ACTGTTCAAC
GGTGCGCCGC CGGAAGGTGA CATTGCTGTC GAGTACTGGA GTACGGTGGC GAAAATTCCG
GCTCGCCGTG GCGAAGCGAT TAATCAGTTA AAACGCATCA ATGCGGATGC ACCGGGCAAT
ACGGGCCTGC AAAACAATCT GGCGCTATTG CTGTTTAGTA GCGATCGCCG TGACGAAGGT
TTTGCCGTCC TGGAACAGAT GGCAAAATCG AACGCCGGGC GCGAAGGGGC CTCTAAAATC
TGGTACGGGC AGATTAAAGA CATGCCCGTC AGTGATGCCA GTGTGTCGGC GCTGAAAAAA
TATCTCTCGA TCTTTAGTGA TGGCGATAGC GTGGCGGCTG CGCAATCGCA ACTGGCAGAA
CAGCAAAAAC AGCTGGCCGA TCCTGCTTTC CGCGCTCGTG CGCAAGGTTT AGCGGCGGTG
GACTCTGGTA TGGCGGGTAA AGCCATTCCC GAACTACAAC AGGCGGTGCG GGCGAACCCG
AAAGACAGTG AAGCTCTGGG GGCGCTGGGC CAGGCGTATT CTCAGAAAGG CGATCGCGCC
AATGCAGTGG CGAATCTGGA AAAAGCCCTC GCACTGGACC CGCACAGCAG CAACAACGAC
AAATGGAACA GTCTGCTGAA AGTAAACCGC TACTGGCTGG CGATCCAGCA GGGCGATGCT
GCGCTGAAAG CCAATAATCC TGACCGGGCA GAACGCCTGT TCCAGCAGGC GCGTAATGTC
GATAACACCG ACAGTTATGC AGTGCTGGGG CTGGGCGATG TGGCGATGGC GCGAAAAGAT
TATCCCGCCG CCGAACGTTA TTATCAGCAG ACCTTGCGTA TGGACAGCGG CAACACTAAC
GCCGTGCGCG GGCTGGCAAA TATTTACCGC CAGCAATCGC CAGAAAAAGC TGAAGCGTTT
ATCGCCTCGC TCTCTGCCAG TCAGCGGCGT AGCATTGATG ATATCGAACG CAGCCTGCAA
AACGACCGTC TGGCACAGCA GGCAGAGGCA CTGGAAAACC AGGGCAAATG GGCGCAGGCG
GCAGCACTTC AGCGGCAACG ACTGGCGCTG GACCCCGGCA GCGTATGGAT TACTTACCGA
CTTTCGCAGG ATCTCTGGCA GGCCGGACAA CGCAGCCAGG CCGATACGTT AATGCGCAAT
CTGGCGCAGC AGAAGTCGAA CGACCCGGAG CAGGTTTACG CTTACGGGCT GTACCTCTCT
GGTCATGACC AGGACAGAGC GGCGCTGGCG CATATCAATA GCCTGCCGCG TGCGCAGTGG
AACAGCAATA TTCAGGAGCT GGTTAATCGA CTGCAAAGCG ATCAGGTGCT GGAAACCGCT
AACCGCCTGC GAGAAAGCGG CAAAGAGGCA GAAGCGGAAG CGATGCTGCG CCAGCAACCA
CCTTCCACGC GTATTGACCT CACGCTGGCT GACTGGGCGC AACAACGACG TGATTACACC
GCCGCCCGCG CTGCATATCA GAATGTCCTG ACGCGGGAGC CAGCTAACGC CGACGCCATT
CTTGGTCTGA CGGAAGTGGA TATTGCTGCC GGTGACAAAG CGGCGGCACG TAGCCAGCTG
GCGAAACTGC CCGCTACCGA TAACGCCTCG CTGAACACAC AGCGGCGCGT GGCGCTGGCA
CAGGCGCAGC TTGGCGATAC CGCAGCAGCG CAGCGGACGT TTAATAAGTT GATCCCGCAG
GCAAAATCTC AGCCACCGTC GATGGAAAGC GCGATGGTGC TGCGTGATGG TGCGAAGTTT
GAAGCGCAGG CGGGCGATCC AACGCAGGCG CTGGAAACCT ACAAAGACGC CATGGTCGCA
TCCGGTGTGA CTACGACGCG TCCGCAGGAT AACGACACCT TTACCCGACT GACCCGTAAC
GACGAGAAAG ATGACTGGCT GAAACGTGGC GTGCGCAGCG ATGCGGCGGA CCTCTATCGC
CAGCAGGATC TTAACGTCAC CCTTGAGCAC GATTACTGGG GTTCGAGCGG CACCGGTGGT
TACTCCGATC TGAAAGCGCA CACTACCATG TTGCAGGTGG ATGCGCCGTA TTCTGACGGG
CGGATGTTCT TTCGCAGTGA TTTCGTCAAT ATGAACGTCG GCAGTTTCTC CACTAATGCC
GATGGCAAAT GGGATGACAA CTGGGGCACC TGTACATTAC AGGACTGTAG CGGCAACCGC
AGCCAGTCGG ATTCCGGTGC CAGCGTGGCG GTCGGCTGGC GAAATGACGT CTGGAGCTGG
GATATCGGTA CCACGCCGAT GGGCTTCAAC GTGGTGGATG TGGTCGGCGG CATCAGTTAC
AGCGATGATA TCGGGCCGCT GGGTTACACC GTTAACGCCC ACCGTCGGCC CATCTCCAGT
TCTTTGCTGG CCTTTGGTGG GCAAAAAGAC TCCCCGAGCA ATACCGGGAA AAAATGGGGT
GGCGTACGTG CCGACGGTGT GGGGCTAAGT CTGAGCTACG ATAAAGGTGA AGCAAACGGC
GTCTGGGCAT CGCTTAGTGG CGACCAGTTA ACCGGTAAAA ATGTCGAAGA TAACTGGCGC
GTGCGCTGGA TGACGGGCTA TTACTATAAG GTCATTAACC AGAACAATCG CCGCGTCACA
ATCGGCCTGA ACAACATGAT CTGGCATTAC GACAAAGATC TGAGTGGCTA CTCACTCGGT
CAGGGCGGTT ACTACAGTCC GCAGGAATAC CTGTCGTTTG CCATACCGGT GATGTGGCGG
GAGCGCACGG AAAACTGGTC GTGGGAGCTG GGTGCGTCTG GCTCGTGGTC GCATTCACGC
ACCAAAACCA TGCCGCGTTA TCCGCTGATG AATCTGATCC CGACCGACTG GCAGGAAGAA
GCTGCGCGGC AATCCAACGA TGGCGGCAGC AGTCAGGGCT TCGGCTACAC GGCGCGGGCA
TTACTTGAAC GACGTGTTAC TTCCAACTGG TTTGTTGGCA CGGCAATTGA TATCCAGCAG
GCGAAAGATT ACGCACCCAG CCATTTCCTG CTCTACGTAC GTTATTCCGC CGCCGGATGG
CAGGGTGACA TGGATTTACC GCCGCAGCCG CTGATACCTT ACGCCGACTG GTAA
 
Protein sequence
MRKFTLNIFT LSLGLAVMPM VEAAPTAQQQ LLEQVRLGEA THREDLVQQS LYRLELIDPN 
NPDVVAARFR SLLRQGDIDG AQKQLDRLSQ LAPSSNAYKS SRTTMLLSTP DGRQALQQAR
LQATTGHAEE AVASYNKLFN GAPPEGDIAV EYWSTVAKIP ARRGEAINQL KRINADAPGN
TGLQNNLALL LFSSDRRDEG FAVLEQMAKS NAGREGASKI WYGQIKDMPV SDASVSALKK
YLSIFSDGDS VAAAQSQLAE QQKQLADPAF RARAQGLAAV DSGMAGKAIP ELQQAVRANP
KDSEALGALG QAYSQKGDRA NAVANLEKAL ALDPHSSNND KWNSLLKVNR YWLAIQQGDA
ALKANNPDRA ERLFQQARNV DNTDSYAVLG LGDVAMARKD YPAAERYYQQ TLRMDSGNTN
AVRGLANIYR QQSPEKAEAF IASLSASQRR SIDDIERSLQ NDRLAQQAEA LENQGKWAQA
AALQRQRLAL DPGSVWITYR LSQDLWQAGQ RSQADTLMRN LAQQKSNDPE QVYAYGLYLS
GHDQDRAALA HINSLPRAQW NSNIQELVNR LQSDQVLETA NRLRESGKEA EAEAMLRQQP
PSTRIDLTLA DWAQQRRDYT AARAAYQNVL TREPANADAI LGLTEVDIAA GDKAAARSQL
AKLPATDNAS LNTQRRVALA QAQLGDTAAA QRTFNKLIPQ AKSQPPSMES AMVLRDGAKF
EAQAGDPTQA LETYKDAMVA SGVTTTRPQD NDTFTRLTRN DEKDDWLKRG VRSDAADLYR
QQDLNVTLEH DYWGSSGTGG YSDLKAHTTM LQVDAPYSDG RMFFRSDFVN MNVGSFSTNA
DGKWDDNWGT CTLQDCSGNR SQSDSGASVA VGWRNDVWSW DIGTTPMGFN VVDVVGGISY
SDDIGPLGYT VNAHRRPISS SLLAFGGQKD SPSNTGKKWG GVRADGVGLS LSYDKGEANG
VWASLSGDQL TGKNVEDNWR VRWMTGYYYK VINQNNRRVT IGLNNMIWHY DKDLSGYSLG
QGGYYSPQEY LSFAIPVMWR ERTENWSWEL GASGSWSHSR TKTMPRYPLM NLIPTDWQEE
AARQSNDGGS SQGFGYTARA LLERRVTSNW FVGTAIDIQQ AKDYAPSHFL LYVRYSAAGW
QGDMDLPPQP LIPYADW