Gene EcDH1_2156 details

Gene Information

Locus tag: EcDH1_2156
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2309655
End bp: 2310974
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: protein of unknown function DUF187
Protein accession: ACX39808
Protein GI: 260449386
COG category:
COG ID:
TIGRFAM ID:
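
The coordinate and length fields in this table are mutually consistent if the start and end positions are read as 1-based inclusive coordinates and the final codon is a stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2309655, 2310974)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

With the values from this record, the two results match the listed Gene Length (1320 bp) and Protein Length (439 aa).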


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.360071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATCT GCTCCCGAAA CAAGAAATTA ACGATTAGAA GACCAGCGAT ACTAGTTGCA 
CTGGCACTTT TACTGTGTAG TTGTAAAAGC ACGCCTCCAG AGTCCATGGT GACACCACCA
GCAGGTTCAA AGCCACCAGC CACGACGCAA CAATCGTCAC AACCGATGCG TGGCATCTGG
CTGGCCACGG TTTCTCGGCT CGACTGGCCA CCGGTTTCCT CGGTTAACAT TAGTAACCCC
ACCAGCCGGG CCCGTGTACA ACAACAGGCG ATGATCGACA AACTGGATCA TCTGCAACGT
CTCGGCATAA ACACGGTCTT TTTCCAGGTC AAGCCGGACG GTACCGCCCT GTGGCCATCG
AAAATTTTGC CGTGGTCCGA TCTTATGACC GGTAAGATTG GTGAAAATCC GGGTTACGAT
CCGCTGCAAT TTATGCTCGA CGAAGCCCAC AAGCGTGGGA TGAAAGTACA CGCCTGGTTT
AACCCCTATC GCGTATCGGT TAATACGAAG CCCGGTACTA TCAGGGAACT GAATAGCACT
CTGTCTCAAC AACCGGCGAG CGTCTATGTG CAACACCGCG ACTGGATCAG AACGTCTGGC
GATCGCTTTG TCCTCGACCC GGGCATCCCT GAGGTTCAGG ACTGGATCAC ATCAATAGTC
GCAGAAGTGG TTTCCCGCTA TCCCGTAGAT GGCGTGCAGT TTGACGACTA TTTCTATACG
GAGTCACCGG GTTCACGGCT AAATGATAAC GAAACGTACC GTAAATACGG AGGCGCATTT
GCGTCAAAAG CAGACTGGCG GCGCAACAAT ACTCAGCAGT TAATTGCAAA GGTATCGCAC
ACCATTAAAA GCATTAAGCC GGGAGTCGAA TTTGGTGTTA GCCCGGCAGG CGTGTGGCGT
AACCGATCAC ACGATCCGCT CGGTTCCGAT ACCCGAGGCG CGGCAGCCTA TGACGAATCC
TACGCTGACA CCCGTCGATG GGTGGAACAA GGATTGCTGG ATTACATTGC TCCCCAAATT
TACTGGCCGT TCTCACGGAG TGCCGCGCGT TATGACGTGT TGGCAAAATG GTGGGCGGAT
GTCGTTAAAC CGACCAGGAC CCGCCTGTAT ATCGGTATCG CCTTCTATAA AGTGGGTGAA
CCTTCAAAGA TAGAGCCAGA CTGGATGATT AACGGCGGCG TACCGGAACT GAAAAAGCAG
CTCGATCTTA ACGATGCTGT GCCCGAAATT AGCGGCACCA TCTTGTTCCG TGAGGACTAT
CTGAATAAAC CGCAGACTCA ACAAGCGGTC AGCTATCTGC AAAGTCGCTG GGGCAGTTAA
 
Protein sequence
MDICSRNKKL TIRRPAILVA LALLLCSCKS TPPESMVTPP AGSKPPATTQ QSSQPMRGIW 
LATVSRLDWP PVSSVNISNP TSRARVQQQA MIDKLDHLQR LGINTVFFQV KPDGTALWPS
KILPWSDLMT GKIGENPGYD PLQFMLDEAH KRGMKVHAWF NPYRVSVNTK PGTIRELNST
LSQQPASVYV QHRDWIRTSG DRFVLDPGIP EVQDWITSIV AEVVSRYPVD GVQFDDYFYT
ESPGSRLNDN ETYRKYGGAF ASKADWRRNN TQQLIAKVSH TIKSIKPGVE FGVSPAGVWR
NRSHDPLGSD TRGAAAYDES YADTRRWVEQ GLLDYIAPQI YWPFSRSAAR YDVLAKWWAD
VVKPTRTRLY IGIAFYKVGE PSKIEPDWMI NGGVPELKKQ LDLNDAVPEI SGTILFREDY
LNKPQTQQAV SYLQSRWGS
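
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two tables differ in which start codons they permit). The helper names are hypothetical, and the whitespace that groups bases into blocks of ten is stripped before processing.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); assumed to
# apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGGATATCT GCTCCCGAAA CAAGAAATTA ACGATTAGAA GACCAGCGAT ACTAGTTGCA"
print(translate(first_row))
```

Applied to the first row of the gene sequence, `translate` yields MDICSRNKKLTIRRPAILVA, the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.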