Gene EcDH1_1839 details


Gene Information

Locus tag: EcDH1_1839
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1990499
End bp: 1991614
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: ribonuclease D
Protein accession: ACX39497
Protein GI: 260449075
COG category:
COG ID:
TIGRFAM ID:
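
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1990499
end_bp = 1991614

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1116 0 371
```

Under these assumptions the computed values agree with the listed Gene Length (1116 bp) and Protein Length (371 aa).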


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0204908
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACCA CGGACGATGC GCTGGCTTCT TTGTGTGAAG CCGTCCGTGC CTTTCCGGCG 
ATAGCCCTGG ATACTGAATT TGTTCGTACG CGCACTTATT ACCCGCAGCT GGGGTTGATT
CAACTTTTCG ATGGCGAGCA TCTGGCGCTA ATCGATCCAC TCGGGATCAC CGACTGGTCA
CCGCTGAAAG CGATCCTGCG CGATCCGTCC ATCACAAAAT TTCTCCATGC AGGCAGTGAA
GATCTGGAAG TGTTCCTCAA TGTCTTTGGC GAATTACCAC AACCCTTGAT TGACACGCAA
ATCCTTGCTG CCTTCTGCGG ACGCCCGATG TCATGGGGTT TCGCTTCCAT GGTGGAAGAG
TATTCCGGCG TTACGCTGGA CAAGAGTGAA TCGCGCACCG ACTGGCTGGC CAGACCGCTG
ACCGAACGTC AGTGTGAATA CGCAGCGGCG GATGTCTGGT ATCTGTTACC GATCACCGCC
AAGCTTATGG TAGAAACGGA GGCCTCCGGC TGGCTACCTG CGGCGCTGGA TGAATGCCGC
CTGATGCAAA TGCGTCGTCA GGAAGTCGTT GCGCCGGAAG ATGCCTGGCG TGATATCACC
AATGCCTGGC AATTACGCAC ACGCCAACTG GCCTGTCTGC AACTGTTAGC CGACTGGCGA
CTGCGCAAGG CGCGAGAGCG CGATCTGGCG GTGAACTTTG TCGTGCGTGA AGAGCATTTG
TGGTCGGTAG CGCGTTATAT GCCGGGAAGT TTAGGCGAAC TGGACAGCCT GGGTTTATCC
GGTAGCGAAA TCCGCTTTCA CGGTAAAACG CTGCTAGCGC TGGTGGAAAA AGCGCAGACA
TTGCCGGAAG ATGCCTTACC GCAGCCGATG CTTAACCTGA TGGACATGCC GGGTTATCGT
AAAGCGTTTA AAGCGATTAA GTCGCTGATT ACTGACGTGA GCGAAACGCA TAAGATCAGC
GCCGAATTGC TGGCATCGCG TCGGCAAATC AACCAACTGC TGAACTGGCA CTGGAAACTG
AAACCGCAGA ACAATTTGCC GGAGCTGATT TCCGGCTGGC GTGGTGAGCT GATGGCGGAA
GCATTACACA ATTTATTGCA GGAATATCCG CAGTAA
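
A GC content figure such as the one in the Gene Information section can be recomputed from a nucleotide sequence in this layout. The sketch below assumes GC content means the fraction of G and C bases over all bases, with the spaces and line breaks between 10-base blocks ignored. The function name `gc_content` is hypothetical, and the sample covers only the first two blocks of the gene sequence, so its result is not the gene-wide value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above (not the whole gene).
sample = "ATGATTACCA CGGACGATGC"
print(f"{gc_content(sample):.0%}")
```

Passing the full gene sequence as a single string, line breaks included, would give the gene-wide fraction.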
 
Protein sequence
MITTDDALAS LCEAVRAFPA IALDTEFVRT RTYYPQLGLI QLFDGEHLAL IDPLGITDWS 
PLKAILRDPS ITKFLHAGSE DLEVFLNVFG ELPQPLIDTQ ILAAFCGRPM SWGFASMVEE
YSGVTLDKSE SRTDWLARPL TERQCEYAAA DVWYLLPITA KLMVETEASG WLPAALDECR
LMQMRRQEVV APEDAWRDIT NAWQLRTRQL ACLQLLADWR LRKARERDLA VNFVVREEHL
WSVARYMPGS LGELDSLGLS GSEIRFHGKT LLALVEKAQT LPEDALPQPM LNLMDMPGYR
KAFKAIKSLI TDVSETHKIS AELLASRRQI NQLLNWHWKL KPQNNLPELI SGWRGELMAE
ALHNLLQEYP Q
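
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce that, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. It does not model the alternative start codons that table 11 allows; this gene begins with ATG, so that does not matter here. `CODON_TABLE` and `translate` are illustrative names, not part of the record.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are the same in
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence (three 10-base blocks).
print(translate("ATGATTACCA CGGACGATGC GCTGGCTTCT"))  # MITTDDALAS
```

The output matches the first block of the protein sequence. Passing the full 1116 bp gene sequence to the same function should yield the 371-residue protein, with the final TAA codon read as the stop.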