Gene EcDH1_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0100 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp102746 
End bp103936 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content56% 
IMG OID 
ProductFMN-dependent alpha-hydroxy acid dehydrogenase 
Protein accessionACX37798 
Protein GI260447376 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTATTT CCGCAGCCAG CGATTATCGC GCCGCAGCGC AACGCATTCT GCCGCCGTTC 
CTGTTCCACT ATATGGATGG TGGTGCATAT TCTGAATACA CGCTGCGCCG CAACGTGGAA
GATTTGTCAG AAGTGGCGCT GCGCCAGCGT ATTCTGAAAA ACATGTCCGA CTTAAGCCTG
GAAACGACGC TGTTTAATGA GAAATTGTCG ATGCCGGTGG CACTGGCTCC GGTGGGTTTG
TGTGGCATGT ATGCGCGTCG TGGCGAAGTT CAGGCAGCCA AAGCGGCGGA CGCGCATGGT
ATTCCGTTTA CTCTCTCGAC GGTTTCCGTT TGCCCGATTG AAGAAGTCGC GCCAGCCATC
AAGCGCCCAA TGTGGTTCCA GCTTTATGTA CTGCGCGATC GCGGCTTTAT GCGTAACGCG
CTGGAGCGAG CAAAAGCAGC GGGTTGTTCG ACGCTGGTTT TCACCGTGGA TATGCCGACA
CCGGGCGCAC GCTACCGTGA TGCGCATTCA GGTATGAGCG GCCCGAACGC GGCAATGCGC
CGCTACTTGC AAGCGGTGAC ACATCCGCAA TGGGCGTGGG ATGTGGGCCT GAACGGTCGT
CCACATGATT TAGGTAATAT CTCAGCTTAT CTCGGCAAAC CGACCGGACT GGAAGATTAC
ATCGGCTGGC TGGGGAATAA CTTCGATCCG TCCATCTCAT GGAAAGACCT TGAATGGATC
CGCGATTTCT GGGATGGCCC GATGGTGATC AAAGGGATCC TCGATCCGGA AGATGCGCGC
GATGCAGTAC GTTTTGGTGC TGATGGAATT GTGGTTTCTA ACCACGGTGG CCGCCAGCTG
GACGGTGTAC TCTCTTCCGC CCGTGCACTG CCTGCTATTG CAGATGCGGT GAAAGGTGAT
ATAGCCATTC TGGCGGATAG CGGAATTCGT AACGGGCTTG ATGTCGTGCG TATGATTGCG
CTCGGTGCCG ACACCGTACT GCTGGGTCGT GCTTTCTTGT ATGCGCTGGC AACAGCGGGC
CAGGCGGGTG TAGCTAACCT GCTAAATCTG ATCGAAAAAG AGATGAAAGT GGCGATGACG
CTGACTGGCG CGAAATCGAT CAGCGAAATT ACGCAAGATT CGCTGGTGCA GGGGCTGGGT
AAAGAGTTGC CTGCGGCACT GGCTCCCATG GCGAAAGGGA ATGCGGCATA G
 
Protein sequence
MIISAASDYR AAAQRILPPF LFHYMDGGAY SEYTLRRNVE DLSEVALRQR ILKNMSDLSL 
ETTLFNEKLS MPVALAPVGL CGMYARRGEV QAAKAADAHG IPFTLSTVSV CPIEEVAPAI
KRPMWFQLYV LRDRGFMRNA LERAKAAGCS TLVFTVDMPT PGARYRDAHS GMSGPNAAMR
RYLQAVTHPQ WAWDVGLNGR PHDLGNISAY LGKPTGLEDY IGWLGNNFDP SISWKDLEWI
RDFWDGPMVI KGILDPEDAR DAVRFGADGI VVSNHGGRQL DGVLSSARAL PAIADAVKGD
IAILADSGIR NGLDVVRMIA LGADTVLLGR AFLYALATAG QAGVANLLNL IEKEMKVAMT
LTGAKSISEI TQDSLVQGLG KELPAALAPM AKGNAA