Gene EcDH1_2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2103 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2247565 
End bp2249025 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content49% 
IMG OID 
ProductMannitol dehydrogenase domain protein 
Protein accessionACX39758 
Protein GI260449336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000000135615 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAATA ATTTGTTATC AGCAAAAGCG ACACTCCCTG TTTATGATCT TAATAACCTG 
GCTCCAAGAA TTGTTCATTT AGGCTTTGGT GCATTTCACC GTGCGCATCA GGGTGTGTAT
GCCGATATTC TTGCTACGGA ACATTTCAGT GACTGGGGAT ATTATGAGGT CAACTTAATC
GGCGGCGAAC AGCAAATTGC CGATTTACAA CAGCAAGATA ATCTTTATAC CGTTGCGGAA
ATGTCGGCCG ATGTGTGGAC GGCTCGCGTC GTTGGCGTCG TTAAAAAAGC CTTGCACGTA
CAGATAGATG GCTTAGAAAC CGTGTTGGCA GCGATGTGTG AACCGCAAAT CGCGATTGTC
TCTCTGACAA TCACCGAAAA AGGGTATTTC CACTCTCCGG CGACCGGACA GTTAATGCTC
GATCACCCGA TGGTAGCTGC CGACGTGCAA AATCCCCACC AGCCGAAAAC AGCAACAGGG
GTGATTGTTG AGGCGCTGGC TCGCCGTAAA GCGGCAGGAC TTCCCGCATT TACCGTCATG
TCATGTGACA ACATGCCAGA AAACGGTCAT GTTATGCGTG ACGTTGTCAC TTCCTACGCA
CAAGCCGTTG ATGTAAAACT GGCACAATGG ATCGAAGATA ACGTGACTTT CCCATCAACA
ATGGTGGACC GTATTGTGCC CGCAGTGACA GAGGATACGC TGGCGAAAAT CGAACAACTT
ACCGGTGTGC GCGATCCTGC GGGCGTTGCC TGTGAACCTT TCCGCCAGTG GGTAATAGAA
GATAACTTTG TTGCCGGACG TCCGGAATGG GAAAAAGCGG GAGCCGAACT GGTTAGCGAT
GTGCTGCCTT ATGAAGAGAT GAAGTTGCGC ATGCTCAACG GCAGTCATTC ATTCCTGGCG
TATCTGGGGT ATCTTGCAGG ATATCAGCAC ATTAATGACT GTATGGAAGA TGAACATTAT
CGTTATGCGG CGTATGGCTT GATGTTGCAG GAACAAGCGC CGACGTTGAA AGTGCAGGGC
GTTGATTTGC AAGATTACGC TAACCGATTA ATTGCACGCT ATAGCAACCC GGCGTTACGT
CATCGAACCT GGCAGATTGC GATGGATGGT AGCCAGAAAT TGCCACAGCG GATGTTGGAT
TCTGTTCGCT GGCATCTGGC GCATGACAGC AAGTTCGATC TGCTGGCGCT GGGCGTCGCG
GGTTGGATGC GTTATGTCGG TGGTGTTGAT GAACAGGGAA ATCCGATAGA AATCAGTGAC
CCACTGTTAC CTGTTATTCA GAAGGCTGTA CAAAGTAGTG CCGAAGGGAA AGCGCGCGTC
CAGTCATTGC TGGCGATTAA GGCGATCTTT GGTGATGATT TGCCAGACAA TAGTTTGTTT
ACTGCAAGAG TGACGGAAAC GTACTTGTCT TTATTAGCGC ATGGCGCGAA AGCGACCGTG
GCAAAATATT CCGTGAAGTA A
 
Protein sequence
MGNNLLSAKA TLPVYDLNNL APRIVHLGFG AFHRAHQGVY ADILATEHFS DWGYYEVNLI 
GGEQQIADLQ QQDNLYTVAE MSADVWTARV VGVVKKALHV QIDGLETVLA AMCEPQIAIV
SLTITEKGYF HSPATGQLML DHPMVAADVQ NPHQPKTATG VIVEALARRK AAGLPAFTVM
SCDNMPENGH VMRDVVTSYA QAVDVKLAQW IEDNVTFPST MVDRIVPAVT EDTLAKIEQL
TGVRDPAGVA CEPFRQWVIE DNFVAGRPEW EKAGAELVSD VLPYEEMKLR MLNGSHSFLA
YLGYLAGYQH INDCMEDEHY RYAAYGLMLQ EQAPTLKVQG VDLQDYANRL IARYSNPALR
HRTWQIAMDG SQKLPQRMLD SVRWHLAHDS KFDLLALGVA GWMRYVGGVD EQGNPIEISD
PLLPVIQKAV QSSAEGKARV QSLLAIKAIF GDDLPDNSLF TARVTETYLS LLAHGAKATV
AKYSVK