Gene EcDH1_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0531 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp559694 
End bp561031 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content54% 
IMG OID 
Productphosphoglucosamine mutase 
Protein accessionACX38219 
Protein GI260447797 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0782231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATC GTAAATATTT CGGTACCGAT GGGATTCGTG GTCGTGTAGG GGATGCGCCG 
ATCACACCTG ATTTTGTGCT TAAGCTGGGT TGGGCCGCGG GTAAAGTGCT GGCGCGCCAC
GGCTCCCGTA AGATTATTAT TGGTAAAGAC ACGCGTATTT CTGGCTATAT GCTGGAGTCA
GCACTGGAAG CGGGTCTGGC GGCAGCGGGC CTTTCCGCAC TCTTCACTGG CCCGATGCCA
ACACCGGCCG TGGCTTATCT GACGCGTACC TTCCGCGCAG AGGCCGGAAT TGTGATATCT
GCATCGCATA ACCCGTTCTA CGATAATGGC ATTAAATTCT TCTCTATCGA CGGCACCAAA
CTGCCGGATG CGGTAGAAGA GGCCATCGAA GCGGAAATGG AAAAGGAGAT CAGCTGCGTT
GATTCGGCAG AACTGGGTAA AGCCAGCCGT ATCGTTGATG CCGCGGGTCG CTATATCGAG
TTTTGCAAAG CCACGTTCCC GAACGAACTT AGCCTCAGTG AACTGAAGAT TGTGGTGGAT
TGTGCAAACG GTGCGACTTA TCACATCGCG CCGAACGTGC TGCGCGAACT GGGGGCGAAC
GTTATCGCTA TCGGTTGTGA GCCAAACGGT GTAAACATCA ATGCCGAAGT GGGGGCTACC
GACGTTCGCG CGCTCCAGGC TCGTGTGCTG GCTGAAAAAG CGGATCTCGG TATTGCCTTC
GACGGCGATG GCGATCGCGT GATTATGGTT GACCATGAAG GCAATAAAGT CGATGGCGAT
CAGATCATGT ATATCATCGC GCGTGAAGGT CTTCGTCAGG GCCAGCTGCG TGGTGGCGCT
GTGGGTACAT TGATGAGCAA CATGGGGCTT GAACTGGCGC TGAAACAGTT AGGAATTCCA
TTTGCGCGCG CGAAAGTGGG TGACCGCTAC GTACTGGAAA AAATGCAGGA GAAAGGCTGG
CGTATCGGTG CAGAGAATTC CGGTCATGTG ATCCTGCTGG ATAAAACTAC TACCGGTGAC
GGCATCGTTG CTGGCTTGCA GGTGCTGGCG GCGATGGCAC GTAACCATAT GAGCCTGCAC
GACCTTTGCA GCGGCATGAA AATGTTCCCG CAGATTCTGG TTAACGTACG TTACACCGCA
GGTAGCGGCG ATCCACTTGA GCATGAGTCA GTTAAAGCCG TGACCGCAGA GGTTGAAGCT
GCGCTGGGCA ACCGTGGACG CGTGTTGCTG CGTAAATCCG GCACCGAACC GTTAATTCGC
GTGATGGTGG AAGGCGAAGA CGAAGCGCAG GTGACTGAAT TTGCACACCG CATCGCCGAT
GCAGTAAAAG CCGTTTAA
 
Protein sequence
MSNRKYFGTD GIRGRVGDAP ITPDFVLKLG WAAGKVLARH GSRKIIIGKD TRISGYMLES 
ALEAGLAAAG LSALFTGPMP TPAVAYLTRT FRAEAGIVIS ASHNPFYDNG IKFFSIDGTK
LPDAVEEAIE AEMEKEISCV DSAELGKASR IVDAAGRYIE FCKATFPNEL SLSELKIVVD
CANGATYHIA PNVLRELGAN VIAIGCEPNG VNINAEVGAT DVRALQARVL AEKADLGIAF
DGDGDRVIMV DHEGNKVDGD QIMYIIAREG LRQGQLRGGA VGTLMSNMGL ELALKQLGIP
FARAKVGDRY VLEKMQEKGW RIGAENSGHV ILLDKTTTGD GIVAGLQVLA AMARNHMSLH
DLCSGMKMFP QILVNVRYTA GSGDPLEHES VKAVTAEVEA ALGNRGRVLL RKSGTEPLIR
VMVEGEDEAQ VTEFAHRIAD AVKAV