Gene Ent638_3653 details


Gene Information

Locus tag: Ent638_3653
Symbol:
ID: 5111901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start (bp): 3956050
End (bp): 3957696
Gene length: 1647 bp
Protein length: 548 aa
Translation table: 11
GC content: 56%
IMG OID: 640493858
Product: dihydroxyacetone kinase
Protein accession: YP_001178361
Protein GI: 146313287
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2376] Dihydroxyacetone kinase
TIGRFAM ID: [TIGR02361] dihydroxyacetone kinase, ATP-dependent
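
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither assumption is stated in the record, and the function names are hypothetical helpers introduced only for illustration.

```python
# Values copied from the gene record above.
START_BP = 3956050
END_BP = 3957696
GENE_LENGTH_BP = 1647
PROTEIN_LENGTH_AA = 548


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 3957696 - 3956050 + 1 = 1647 bp, and 1647 / 3 = 549 codons, of which the last is the stop codon, leaving 548 amino acids.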


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0429503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGAT TCTTTTTTAA TGACCGCAAA CAGCTGGTCA ACGACGCCAT TGAAGGCATA 
CTGATTTCCG CGCCGCACGG GAATCTTGTC AAACTTGATA TCGATCCGGC CATTCGGGTG
GTTGCGCGTA GCGACTGGGA TAAAAGCCGC GTAGCGGTGA TTTCCGGTGG TGGGTCGGGG
CACGAACCCG CTCATGCCGG ATTTGTCGGC AAAGGGATGT TGACCGCAGC CGTCTGTGGC
GATCTGTTTG CCTCACCGAG CGTAGATGCG GTGTTAAACG CGATTGTGGC GGTAACGGGC
GATCGCGGTT GCCTGTTAAT CGTCAAAAAT TATACCGGCG ATCGGCTTAA CTTTGGCCTC
GCGGCGGAAA AGGCCAAACG CTATGGGCTG AAGGTTGAGA TGGTGATTGT TGCTGATGAC
ATCGCCCTGC CGGATAACAA ACAGCCGCGT GGCATTGCGG GTACGGCGCT GGTACACAAA
ATTGCCGGAT ATGCAGCCGA ACAGGGGAAA TCACTGGCTG ACGTGCGGGA TATTGCGCAG
CAGGCCTGTG ACAATATCTG GAGCCTGGGC GTGGCGATGC AAACGTGCAA CCTGCCGGGC
AGCGACGATG AAGAAGGGCG TATCAAGGAT GGACATGTCG AACTGGGGCT GGGCATTCAC
GGCGAGCCGG GCGCGTCGGT GGTTGATACG CACAACAGCA AAGAGATTAT CGACACCCTG
GTGAAGCCGT TAAAAGAGAC GGCCGGCGAA GGCAAATTTG CGGTGCTGAT TAACAATCTC
GGCGGTGTAT CGGCGCTGGA GATGGCGCTG CTCACGAAAG AACTGGCGGA TTCTGCGCTG
AAAGAAAATA TTGCGTATCT GATTGGCCCT GCGCCGCTGG TAAGCTCGCT GGATATGAAA
GGCTTTTCGC TGTCACTGTT ACAGCTTAAC GATACCTTTG AGAAAGCCAT TAACGCACCC
GTCGAAACTA TCGGCTGGCA AAAGCCGGTA GCATTCGCGC CATTACGCAC GCTTTCGCAT
ACTGCGATTC AGGATCGTGT TGAATTTACG CCTTCCGGGA ACGACGAGGT CGCAGCGCGA
GTGGCAGCGG CGACGCAAAC GTTGCTCGCT CTGGAGAACC GTTTAAATGC GCTGGACGCC
AAAGTGGGCG ACGGCGATAC CGGGTCGACT TTTGCGCAAG GCGCGCGGGA AATTGCGCAG
CTTCTGGAGC AAAAACAGCT TCCGCTAAAC GATCTTTCTA AGCTGCTGTT GTTGATCGGC
GAACGGCTGG CGACGGTCAT GGGCGGGTCG AGTGGCGTCC TGATGTCGAT CTTCTTCACA
GCTGCCGGAC AGAAAATGCA TGACGGAAAA TCACTGCCGG AGGCATTGCT GAGTGGGCTT
GCGCAAATGA AGCATTACGG CGGAGCGGAT CTTGGCGATC GTACCTTGAT CGACGCGCTA
CAGCCTGCAC TGGAGACGCT GCATAACGGC GATATTCAGG CGGCTGCCCA GGCAGCGAAA
AAAGGCGCAG ACGCTACGGC TGGCATGCAA AAAGCGGGAG CAGGGCGTTC GTCGTATGTG
AATAAAGAGA ACCTGGAAGG TGTAATAGAT CCTGGGGCAG TGGCCGTTGC AGAGGTGTTT
GCGGCAGTGG CCAAAGCAAA ACAGTAG
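
A GC content figure like the one in the record can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown above (blocks of ten bases separated by spaces); `gc_fraction` is a hypothetical helper, not part of any database tool. Only the first line of the gene sequence is used here, so the value it prints is for that line alone and is not the whole-gene figure.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above (60 bases).
FIRST_LINE = (
    "ATGTCCAGAT TCTTTTTTAA TGACCGCAAA "
    "CAGCTGGTCA ACGACGCCAT TGAAGGCATA"
)

print(f"{gc_fraction(FIRST_LINE):.1%}")
```

Passing the complete gene sequence to the same function would give the figure to compare against the GC content field.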
 
Protein sequence
MSRFFFNDRK QLVNDAIEGI LISAPHGNLV KLDIDPAIRV VARSDWDKSR VAVISGGGSG 
HEPAHAGFVG KGMLTAAVCG DLFASPSVDA VLNAIVAVTG DRGCLLIVKN YTGDRLNFGL
AAEKAKRYGL KVEMVIVADD IALPDNKQPR GIAGTALVHK IAGYAAEQGK SLADVRDIAQ
QACDNIWSLG VAMQTCNLPG SDDEEGRIKD GHVELGLGIH GEPGASVVDT HNSKEIIDTL
VKPLKETAGE GKFAVLINNL GGVSALEMAL LTKELADSAL KENIAYLIGP APLVSSLDMK
GFSLSLLQLN DTFEKAINAP VETIGWQKPV AFAPLRTLSH TAIQDRVEFT PSGNDEVAAR
VAAATQTLLA LENRLNALDA KVGDGDTGST FAQGAREIAQ LLEQKQLPLN DLSKLLLLIG
ERLATVMGGS SGVLMSIFFT AAGQKMHDGK SLPEALLSGL AQMKHYGGAD LGDRTLIDAL
QPALETLHNG DIQAAAQAAK KGADATAGMQ KAGAGRSSYV NKENLEGVID PGAVAVAEVF
AAVAKAKQ
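
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the amino-acid string that NCBI publishes for translation table 11, which assigns the same amino acids as the standard code and differs from it only in the permitted start codons. The `translate` function is a hypothetical helper that stops at the first stop codon and applies no special handling to start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above. Every full line holds
# 60 bases, so each line starts in frame.
FIRST_LINE = (
    "ATGTCCAGAT TCTTTTTTAA TGACCGCAAA "
    "CAGCTGGTCA ACGACGCCAT TGAAGGCATA"
)
LAST_LINE = "GCGGCAGTGG CCAAAGCAAA ACAGTAG"

print(translate(FIRST_LINE))  # MSRFFFNDRKQLVNDAIEGI
print(translate(LAST_LINE))   # AAVAKAKQ
```

The two outputs match the first 20 and the last 8 residues of the protein sequence listed above.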