Gene Hoch_3605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3605 
Symbol 
ID8545995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4960855 
End bp4962456 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content74% 
IMG OID646388274 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_003268000 
Protein GI262196791 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.329001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000690649 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGCTACG TGGTCACAGC CGAGGAGATG CAGGCGCTCG ATCGCGAGAC CATCGAGGGC 
ATCGGTCTCC CCGGCGTGGT GCTCATGGAG AACGCCGGCC GGGCGGTGGT GCGCATCATC
GAGGACCTGC TCGCCCACGA CGGCAAGTGG AGCGGCGACA GCGGCCGCAG CGCGCTCACC
CGCGACGCCA GCGTGAGCGG GCGACCGGCG CCGCGCATCG CCGTGGTCTG CGGCGGCGGC
AACAACGGCG GCGACGGCTA CGTCATCGGC CGCTGCCTGC GCGAGGCCGG CATGCACGTG
ACCGTGTACA TGGCCGCGCG CCGCGAGGCG GTGAAGGGCG ACGCCCGCCG CCACCTCGAC
GTCTACGAGA ACGCCGGCGG CCTGCTGGTG TCGGTGACCG ACGAGGCCTC GCTGTTCACC
CACGCCGAGC GCATCCGCAA CGCCGAGGTG GTGGTCGACG CGGTCTTTGG CACCGGGCTC
ACCCGCGAGG TGAGCGGTCA CTACCGCAAA ATCATCGAGA CCATCAACCA GTGCGGCGGC
CACCGCGTCG CCGTCGACAT CCCCAGCGGC CTCTCGGCCG ACACCGGCGA GGTGCTCGGC
ATCGCGGTCA ACGCCACCTG CACCGTGACC ATGGCGTTCC TCAAGGTAGG CCTGGCGACC
ACGCCCGGGT GCGCGCGCTG CGGCGACCTG CACGTCGCCG AGATCGGGAT TCCCGACGCG
CTGGCCGAAA AACACGGCAT CCGCACCGCC CTGATCGAAC CCGACGACCT CACGCCGCTG
CTGCCCGCGG ACGATGCGGT GGTGCACAAG AACCGCCGCG GTCACGTGCT GGCGGTGGCC
GGCTCGCCGG GCAAACGCGG CGCCGGACGG CTGGTCGCGT GGTCGGCGCT GCGCGCGGGC
GCCGGCCTGG TGACGCTGGC CTCGCCGTGG ACCAGCGGCG AGGTGTACGC GCCCGACCCG
GTGATGACCG AGGCCTTTGA CGCCGAGGCC GCCGACGCCT TGGAGCGGCT GCTGGCCCTG
GCCGAGGGCA AGCAGGTGGT GGCCATGGGG CCGGGCATGC CGACCTCGGA GGGCGCGCGC
GACCTGGTGC ACGCGGCCCT GGCCGAGCTC GAGGTGCCGA TGGTGCTCGA CGCCGACGCG
CTCAATCACA TCGGCACCCA CCTCGAGCGG GTGGCCACGG CCAAGGCGCC CATCATCCTC
ACGCCGCACC CGGGCGAGGC CGCGCGCCTG CTCGGCCGCA GCTCGGCCGC GGTGCAGAAG
GATCGCGTGG GCGCGGCGCG GGCGCTGGCC GCGCGCTCGG ACGCCATCGT GGTGCTCAAG
GGCGCGCGCA CATTGGTCTG CGTGGACGAC TTCGTCACCG TCAACCCCAG CGGCCACCCG
GCGCTGGCCA CGGCCGGCAC CGGCGACGTG CTCACCGGCC TGATCGCGTC GCTGGTGGCC
CAGGGCGTGT CGCCGGCCGA CGCCGCGCGC CTGGGCGTGT TTCTGCACGG CCGCGTGGGC
GAATACGCGG CCGCGGCGCT GAGCAGCCGC GGCGTGACCT CGGCCGACCT GGCCGAGCAC
ATGCCGCGGG CCATGAACAA CCTCGCCGCC GGAGCGCCGT GA
 
Protein sequence
MRYVVTAEEM QALDRETIEG IGLPGVVLME NAGRAVVRII EDLLAHDGKW SGDSGRSALT 
RDASVSGRPA PRIAVVCGGG NNGGDGYVIG RCLREAGMHV TVYMAARREA VKGDARRHLD
VYENAGGLLV SVTDEASLFT HAERIRNAEV VVDAVFGTGL TREVSGHYRK IIETINQCGG
HRVAVDIPSG LSADTGEVLG IAVNATCTVT MAFLKVGLAT TPGCARCGDL HVAEIGIPDA
LAEKHGIRTA LIEPDDLTPL LPADDAVVHK NRRGHVLAVA GSPGKRGAGR LVAWSALRAG
AGLVTLASPW TSGEVYAPDP VMTEAFDAEA ADALERLLAL AEGKQVVAMG PGMPTSEGAR
DLVHAALAEL EVPMVLDADA LNHIGTHLER VATAKAPIIL TPHPGEAARL LGRSSAAVQK
DRVGAARALA ARSDAIVVLK GARTLVCVDD FVTVNPSGHP ALATAGTGDV LTGLIASLVA
QGVSPADAAR LGVFLHGRVG EYAAAALSSR GVTSADLAEH MPRAMNNLAA GAP