Gene Hoch_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3072 
Symbol 
ID8545460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4238800 
End bp4240470 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content72% 
IMG OID646387743 
ProductCarbohydrate kinase, FGGY-like protein 
Protein accessionYP_003267471 
Protein GI262196262 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.317516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.334528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTCT CCGACTCCAG ACCGGACTCT GCGCCCAACT CCGCGAATGA CTCCGCCCGC 
GACGCTGTGC GCGCATCGGT GCGCACGCCG GCGCGCGACC ACGTCCTGGC CATCGACATG
GGCACCTCGG GGCCCAAGCT GTGCCTGGTC GATGCGCGCG GCGAAATTCT CGGCCACGAG
TTCGAACCCA CCGAGCTGCT GCTCTTGCCC GGCGGCGGCC AGGAGCAGCG CCCCGAGGAC
TGGTGGGCGG CCATCGACAC CGCGGCCAAG CGCCTGCTGG CCCGCGGGCT GGTCGCGCCC
GAGCGCATCG GCGCCATCGC GGCCACGGCG CAATGGTCGG GCACCGTGGC CGTGGACGCC
GAGGGCCAGC ACCTGGCCAA CGCCATCGGC TGGATGGACA CCCGCGGCGC CGCGCACATC
GCGCGCATCA CCGACGGGCT GATCAAGGTC GAGCGCTACG GCCTGAGCCG CGTGCTGCGC
TGGATGCGTC TCACCGGCGG CGCCCCCGGC CACGCCGGCA AAGACTCGCT GGCGCACATC
TTGTGGTTCC AGCACGAGGC GCCCGCCGTG TACCAGCGGA CGCACAAATT CCTCGAGCCC
AAGGACTGGA TCAACCTCCG GCTCACCGGC CGCTTCGCCG CCAGCTACGA CTCCATCGCC
CTGTACTGGG CGACCGACAA CCGCGATCCC GCGCGCGTCG ACTACCATCC CAAGCTGCTG
GCCATGAGCG GGCTCGAGCG CGCCAAGCTG CCCGACCTCA AAGCCGCGAC CGAGCTGCTG
GGGCCGCTGC GCCCCGAGCT GGCGGCCGAG TGGGGCCTGT CGCCCGCGGT GCAGGTGGTC
ATGGGCACGC CCGACGTCCA CGCCGCCGCG CTCGGCTCGG GCGGCGTCGG CGACTTCGAG
CCGCACGTGT ACATCGGCAC CTCGAGCTGG CTGAGCTGCC ACGTGCCCTT CAAGAAGACC
GACATCGGCG GCAACATGGC CTCGCTGCCG GCCGCGATTC CCGGGCGCTA TCTGGTGCTC
AACGAGCAGG AGGCCGCGGG TGCGTGCCTG AGCTGGCTGC GCGACCGGGT GTTCTTCCCC
GCCGATGAGG GGGCGCTGGG CCTGGGCACC GGGCCCGCGC CCGCCGACGC GTACGCGCGT
TTCGACGCGC TGGCCGAAAA CGCCGCCCCG GGCAGCGGCA AGCTCTTGTT CTGCCCCTGG
CTGGTCGGCG AGCGCACGCC CGTCGAGGAC CACCACGTGC GCGGCGGCTG GTTCAACGCC
TCGCTGGCCA GCACCCGCGC CGAGCTGGTG CGCTCGGTGC TCGAGGGCGT GGCGCTCAAC
ACCCGCTGGC TGCTGGGCGG GGTCGAGCGT TTCTGCAAGC GCCGCTTCGA CACCCTGCGC
ATCATCGGCG GCGGCGCGCG CTCGGCGACC TGGTGCCAGA TCTACGCCGA CGTGCTCGCG
CGGCCCATGC TGCAGATGGC GGCGCCGCTC GAGGCCGGCG CCCGGGGCGC CGCCGCGCTG
GCCCTGGTCG CGCTCGGCCA GCTCGATTTC GCAGGCTTTG GCGCGAGCGT GCGGGTCGAG
CGTCGCTTTG AACCGCGCGC CGACAACCGC GTCATCTACG ATGAACTGTT CGCCGAATTT
ACCGCCCTGT ACCGGCGTAA TCGCGCCGCG TGGCGGCGTT TGAACGCCTG A
 
Protein sequence
MTVSDSRPDS APNSANDSAR DAVRASVRTP ARDHVLAIDM GTSGPKLCLV DARGEILGHE 
FEPTELLLLP GGGQEQRPED WWAAIDTAAK RLLARGLVAP ERIGAIAATA QWSGTVAVDA
EGQHLANAIG WMDTRGAAHI ARITDGLIKV ERYGLSRVLR WMRLTGGAPG HAGKDSLAHI
LWFQHEAPAV YQRTHKFLEP KDWINLRLTG RFAASYDSIA LYWATDNRDP ARVDYHPKLL
AMSGLERAKL PDLKAATELL GPLRPELAAE WGLSPAVQVV MGTPDVHAAA LGSGGVGDFE
PHVYIGTSSW LSCHVPFKKT DIGGNMASLP AAIPGRYLVL NEQEAAGACL SWLRDRVFFP
ADEGALGLGT GPAPADAYAR FDALAENAAP GSGKLLFCPW LVGERTPVED HHVRGGWFNA
SLASTRAELV RSVLEGVALN TRWLLGGVER FCKRRFDTLR IIGGGARSAT WCQIYADVLA
RPMLQMAAPL EAGARGAAAL ALVALGQLDF AGFGASVRVE RRFEPRADNR VIYDELFAEF
TALYRRNRAA WRRLNA