Gene Hoch_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2003 
Symbol 
ID8544385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2763553 
End bp2764863 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content69% 
IMG OID646386706 
ProductGlucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_003266441 
Protein GI262195232 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.528812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCCT CGTCATTTGC CCCACGTCCC GATCAACCTA CCGCGGGCGT GCGATTCGGA 
CACCTGGACT CGATCCGCCG CTCGCTGGCG ACTGCCCGCG CGCTGCCGGT GATTGCCATT
CTGGCGCTGC TCGGCGGCGG CTCCAGTTGT CGCACGAACA ACCCCGCCGA CGAGATCCCG
CCGCCGCCGA CGCCGCCGCC GACAACAAAG GCGACGGCAA CGCCGGAGTC GGCGGAGACG
GAGGAGACGG CGGAGACGGC GATGATCAGC GCGCTCGGCG GCAAGCTGCG CGTGCCCAAG
GGCTTCCGCG TCGAGGTCTT CAGCAAGGAG GTGCCCAACG CCCGCGGCAT GGCGCTGGGC
CCCGAGGGCA CGCTGTTCGT GGGCTCGCGC CAGGCCGGCA AGGTCTACGC GGTGGTCGAC
GAGGACGGCG ACGGCCGCGG CGACCGCGTG CACACCATCG CCGAGGGCCT GCAGATGCCC
GTGGGCCTCG ATGTCCGCGA CGGCGCTCTG TACGTGTCGG CGACCGATCG CGTGCTGCGC
TTCGACGGCA TCGAGACCAA GCTCGACAGC CCGCCGACCC CGGCCGTGGT CTCCGAGGCC
TTTCCCGACG ACACCCACCA CGGCTGGAAG TTCATCCGCT TCGGCCCCGA TGGCTGGCTC
TACGTGCCCG TGGGCGCGCC CTGCAACATG TGCCTCGAAG AGGACGAGCG CTACGCTAGC
ATCATGCGCA TGAAGCCCGA CGGCAGCGCG CTCGAGGTCT ACGCTCACGG CGTGCGCAAC
ACCGTGGGCT TCGACTGGCA CCCCGAGAGC GGCGCCATGT ACTTCACCGA CAACGGCCGC
GATATGCTCG GCGACGACCT GCCGCCCGAC GAACTCAACC GCGCGTCCGA AAAAGGCCAG
CACTTCGGCT ACCCCTTCTG CCACGCCGGC ACCATCGCCG ATCCTGAGTT CGGCGAGCAG
CGGCCGTGCC GCGAGTTCGT GCCCCCGGTG CAGAAGCTCG GGCCGCACGT GGCCGCGCTG
GGCATGCGCT TCTACACCGG CACGCAGTTC CCGGCCGAGT ACCGCGGCGC CATCTTCCTC
GCTGAACACG GCTCGTGGAA TCGCTCTGAG CCCATCGGTT ACCGCGTGAG CGTGGTCAAG
CTCGACGGTG AGGGCAACGC GACCAGCTAC GAGCCCTTCG TCGAGGGCTG GCTGCGCGAG
GGGGAGGCCT GGGGACGGCC CGTGGACGTG CTGGTCATGC CCGATGGCGC GCTGCTGATC
TCCGACGATC GAGCTGGTTG GATCTATCGC GTCAGCTACG AGGCCGGTTG A
 
Protein sequence
MRSSSFAPRP DQPTAGVRFG HLDSIRRSLA TARALPVIAI LALLGGGSSC RTNNPADEIP 
PPPTPPPTTK ATATPESAET EETAETAMIS ALGGKLRVPK GFRVEVFSKE VPNARGMALG
PEGTLFVGSR QAGKVYAVVD EDGDGRGDRV HTIAEGLQMP VGLDVRDGAL YVSATDRVLR
FDGIETKLDS PPTPAVVSEA FPDDTHHGWK FIRFGPDGWL YVPVGAPCNM CLEEDERYAS
IMRMKPDGSA LEVYAHGVRN TVGFDWHPES GAMYFTDNGR DMLGDDLPPD ELNRASEKGQ
HFGYPFCHAG TIADPEFGEQ RPCREFVPPV QKLGPHVAAL GMRFYTGTQF PAEYRGAIFL
AEHGSWNRSE PIGYRVSVVK LDGEGNATSY EPFVEGWLRE GEAWGRPVDV LVMPDGALLI
SDDRAGWIYR VSYEAG