Gene Hoch_5211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5211 
Symbol 
ID8547623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7165422 
End bp7166711 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content66% 
IMG OID646389886 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_003269590 
Protein GI262198381 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCAAT CGACACAGGA GACCATCGCC GTCATCGGTC TGGGTTACGT CGGCCTGCCC 
GTGGCACTGA GCTTCGCCAA GAAGTACCCG ACCATTGGCT TCGACATCAG CGAGCGCCGC
ATCAAGGAGT TGCGCGAAGG CCTGGACGTC ACCGCCGAGG TCGAGAGCGA GGACCTCACG
TCCTCGAAGA TCAAATTCTC GGCGGACCCC GCGGACCTGG CCGAGGCGAC CTTCATCATC
GTCGCCGTGC CCACGCCCAT CGACAAGAAC AACCGCCCGG ACCTAACTCC CGTGATCGGC
GCCTCGCGCA CGGTCGGCAA GAATCTGCGC AAGGGCGCGG TCGTGGTCTA CGAATCGACC
GTGTACCCGG GCGTGACCGA GGAGGAGTGC GGGCCGGTGC TCGAGCAGGA GTCGGGCCTC
AAGGCCGGTG AGGACTTCTT CCTCGGCTAC TCGCCCGAGC GCATCAACCC GGGCGACAAG
GAGCACACCT TCGAGCGCAT CGTCAAAGTG GTGTCCGGCC AGGACGCGGC CACTCTCGAC
CGGGTGTCCG AGGTCTACTC CTCGGTGGTC ACCGCCGGCG TGCACCGGGC CTCGACCATC
AAGGTCGCCG AGGCCGCCAA GGTCATCGAG AACACCCAGC GCGACCTCAA CATCGCGCTG
ATGAACGAGC TGGCCCTGAT CTTCGATCGC GTCGGCATCC GCACCAAGGA CGTGCTCGAG
GCCGCCGGCA CCAAGTGGAA CTTCCTCAAG TTCACGCCCG GCCTGGTCGG CGGACACTGC
ATCGGCGTCG ACCCCTACTA CCTCACCACC AAGGCCGAGG AGCTGGGCTA CCAGCCCGAG
GTCATCCTCG CCGGCCGGCG CATCAACAAC AACATCGGCG CGTTCGTGGC CCAGAAGGCG
CTCAAGCTGA TGGCCGCGGG CAAGGTGCCC CTGCACCTGG CCAAGGTCGG CATCTTTGGC
CTCACCTTCA AGGAGAACGT GCCCGACCTG CGCAACAGCA AGGTGCCCGA CATCGTCCAC
GAGCTGCGCC AGTTCGGCGT GGTGCCGCTG GTTCACGATC CCATGGGCGA CTCCGAGGAG
GCCGAGCACG AGTACGGCAT CAAGCTCTCG CCCTGGGAGG ACATGAGCGA CCTAAACGCC
GCCATCTTCG CGGTCTCGCA CAGCTTCTAC CTCGACCAGG GCATCGAGGC GCTGCTCGCG
CGCGTGCGTC CGGGCGGGGC CTTCGTCGAC GTCAAGTCGG CCTTTCAGCC CAGCGATATC
TCCGACAAAT ACGCCTACTG GAGCCTGTAG
 
Protein sequence
MSQSTQETIA VIGLGYVGLP VALSFAKKYP TIGFDISERR IKELREGLDV TAEVESEDLT 
SSKIKFSADP ADLAEATFII VAVPTPIDKN NRPDLTPVIG ASRTVGKNLR KGAVVVYEST
VYPGVTEEEC GPVLEQESGL KAGEDFFLGY SPERINPGDK EHTFERIVKV VSGQDAATLD
RVSEVYSSVV TAGVHRASTI KVAEAAKVIE NTQRDLNIAL MNELALIFDR VGIRTKDVLE
AAGTKWNFLK FTPGLVGGHC IGVDPYYLTT KAEELGYQPE VILAGRRINN NIGAFVAQKA
LKLMAAGKVP LHLAKVGIFG LTFKENVPDL RNSKVPDIVH ELRQFGVVPL VHDPMGDSEE
AEHEYGIKLS PWEDMSDLNA AIFAVSHSFY LDQGIEALLA RVRPGGAFVD VKSAFQPSDI
SDKYAYWSL