Gene Hoch_5072 details

Gene Information

Locus tag: Hoch_5072
Symbol:
ID: 8547483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6992525
End bp: 6993592
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 71%
IMG OID: 646389748
Product: polysaccharide deacetylase
Protein accession: YP_003269453
Protein GI: 262198244
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
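
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(6992525, 6993592)
print(length)                     # 1068
print(protein_length_aa(length))  # 355
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.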


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGAAGAACC GCGGCGAGCA TCTAGAAGCG CGAGGCGTAC GTGCGCGGCT CGCGCGCCCC 
GGGCTGTTCG CGCTCACCTT GACCCTGGCG CTGAGCTGCG CGGCGGCCTT GTGGGCGGCG
CCGCCGACCG GCGTGGCCAG CGCCGCGCCC CGTCTCGACA ACGCCCACAC GGCGTCGCCG
CAGCCCGCCG AGCCCGGTCT CGCGGGCGCC GCGGCACAGG CCGACGAGGA GTACGTGGAC
CCGGACGAGC TCGGCGACTT CGACGACGGC GAGGCCGAGG AGGAGGAGCA CCGCGGCAGC
GAGCAAGAGC GCCGCGGCTG GCCCCATCCC GCGGCCGGCA AGTCGGCCAG CGGCGGCCCC
GAGGTGGTGT TCACCTTCGA CGACGGCCCG CATCGCAAGT ACACGGCCGA GATCCTCGAC
GAGCTCCAGG AGCGCGACAT CCAGGCCATC TTCTTCTGGG TCGGCCACCG CGTCACCAAG
GGCGGCGGCG TGGCCCAGCA GCGCGCCCTG GTCGAGCGCG CGGTGCGCGA GAATCACCTG
GTCGCCAACC ACACCATCAC CCACGCCAAC CTGTGCCAGA TCCCGCGCGA CGAAGCCGCC
CACGAGATCG ACGAGAACGG GCGCATCTAC GCGGAGCTGA GCGGCCTGCC GCTGCACCTG
TTCCGCTCGC CCTACGGCGC CTACTGCCGC AATCTGGTCG GCCTGCTCGA GGAGCGCTCC
ATGCAGCACA TGCACTGGGA CATCGACCCG CGCGAGTGGG AGCACCACAG CAAGGAGTGG
GTGGTCAACT ACGTCACCAC GCGCCTGCGC CGTCTCGACG GCCGCGCCGT GGTGCTGCTG
CACGACACCA AAGCCGCCTC GGCGCGCGCG CTGCCCGAGA TCCTCGACTG GATCGACAAG
GAAAACCAGC GCCGCGACCA GCGCGGCAAC AAGCTGCCCA TCCGCATCCT GTCGGGCTCG
GACCTATTGT TCGAGCGCAT CTCGCCCGGG CTGACCAGCT TCATCACCGC GACCGCCGAG
CGCTCCGTGA GCGCGGTGGC CAACGCGGTG ACTCGCCTGG TCCCCTGA
 
Protein sequence
MKNRGEHLEA RGVRARLARP GLFALTLTLA LSCAAALWAA PPTGVASAAP RLDNAHTASP 
QPAEPGLAGA AAQADEEYVD PDELGDFDDG EAEEEEHRGS EQERRGWPHP AAGKSASGGP
EVVFTFDDGP HRKYTAEILD ELQERDIQAI FFWVGHRVTK GGGVAQQRAL VERAVRENHL
VANHTITHAN LCQIPRDEAA HEIDENGRIY AELSGLPLHL FRSPYGAYCR NLVGLLEERS
MQHMHWDIDP REWEHHSKEW VVNYVTTRLR RLDGRAVVLL HDTKAASARA LPEILDWIDK
ENQRRDQRGN KLPIRILSGS DLLFERISPG LTSFITATAE RSVSAVANAV TRLVP
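
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as TTG is read as methionine at the first position. The following is a minimal sketch of that translation using only the Python standard library; the helper names are hypothetical, the start codon set is the one NCBI lists for table 11, and the sketch assumes the input is a complete coding sequence on the coding strand.

```python
# Standard amino acid assignments (shared by NCBI tables 1 and 11),
# listed in TCAG order for the first, second, and third codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at the first stop."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for idx in range(0, len(dna), 3):
        codon = dna[idx:idx + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is translated as methionine.
        protein.append("M" if idx == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGAAGAACC GCGGCGAGCA TCTAGAAGCG"))  # MKNRGEHLEA
```

The printed residues match the first ten residues of the protein sequence above; passing the full gene sequence to `translate_cds` would allow the same comparison over the whole record.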