Gene Hoch_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1971 
Symbol 
ID8544353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2720956 
End bp2722179 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content78% 
IMG OID646386675 
Productglycosyl transferase group 1 
Protein accessionYP_003266410 
Protein GI262195201 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.190227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.341888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATC GTCCCACCGA GCCCGCGCCG CCCGCGCCCC TGCGCCTGCT GCTGGCGGTC 
ACCGATCCCC GCAGCACCGG CTTTCTGCGC GGTCAGCTCG CGGCCGCGCG CGCCGCCGGC
TTCGAGGTCA GCCTGCTCAG CGGTCCCGGC CCCGCCGCCC GCGCGCTGGC CGCCGCCGAG
CACGCGCGCC TCTACGAGAT CCCCATGGCG CGCGCGATCG CGCCAGCCCG CGACCTCGTC
GCCCTGGCCC GCGTGGCCCG GGCGCTGGCG CACGCGCGGC CGCACATCGT CAACGCCGGC
ACGCCCAAGG CGGCCCTGCT CACCCTGCTC GCGGCCCGCG CGCTGGGCGT GCCCTGCCGC
ATCCACACCC TGCACGGCCT GCGCGGCGAG ACCCTGCGCG GCGCCCGCCG GCGCCTGCTC
GACGGCCTCA CCCGGGTCAC CTCGGCGCTC GCCCAGCGGG TCATCTGCGT GAGCCCCAGC
CTGGCGCGCG AGGCCGTGGC CGCGGGCGTG GCCGCGCCCG CCCAGGTGCT CGTGCTCGGC
CGCGGTAGCG CCAACGGCAT CGATCTCGAG CGCTTCTGCC CCAGCCCCGA GCACGCCGCC
GCCGGCCGCG CCCTGCGCGC GCGCTGCGGC ATCCCGGACG GCGCCCGGGT GCTGGGCTTT
GTCGGACGAC TGGCCGACGA CAAAGGCGTG GCCGAGCTGG CGCGGGCCTG GAGCGGGCTG
CGCCGGCGCT TTCCCGACCT GCACTGGCTC GTGCTCGGCG CGCCCGACGA CACCGACCCC
GTGCCCGCCG AAGTTCTCGA CCAGATGTCC CAAGACCCGC GCGTGCACTG CCTGGGTCAG
GTCGCCGACC CGCGGCCGGC GTACGCGGGC ATGGACGTGC TGGCGCTGCC CACGCGCCGC
GAGGGCCTGG GCTACGCGCT CATCGAAGCC GCCGCCTTCG AGCTGCCGAG CGTGGCCACG
CGGGTCACCG GCTGCGTCGA TGCGGTGCAC GACGGCGTCA CCGGCACCCT GGTCGCGCGC
GGCGATACCC GGGCGCTGGC AGCGGCGCTG GCCGCGTACC TCGACGATCC CGCCCTGCGC
CGCCAGCACG GCCACGCCGG ACGCGCCTTT GTCGCCGCGC ACTTCGAGCA GCGGGCGCTG
TGGGCGCGCC TGCACCGCGA GTACGCGCGC CTGGCCGCGG CCGCCGCGCT GCCCGGCGCC
GCCGCGCTCA CGACACGCTT CTGA
 
Protein sequence
MNHRPTEPAP PAPLRLLLAV TDPRSTGFLR GQLAAARAAG FEVSLLSGPG PAARALAAAE 
HARLYEIPMA RAIAPARDLV ALARVARALA HARPHIVNAG TPKAALLTLL AARALGVPCR
IHTLHGLRGE TLRGARRRLL DGLTRVTSAL AQRVICVSPS LAREAVAAGV AAPAQVLVLG
RGSANGIDLE RFCPSPEHAA AGRALRARCG IPDGARVLGF VGRLADDKGV AELARAWSGL
RRRFPDLHWL VLGAPDDTDP VPAEVLDQMS QDPRVHCLGQ VADPRPAYAG MDVLALPTRR
EGLGYALIEA AAFELPSVAT RVTGCVDAVH DGVTGTLVAR GDTRALAAAL AAYLDDPALR
RQHGHAGRAF VAAHFEQRAL WARLHREYAR LAAAAALPGA AALTTRF