Gene Hoch_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1350 
Symbol 
ID8543732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp1792537 
End bp1793823 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content71% 
IMG OID646386064 
Productglycosyl transferase family 28 
Protein accessionYP_003265799 
Protein GI262194590 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0669225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTA TTTTGATTAT GACCTACGGC ACCCGAGGCG ACATCGAGCC GTTCCTCGCG 
CTCGCGCTCG GGTTGAAAGA CGCCGGCCAC GACGTCACGC TCGCGACTGC CGAGCAGTTC
GGCGACTGGG TGTCCGATTT CGGCATCGAC TACGCGCCGA TCACCAACGC GTCGCTCGAC
GTGATCCACT CCGAGGACGG CAAGACCATG CTCGAGGGCG GCGCGGGGCT GTTCGCGCGG
ATCGCGGCCG GCGCTCGTCT CGCGCGGCAA TCGCGGGCGA TCAACGAGCA GCTTTGCCGG
GACGCGTGGC GCGCGGCCGA AGCCACGCGG CCCGAGCTGA TCCTCTATCA CCCCAAGATC
ATGGCCGCAC CGCACATCGC CGAGAAACTC GCTATCCCCG CCATCCTGGC CGCCCTGCAG
CCGATGCTGG TGCCGACCGC CGCGTTCCCG GTCACCGGCC TGCCGCGCCT GCCCGTCCCC
GGCTACAACC GCTTTAGCTA TCGCTTCGTC AACCTGTCGT ACGGGGCGCT GAAGGGCTCG
GTGAACCGTT TCCGGCGCGA GCTGCTCGGC CTTGCGCCGG TGCGCCGCGC CGCCGAGGTG
CTGCGGCCGC CGAGCGCGCG CGCGAGCAAA GTGCTGCACG GCCTGAGCCC GCAGGTCATC
CCGAGGCCGG ACGACTGGCC CAATTACGCC ATCATGAGCG GCTACTGGCC GCTGCCGCCA
GACCCGGCGT TTACGCCCCC CGATGAGCTG CTGCGCTTTC TCGACGCCGG GCCGCCGCCC
GTGTACGTGG GCTTCGGCAG CATGGTCTCG AAGGATCCCG AGGCGCTCGC CGAGCTGGTG
GTCGAAGCGC TGCGCCTGGC CGGCGTGCGC GGCGTGCTCG GAGCTGGCTG GGCCGGGCTG
GCGGCCGATG CCGACGGGGT CGTGGCGGTC CGCGATATCC CCTACGGCTG GCTATTCCCG
CAGATGGCGG CCGTGGTTCA CCACGGCGGC GCGGGCACCA CGGCCGCGGG ATTTCGCGCT
GGCGTGCCGT CGGTCATCTG TCCGTTTTTC GGCGATCAGC CCGGCTGGGC CGCGGCCAGC
GTCGCGCTCG GTGTCGGCGC GCCGCCCGTG CCCCGCAAGC GCCTCAGCGC CGAGCGACTG
GCGGCGTCGA TCCGAGTGGC GACCAGCGAC CAGACGCTCA AGCGCAATGC CAAGCGTCTC
GCCGCCGCGC TCGACGCCGA GGACGGCATC GCGGTCGCCA TCGCCGAGAT CGAAGACACG
CTGCAGCAGG CCGCCTCTGC AAGCTGA
 
Protein sequence
MARILIMTYG TRGDIEPFLA LALGLKDAGH DVTLATAEQF GDWVSDFGID YAPITNASLD 
VIHSEDGKTM LEGGAGLFAR IAAGARLARQ SRAINEQLCR DAWRAAEATR PELILYHPKI
MAAPHIAEKL AIPAILAALQ PMLVPTAAFP VTGLPRLPVP GYNRFSYRFV NLSYGALKGS
VNRFRRELLG LAPVRRAAEV LRPPSARASK VLHGLSPQVI PRPDDWPNYA IMSGYWPLPP
DPAFTPPDEL LRFLDAGPPP VYVGFGSMVS KDPEALAELV VEALRLAGVR GVLGAGWAGL
AADADGVVAV RDIPYGWLFP QMAAVVHHGG AGTTAAGFRA GVPSVICPFF GDQPGWAAAS
VALGVGAPPV PRKRLSAERL AASIRVATSD QTLKRNAKRL AAALDAEDGI AVAIAEIEDT
LQQAASAS