Gene Hoch_5686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5686 
Symbol 
ID8548100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7800850 
End bp7802043 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content75% 
IMG OID646390354 
Productglycosyltransferase, MGT family 
Protein accessionYP_003270056 
Protein GI262198847 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0302336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCA TCCTCATCGT CACCCTGCCC GAACGCGGGC ACTACCATCC CTTGCTCGGC 
CCGGCCGAGG AGCTGAGCCG GCGCGGCGCC GAGGTGGTCT TCGCCTGCTC GCACGACATC
GGCGACGACC TCGCCGATGT CGGCGTCGAG CGCTTCGTGG CCCCGCCCGG GGCCGCGCCG
CTGAGCGACG AGCTGCGCGG CGCCGAGCTG GCCCGTCTGC TGGCCGATCC CGAGGCCCTG
CGCGGCTGGA TCCGCGACAT GCTGGCCATC GGCCCGGCGC GCCACGTCGA GCCCATGCGC
GCGATCGTGC GCGAGCTGCG CCCCGACGTG GTCGCCATCG ACGCCATGGC CTACGAGGGC
GCCATCGCCG CCGAGCTCGA GGGCGTGCCC TGGGTGGGCT GGGCGACCTC GCTCAACCCG
GCCATACCCG CGTGGCTCGA CAGCGAGCTC ATCCGCACCC TGCGCGCGCT CGATCCCGTG
CGCCACGCGC TGTTCGCCGA GTTTGGCCTG CGCGCGCGCT TCCGCGTCAG CGACGTGTTG
TCGCCGCGCG GCACCGCGGT GTTTGCGACC GACGCCCTGA TCGCGCCGAC GCGCGCGGGT
GAGCCCGTGG ACGACGACGT CCACCTGGTC GGCCCCTCGC TCGGCGGCCG GCGCTCGCAC
GCGTCCCCGG ACCTGGGGTT TGCGGACGGT CGTCCGCTTA TCTACGCGTC CTTTGGCAGC
CAGGCCTGGC ATCAGCCGCA GCGCTTCGAG CGCCTGTTCG AGGCCGCGCG CACGCTCGAC
GCCGCGCTGC TGGTCGCGGC CGGCGATCTC GCCGCCGAGT ACGCTGCCCA GAACTTGCCC
GCGCACGTGC GCTGCGTGCC CTTTGCCCCG CAGCTCGAGG TCCTCGCCCA CGCCCGCGCG
CTGGTCACCC ACGGCGGCGC CAACTCGGTC ATGGAGGCCC TGGCCGCGGG CGTGCCGCTG
CTGGTGGCGC CGCTGTGCAA CGACCAGCCG CACAACCGCC TGTTCGTCGA GCGCGCCGGC
GCCGGCCTGG GCATCGACCT CGACACCTGC GCCCAGGATG CCCTGCTCTC GGCCCTGCGC
GCGCTGCTCG CCGACGGCCG CGAGCGCACA GCCGCCCAGC GTATCGCGGC CAGCTACGCG
GCCAGCGATG GCGCCCGCGG AGCCGCCGAA CTCGCCCTGC GATGCTGCCC GTAG
 
Protein sequence
MTRILIVTLP ERGHYHPLLG PAEELSRRGA EVVFACSHDI GDDLADVGVE RFVAPPGAAP 
LSDELRGAEL ARLLADPEAL RGWIRDMLAI GPARHVEPMR AIVRELRPDV VAIDAMAYEG
AIAAELEGVP WVGWATSLNP AIPAWLDSEL IRTLRALDPV RHALFAEFGL RARFRVSDVL
SPRGTAVFAT DALIAPTRAG EPVDDDVHLV GPSLGGRRSH ASPDLGFADG RPLIYASFGS
QAWHQPQRFE RLFEAARTLD AALLVAAGDL AAEYAAQNLP AHVRCVPFAP QLEVLAHARA
LVTHGGANSV MEALAAGVPL LVAPLCNDQP HNRLFVERAG AGLGIDLDTC AQDALLSALR
ALLADGRERT AAQRIAASYA ASDGARGAAE LALRCCP