Gene Hoch_5000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5000 
Symbol 
ID8547410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6895772 
End bp6896941 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content73% 
IMG OID646389676 
Productglycosyl transferase group 1 
Protein accessionYP_003269382 
Protein GI262198173 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.704682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTTATG CTCGCGGCCT GATGCGACCG ATGACCATCG CCCACGTGCT GGGCTCGTTT 
GGGATGGGCG GGGCCGAGCG GGTGGCCCTG GATCTTGCGT CTGCGCAGCG CTCGCTCGGA
CTCGATGTCT TGGCCGTATC CTTTGCCGGC GGCTCCGAGG GACCGCTCGG GCCGATGTTC
CGCGAAGCCG GCATTCCCGT CCACACCATC CCCAAGCGGC CGGGTATCGA CCTCACCCTG
CCGCCGCGCG TGGCCCGCCT GCTGGTGCGC GAGCGCGTCG CCGTGCTGCA CACCCACAAC
CCGCAGCCGC TCATCTACGC GGCGCCCGCC GGCCGCGCGG TCGGCACCGC CGTGGTCCAC
ACCAAGCACG GCGAGGGCCA CATGGGCTCG AGCGGCGAGA AGTCCCTGCG CCGGCTGACC
GCCCCCTTCG CCCAGATCTT CGTCGCGGTG TCCGAAGAGA CCGCCGCCCA GGCGCGCGCG
CAGCGCGACT GCCCGCTCGA GCGCCTCACC GTGGTGCCCA ACGGCATCGG CCTGGGCCGC
TTCGCACCCG ACGACGAGAT CCGCCGCGAG GTCCGCGCCG AGCTCGGCAT CTCCGAGGAC
GCCTGGGTGG TGGGCACGGT CGGACGCGTG GACGACAACA AGAACCAGAG CGCGCTGGTG
CGCGCCATGG CGCCGCTGCT CAGCGACGAG GTCCACCTGG TGCTGGTCGG CGACGGGCCG
GCCATGGACA CCTTGCGCGC GGCCCGCGAG GCGGTCGATC GCAGCGATCG CGTGCACATC
CTCGGGCGCC GCACCGACGC CAACCGGCTG TATCGCAGCT TCGACGTGTT CGCCCTGCCC
TCGCTCAGCG AGGGTCTGCC GCTGGTCATC CCCGAGGCCA TGGCCTGCGG TCTGCCGGTG
GTGAGCACGG CCGTCGGCGG CATCCCGGCC GTGGTGCTCG ACGGCGAGAC CGGCCTGCTG
GTGGCCCCGG ACGATGAGGT CACCATGCGC GCGGCGCTCG CCCACCTGGG CTCGCAGCGC
CAGCGGGCGC GCGCTTTCGG ACGCGCGGGT CGGGCACGCG CGCTGAGCGA GTACTCGGTC
GAGCGCATGA GCCGCGACTA CCTGTCGCTG TACGAGCGCG CGCTGGATCA GGCCGGTCCG
CTGACGCGCG CGTGGCGTCG CCGGCCCTGA
 
Protein sequence
MRYARGLMRP MTIAHVLGSF GMGGAERVAL DLASAQRSLG LDVLAVSFAG GSEGPLGPMF 
REAGIPVHTI PKRPGIDLTL PPRVARLLVR ERVAVLHTHN PQPLIYAAPA GRAVGTAVVH
TKHGEGHMGS SGEKSLRRLT APFAQIFVAV SEETAAQARA QRDCPLERLT VVPNGIGLGR
FAPDDEIRRE VRAELGISED AWVVGTVGRV DDNKNQSALV RAMAPLLSDE VHLVLVGDGP
AMDTLRAARE AVDRSDRVHI LGRRTDANRL YRSFDVFALP SLSEGLPLVI PEAMACGLPV
VSTAVGGIPA VVLDGETGLL VAPDDEVTMR AALAHLGSQR QRARAFGRAG RARALSEYSV
ERMSRDYLSL YERALDQAGP LTRAWRRRP