Gene Hoch_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0220 
Symbol 
ID8542599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp322361 
End bp323503 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content71% 
IMG OID646385016 
Productglycosyl transferase group 1 
Protein accessionYP_003264754 
Protein GI262193545 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCAC CCACTCGAAT CCTGCACGTC TTTGGCCGCA TGGGTCGCGG CGGCGCCGAG 
ACCTGGCTGA TGAACGTCTG GCGGCACATC GACCGCGAGC GCTTCCTGTT CGACTTCCTG
GTCCACGACA GCACGCCGGG CGAGTTTGAC GACGAGCTGC GCAGCCAGGG CAGCCGCATC
CTGGTGCGCC CCTACGCGCG CGACCCGGTG CGCTACTCGG CGGCCATGGC CGAGGCCGTG
CGCGCGCGCG GCTACACGGT CATCCACAGC CACGTCGCCT ATTTCTCCGG CTGGGTGCTG
GGGCTGGCGC GCGGCCTGGG CATCGCCGGC CGCATCGCGC ACAGCCACAA CGACTTCGCC
CAGGCCCAAC GCGCCTGGCC CCGGGCCGCG TATGAGGGCG TGATGCGCAA GGCCATCCAG
ATGCACAGCA CCCGCGGCCT GGCCTGCTCG CTGCCGGCGT GCTCGTCGCT GTTCGGCAGC
GAGTGGCAGG CGCAGGGCAA ATACGAGGTG CTGCACTACG GCTACGACTT CTCGCGCTTC
ACCTCCGAGC ACATCGGCGG CGAGCTGCGG CGGGTCACCC GGCGCGCGCT GGGCATCGAC
GAGGACGCCT TCGTCATCGG CCACATCGGC CACTTCTCGC GCCAGAAGAA CCACGAGTTC
ATCGTCGAGC TGGCCGCGGC CCGGCGCCTG CGCGGCCACA CCCGCGATCG CTACGTGCTG
GTCGGCGGCG GCGGTTTGCG CGACTCGATC GAGCGCAGCG TGGTCGAGCG CCAGCTCGGC
GAGGCCTTCG TGTTCACCGG CGTGCGCGCC GACGTCCCCG AGCTGCTCGC GGCCTTTGAT
GTCATGATCC TGCCCTCGCG CTGGGAGGGC CTGGGCATCG TGGTGCTCGA GGCGCAGGCC
TCGGCCGTGC CCTCGCTGGT GAGCGACCGC GTGCCCGAGG AGGCCGTGGT CATCGACGAC
CTGGTCACGC AGGCGCCGCT CGAGACCGGC GCCTGGCTCG AGGCCCTGGC CGAGATCGAG
AATCAGCCGC GGCCGCAGCG CGACCGCGCG GTGGCCGCCA TGCGCGAATC GCGCTTCGGT
CTCGAGCGCA ACGTCGAGCG CCTGAGCGCG ATCTACGAGC AAGAGGCCGC GCGCGCCCGC
TGA
 
Protein sequence
MSSPTRILHV FGRMGRGGAE TWLMNVWRHI DRERFLFDFL VHDSTPGEFD DELRSQGSRI 
LVRPYARDPV RYSAAMAEAV RARGYTVIHS HVAYFSGWVL GLARGLGIAG RIAHSHNDFA
QAQRAWPRAA YEGVMRKAIQ MHSTRGLACS LPACSSLFGS EWQAQGKYEV LHYGYDFSRF
TSEHIGGELR RVTRRALGID EDAFVIGHIG HFSRQKNHEF IVELAAARRL RGHTRDRYVL
VGGGGLRDSI ERSVVERQLG EAFVFTGVRA DVPELLAAFD VMILPSRWEG LGIVVLEAQA
SAVPSLVSDR VPEEAVVIDD LVTQAPLETG AWLEALAEIE NQPRPQRDRA VAAMRESRFG
LERNVERLSA IYEQEAARAR