Gene Hoch_3176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3176 
Symbol 
ID8545564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4375417 
End bp4376337 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content72% 
IMG OID646387843 
Productprotein of unknown function DUF58 
Protein accessionYP_003267571 
Protein GI262196362 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.410481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.342999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTTCC TCGACCCCAG CGCGCTCACC CAGGTGAGCG GCATGCTGCT GCGCGCCCGC 
TTGATCGTGG AGGGCGCGCT CACGGGCGCG CACAGGGCGC GGCTGCGGGG CTCGTCGGTG
GAGTTCGCCG AGCACAAGGA GTACGCGCCC GGCGACGAGA TTCGCCACAT CGACTGGAAG
GCCTACGCCA AGGTCGACCG CTACTACGTC AAGCAGTTCG AGCAGGAGTC GCAGCTCACG
GCGTACCTGG TGCTCGACAC CTCGGCGTCG ATGGACTACG CGGGCGAGGG GCTGAGCAAG
CTGCGCTACG CGGCCTATCT GAGCGCGGCG CTGGCGTATC TGCTGGTGCA GCAGCGCGAT
CGCGTGGGGC TGCTGCCCTT TGGCCAGCTC GACAGCGGCG GCTACGTACC GCCCCGGGCC
CAGCCCGCGC ATCTGCGCAC GCTGCTCGGG TCGCTCGAGG AGTTGTGCGA GCGCGGCGGC
GCCGGCGACG CCTCGGCCGC GGCCGCTCTC GACCGGGTGG CCGAAATCGC CGGTCGGCGG
CGCGCGCTGA TCGCGGTGTG CTCGGATCTC TTTGCCGCCG AGGGCGACGG CCTGGCGGTG
CTGCGGCGGC TGCAGGCGCG CGGCCACGAC GTGGTGGTGT TTCACGTGCT CGACCCCGAT
GAGCTGTCGT TTCCCTTCCG CGGCCTCACG CGCTTCGAAT CGCTCGAGGA TGAGCGCGTG
CTGCTGGCCG AGCCGGAGTC GCTGCGGCGC GCGTATCTGC GCCGGCTCGA GGCGTTTCTG
GCGCGGGTCG AGCGCGGCTG TGCCGACAGC GGCGTGGGCT ATCACCGGGT GCCGACCTCG
CAGCCGGTCG AGCGCACGCT GCTCGAGTTT CTCGAGACGC GCGCGCGGCT GCGGGGGGGA
GGGCGAACGT GGAGTTCCTA G
 
Protein sequence
MSFLDPSALT QVSGMLLRAR LIVEGALTGA HRARLRGSSV EFAEHKEYAP GDEIRHIDWK 
AYAKVDRYYV KQFEQESQLT AYLVLDTSAS MDYAGEGLSK LRYAAYLSAA LAYLLVQQRD
RVGLLPFGQL DSGGYVPPRA QPAHLRTLLG SLEELCERGG AGDASAAAAL DRVAEIAGRR
RALIAVCSDL FAAEGDGLAV LRRLQARGHD VVVFHVLDPD ELSFPFRGLT RFESLEDERV
LLAEPESLRR AYLRRLEAFL ARVERGCADS GVGYHRVPTS QPVERTLLEF LETRARLRGG
GRTWSS