Gene Hoch_3804 details

Gene Information

Locus tag: Hoch_3804
Symbol:
ID: 8546197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5222973
End bp: 5224136
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 72%
IMG OID: 646388474
Product: metallophosphoesterase
Protein accession: YP_003268197
Protein GI: 262196988
COG category: [R] General function prediction only
COG ID: [COG1409] Predicted phosphohydrolases
TIGRFAM ID:
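
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(5222973, 5224136)
print(length, protein_length_aa(length))  # 1164 387
```

Both values agree with the Gene Length and Protein Length fields of the record.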


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0698095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.131699
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAC CGCACGCCCG CCTCGTCCTC GTCCTCGCGT CGCTGTCGCT GTCGCTGGCG 
GCCGCGCCTG GTTGTCTGCG CGAGGGTCGC GAGCGCGCGC GCGCCGATCT CGAGGTCGGC
CAGGTCGCGC TCGCCGAGGT CGCGCTCGCG GTCGACGATG GCCTGGCCCA CGTCCGCGCG
CTCACGCCGG GCAGCGACGC CGAGCCGGGC GCGATCGATC TCTGGGGGTC GGCCCCGGAT
TTCCGCCTGA GCCTGCGCGC GCCCGCGGGC TCCTCGTGGT TACTCACGCT TGAGAACGCC
ATGCCCGACG CCGAGCTGAG CGCGCTCGGC GAGGCCGACG GGCTGGCCAT CGACGCGCTC
GAGGGCCCGC GGCCCACGGT GCGGCGCTGG TCGCTGCGCC TGGCCGAGGA CCCGGGCCAG
CCCGGCGCCG AGCGCGCGCT GCGGCTGCGC GTGGCCCCGC CCGACGCCGA CGCCCGCGCC
AGCGCCGGCC AGCCCTGGCG CTTTGCCGTC ATGGGCGACA TCCAGCGCGC GCTGCCCGAG
GTCGACGACA TCTTCGCGCT CATCAACGAA GACCCCAGCG TCCGCTTCGT CGCCTCCACC
GGCGACCTGG TCGATGGCGG CGAACACGAG GAGTACGAGC TGCTCGAGGA GCAGCTCGCG
CTCCTCGAGG TGCCGTACTT CTCGACCATC GGCAACCACG AGCTGTTCGG CCCGGCCGAG
CGCTGGAGCA GCCGCTTCGG CCGCTTCAAC CTGCACTTCC GCTTCAAGGG CGCCGCCTTC
TCGCTCATCG ACTCCGGCAA CTCGAGCATC GATCCCATGG TCTACGACTG GCTGGGCGAG
TGGGCCGAGG ACGCGCGCGA CGACGTGCAC TTCTTCTTCA CGCACTTTCC CGCGGTCGAT
CCCGTGGGCG TGCGCGCTGG CTCGCTGCGC TCCTCGAGCG AGGCCCGCAA GCTGCTCGCC
GTCCTCGCCG AGGGCGCCTT CGACGTCACC TTCTACGGCC ACATCCACTC CTACTACGCC
TTTGAAAACG CCGGGATTCC GGCCTTTATC TCCGGCGGCG GCGGCGCCAT CCCCGAGCGC
TGGGACGGCA TCGGTCGGCA CTTCCTCACC GTCGATGTCG GCCCCGAGGC CGTACGCGCG
GTCTCGCTCG TGCGCGTGGA ATGA
 
Protein sequence
MSTPHARLVL VLASLSLSLA AAPGCLREGR ERARADLEVG QVALAEVALA VDDGLAHVRA 
LTPGSDAEPG AIDLWGSAPD FRLSLRAPAG SSWLLTLENA MPDAELSALG EADGLAIDAL
EGPRPTVRRW SLRLAEDPGQ PGAERALRLR VAPPDADARA SAGQPWRFAV MGDIQRALPE
VDDIFALINE DPSVRFVAST GDLVDGGEHE EYELLEEQLA LLEVPYFSTI GNHELFGPAE
RWSSRFGRFN LHFRFKGAAF SLIDSGNSSI DPMVYDWLGE WAEDARDDVH FFFTHFPAVD
PVGVRAGSLR SSSEARKLLA VLAEGAFDVT FYGHIHSYYA FENAGIPAFI SGGGGAIPER
WDGIGRHFLT VDVGPEAVRA VSLVRVE
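
The relationship between the gene sequence, the GC content field and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool that produced the record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the amino-acid assignments of NCBI translation table 11, which match the standard code and differ only in the permitted start codons. The sequence is assumed to be given in the 5' to 3' coding orientation, as listed above.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons ordered TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to lay out sequence blocks."""
    return "".join(seq_text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_text)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGCACAC CGCACGCCCG CCTCGTCCTC GTCCTCGCGT CGCTGTCGCT GTCGCTGGCG"
print(translate(first_line))  # MSTPHARLVLVLASLSLSLA
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be compared with the protein sequence and the GC content field listed in the record.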