Gene Hoch_0106 details

Gene Information

Locus tag: Hoch_0106
Symbol:
ID: 8542477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 165240
End bp: 166316
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 70%
IMG OID: 646384894
Product: histidinol-phosphate aminotransferase
Protein accession: YP_003264640
Protein GI: 262193431
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
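
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes inclusive 1-based coordinates and that the gene length counts the stop codon; the function names gene_length and protein_length are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(165240, 166316)
    print(length)                  # 1077
    print(protein_length(length))  # 358
```

Both results agree with the Gene Length and Protein Length values listed in the record.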


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTCC CGATACGGCC CCATATCCGA GCCATGCGCG GGTACGTCCC GGGCGAGCAG 
CCCGCGGCCG GTTCCCGCCT CATCAAGCTC AACACCAACG AGAACCCGTA TCCGCCCAGC
CCGCGAGTGG CCGAGGCGAT TCGCGCCGAG CTGGCGGCCA CCGAGGTAGC CGGCGAGCGC
CTGCGCCTGT ACAGCGATCC CAACGGCTCG GCCCTGCTCG ACGCGGCCGC CGAGGTCACC
GGCTTTCCGC GCGACGGAAT CCTCGCCGGC AACGGCTCCG ACGAGCTGCT CGCGGTGCTG
GCGCGCGCCA TCCTCGGCCC CGGCGACGCG GTCGCGTATC CCTACCCGAC CTATCTCCTC
TACGAGACCA TGGCGCGTAT TCAGGACGCG CGGACGACCA CGTTCGACTT CCCGGCCGAC
TTCTCGTTGC CCGAGACGCT CTTCGGCTGC GACGCGCGCA TGGTCTTCGT CGCCAACCCC
AACTCGCCCT CGGGCACCCA GCAATCGAAC CAAGAGCTGA CCCGGCTGGC CAGCAGCCTG
CCCGACAGCC TGCTGGTGAT CGATGAGGCG TACGCGGGCT TTGCCGACGC CAACGCGCTG
GCGCTGGCCC AGGAGCTGCC CAACGTGGTC GTGCTGCGCA CGCTGTCCAA GAGCCACAGC
CTGGCCGGCA TGCGCGTCGG CCTGCTCTTT GGTTCCGCAG AGATCGTCGG CGAGCTGCGC
AAGGTGCGCG ACAGCTACAG CCTCGACCGC CTGGCCATCG TCGCCGGCGC CGCGTCGCTG
CGTGACACCG CCTGGGTGGA CGACACCACC GCCCGCATCC TGCGCACCCG CGAGCGGCTG
GTCGAAGCGC TGCGAGCGCT CGGCGTCGAG ACGCTGCCCA GCCGCGCCAA CTTTGTCTTT
GCCCGCATGG GCAGCGCCGC GCGCGCCAGC GCAGCCCAGC AGTTCCTGCG CGAACGGCAC
ATCCTCGTGC GCTACTTCGC CATGCGTCTG CTCGATGACG GGCTGCGTAT CACCGTGGGA
ACCGACGACG AGACCGACGC CCTGCTGCGC GCGCTGGAGG AGTTCGTCCA GAGCTGA
 
Protein sequence
MSLPIRPHIR AMRGYVPGEQ PAAGSRLIKL NTNENPYPPS PRVAEAIRAE LAATEVAGER 
LRLYSDPNGS ALLDAAAEVT GFPRDGILAG NGSDELLAVL ARAILGPGDA VAYPYPTYLL
YETMARIQDA RTTTFDFPAD FSLPETLFGC DARMVFVANP NSPSGTQQSN QELTRLASSL
PDSLLVIDEA YAGFADANAL ALAQELPNVV VLRTLSKSHS LAGMRVGLLF GSAEIVGELR
KVRDSYSLDR LAIVAGAASL RDTAWVDDTT ARILRTRERL VEALRALGVE TLPSRANFVF
ARMGSAARAS AAQQFLRERH ILVRYFAMRL LDDGLRITVG TDDETDALLR ALEEFVQS
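
The gene and protein sequences above can be cross-checked by translating the DNA and by computing its GC fraction. The following is a minimal sketch, not the method used to produce this record. It assumes the sequence is pasted as text with spaces between 10-base blocks, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The names clean, gc_content and translate are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 shares these amino acid assignments with the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence shown above.
    head = "ATGTCGCTCC CGATACGGCC CCATATCCGA"
    print(translate(head))   # MSLPIRPHIR
    print(gc_content(head))  # 0.6
```

The translated fragment matches the first ten residues of the protein sequence. Running the same functions on the full gene sequence should reproduce the listed protein and a GC fraction close to the reported 70%.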