Gene Hoch_6846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6846 
Symbol 
ID8549265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9380992 
End bp9382185 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content71% 
IMG OID646391506 
Productchorismate mutase 
Protein accessionYP_003271203 
Protein GI262199994 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase 
TIGRFAM ID[TIGR01799] chorismate mutase domain of T-protein
[TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.235672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACA GCAGTTCCAA GGATCCGCCC GCCCCGGGGG CCAGTGCTCC TGCGCGCGGA 
GAGGCGACGC GCGAGGGTGA CGCCGACAGC GCCAGCCGCG AGGCGCTCGA GGGGCTGCGC
CGTCGCATCG ACGAACTCGA CGCGCGTCTG GTCGCCCTGC TCAACGAACG GGCCGCGGTC
GTGGTCGAGG TCGGCCAGCT CAAGCGCAGC AGCGATGTGC CCATCTACGC GCCGCACCGC
GAGGCCCAGG TGCTGGGCCG GGCCCTGGCC GCCAACCGCG GCCCGCTGCC GGCGCGCACC
ATCGAGGGCG TGTTTCGCGA GCTGATGAGC GGCTCGTTCG CGCTCGAGCG GCCGCTGCGC
ATCGGCTACC TGGGCCCGCC CGGGACCTTC AGTCACGCCG CGGCCACGGC CCAGTTCGGC
TCGAGCGTGG AGTTCGTCGA CGTGCACGCG ATCGGGGCGG TGTTCGACGC GGTCAGCCGC
GAGCACGTGG ACTACGGCGT GGTGCCGATC GAGAACTCCA CCGGCGGCGG CATCGCCGAG
TGCCTGGACG CGTTTCTCGA GGTCGCCAAC CAGGTGACCA TCTACGCCGA GGTGCTGGTG
GCCGTGAGCC ACAACCTGAT CGCCAACTGC GCGCCCGACC AGATTCAGAC CATCTACTCG
AAGCCCGAGA TCTTCACCCA GTGCGCGCGC TGGCTGGCCC ACCAGTACCC GCACGCGCGC
CAGGTGCCGG CGCCCAGCTC GAGCCGCGCG GCCGAGATCG CGGCCCAGGA GATCGAGCGC
GATCCCGGCT GCGGGGCGGC CGCCATCGGC TCGACCCTGG CCGCCCAGAT CCACGGCCTC
AACCTGCTGT ACGCCGACAT CGAGGACAAC CCGCAGAACG TCACCCGCTT CTTTGTGATC
GCCCGCGAGG CGGCCGCGAG CTCGGGCGAC GATAAGACTT CGATCATGTT CCGCACGGCC
GATAGCCCGG GCGCGCTGGT GCGGGTGCTC AGTATTTTCG ACCGCGCGGG CGTCAACCTC
ACGCACATCG ACAAGCGGCC CAGCCGGCGC ACCAACTGGG ACTATACGTT CTTCATCGAC
GCCGACGGCC ACCTCGACGA TCCCAAGCTG GCCCACGTGA TCCGCGAGGC CGGCGCGCAC
TGCCGCGACT TCACCGTGCT CGGCAGCTAC CCGCGGGCCA AGCGGGTGCT GTAA
 
Protein sequence
MSDSSSKDPP APGASAPARG EATREGDADS ASREALEGLR RRIDELDARL VALLNERAAV 
VVEVGQLKRS SDVPIYAPHR EAQVLGRALA ANRGPLPART IEGVFRELMS GSFALERPLR
IGYLGPPGTF SHAAATAQFG SSVEFVDVHA IGAVFDAVSR EHVDYGVVPI ENSTGGGIAE
CLDAFLEVAN QVTIYAEVLV AVSHNLIANC APDQIQTIYS KPEIFTQCAR WLAHQYPHAR
QVPAPSSSRA AEIAAQEIER DPGCGAAAIG STLAAQIHGL NLLYADIEDN PQNVTRFFVI
AREAAASSGD DKTSIMFRTA DSPGALVRVL SIFDRAGVNL THIDKRPSRR TNWDYTFFID
ADGHLDDPKL AHVIREAGAH CRDFTVLGSY PRAKRVL