Gene Hoch_6599 details

Gene Information

Locus tag: Hoch_6599
Symbol:
ID: 8549016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 9052194
End bp: 9053321
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 67%
IMG OID: 646391259
Product: chorismate mutase
Protein accession: YP_003270958
Protein GI: 262199749
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0287] Prephenate dehydrogenase
TIGRFAM ID: [TIGR01791] chorismate mutase, archaeal type
            [TIGR01799] chorismate mutase domain of T-protein
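
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(9052194, 9053321)
print(length)                  # 1128
print(protein_length(length))  # 375
```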


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTCG ACACCTTGCG TAACGACCTG CAGTCCCTCG ACCGGGAGAT TCTCGCGCTG 
GTGGCCAAGC GCCAGGCCTT GGCCGCCGAG ATCGGCAGCA TCAAGCGCGC CGCCGGGGTG
CCCACGCGCG ACTACGGGCA GGAGCGCGCG GTGCTCGAGC GCGCCCGCGA GCACGCCGAC
GAGATGGGCA TCTCGCCCGC GCTCGCCGAG CAGATTCTGC TGCTGCTCAT CCGCTCCTCG
CTCACCGTGC AGGAGCGCGA CCGCGTGGCC GCGCTGGGCA GCGGCACCGG TCAGCGCGTG
CTGGTCATCG GCGGCAGCGG CAACATGGGC CGCTGGTTCG CGCGCTTTCT CGGCTCCCAG
GGCTACGCGG TGACCATCGC CGACCCGACG CCGGCGCCGG CCGAGCTGCG CGACTGCGAC
CAGGTGAGCG ACTTCCGCGA CACCTCGCTG GACCAGGACA TCATCGTGGT GGCGACGCCG
ATGATGACGG CCAACGCGAT CTTGCACGAG CTGGCGGAGC GCAAGCCCAA GGGTCTGGTG
TTCGACGTCG GCTCGCTCAA GAGTCCGCTG CGCACCGGCC TCGCCGCGCT GGTGCAGGCC
GGCGTGAGCG CGACCTCGCT CCATCCCATG TTCGGTCCCA ACACCGAGCT GCTCAGCGGT
CGCCACGTGG TGTTCGTCGA TATCGGCGTG CCCGAGGCGA CCAGCCGCGC GCGCGATCTG
TTTGCGTCGA CCATGGTCGT GCAGGTCGAG CTCGACCTGG AGAATCACGA TCGCCTGATC
GCCTACGTGC TGGGATTGTC GCACGCGCTC AACATCGCAT TTGCGAGCGC GCTGGCCGAG
AGCGGAGAGG CCGCGCCCAG GCTGGCCAAG ATGTCGAGCA CGACCTTCGA CGCGCAGCTC
GAAGTGTCCA CGCGCGTGGC CATGGAGAAT CCGCAACTTT ACTACGAAAT CCAATCACTC
AACGACTATG GAACCGAGTC TTTGACCGCG TTGCTGTATG CGGTGGAGCG TTTGCGCTCC
TTGGTTCGCG CCGGTGATGC CAAGGGCTTC GCCGCGTTGA TGGAGCGCGG ACGCGCGTAC
TTACAAGACC GTCGCAGCGA TGTGGATCCC CGGACGCGCT CCCTATAG
 
Protein sequence
MSLDTLRNDL QSLDREILAL VAKRQALAAE IGSIKRAAGV PTRDYGQERA VLERAREHAD 
EMGISPALAE QILLLLIRSS LTVQERDRVA ALGSGTGQRV LVIGGSGNMG RWFARFLGSQ
GYAVTIADPT PAPAELRDCD QVSDFRDTSL DQDIIVVATP MMTANAILHE LAERKPKGLV
FDVGSLKSPL RTGLAALVQA GVSATSLHPM FGPNTELLSG RHVVFVDIGV PEATSRARDL
FASTMVVQVE LDLENHDRLI AYVLGLSHAL NIAFASALAE SGEAAPRLAK MSSTTFDAQL
EVSTRVAMEN PQLYYEIQSL NDYGTESLTA LLYAVERLRS LVRAGDAKGF AALMERGRAY
LQDRRSDVDP RTRSL
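
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might strip the spacing, compute a GC fraction, and translate the coding sequence. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and all helper names are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments of the standard code,
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
first_line = "ATGAGTCTCG ACACCTTGCG TAACGACCTG CAGTCCCTCG ACCGGGAGAT TCTCGCGCTG"
print(translate(first_line))  # MSLDTLRNDLQSLDREILAL
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and GC content listed in the record.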