Gene Hhal_0568 details

Gene Information

Locus tag: Hhal_0568
Symbol:
ID: 4710227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 642859
End bp: 643947
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 70%
IMG OID: 639855026
Product: chorismate mutase
Protein accession: YP_001002156
Protein GI: 121997369
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
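
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are inclusive 1-based positions and that the gene length includes the stop codon; the function name check_record is hypothetical and not part of any database tool.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate and length fields agree with each other.

    Assumes inclusive coordinates and a single terminal stop codon that is
    counted in the gene length but not in the protein length.
    """
    span_bp = end_bp - start_bp + 1       # inclusive span on the replicon
    codons = gene_length_bp // 3          # codon count, stop codon included
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0       # a CDS should be a whole number of codons
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields for Hhal_0568.
consistent = check_record(642859, 643947, 1089, 362)
print(consistent)
```

For this record the span 643947 - 642859 + 1 equals 1089 bp, which is 363 codons, or 362 amino acids plus one stop codon.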


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGTCG ACAATGACGA CCTCGCCGCG CTGCGGCAAC GGATCGACGG GATCGACGAC 
CAGATCCTCG AGCTGGTCAG TGAGCGGGCC CGCCTGGCCG AGGAGGTGGC CCGCGCCAAG
GCCCGCCGCG GCGAGGCGCT CAAGGTCTAC CGCCCGGAGC GCGAGGCGGA GATCCTGCGC
CGGTTGACCG AGCACAACCG TGGGCCGCTG ACCCGTGAGC AGGTCACGGT GCTCTTCCGC
GAGGTGATGT CCGCCTGCCG GGCGCTCCAG CAGCCGCTGA CCGTCGCCTT TCTCGGCCCC
CAGGGGACCT TCACCGAAGA GGCGGCCAGC AAGCACTTCG GCCACGATGC CGGGATGTCG
CCCCAGGCCA CCATCGGCGG GGTCTTCCGC GAGGTGGAGT CGGGCGCGGC GCACTACGGG
GTGGTGCCGG TGGAGAACTC CTCCGAGGGC GTCGTCAGTC ATACCCTGGA TCGCTTCCTG
GATTCGGAGC TGGCCATCGT CGGTGAGGTG GAGCTACGCA TCCACCACGC CCTGGCCAGT
CACGCCGGGG GGCTGACCTC GATTGACCGG GTCTACTCCC ACCAGCAGGG GCTGTCGCAG
TGCCGCGCGT GGCTGGAGAC CCACCTCCCG CAGGCCGAGC GCCACCCGGT CTCCAGCACG
GCCGAAGCGG CCCGGCTGGC GGCCCTGGAG CCGAGCGCGG CGGCCATCGC CAGCGAGGCG
GCCGCGGAGC GCTACGGCGT GCCGCTGCTT CAGGAGCGCA TCGAGGACTA CCACGGCAAC
ACCACCCGTT TTCTGGTTCT GGGCTACCAG TCCCCGCCGC CCAGCGGCCA CGACAAGACC
TCGCTGGTGG TCTCCAGCGC CAACCGCTCC GGGTTGCTCT TTCAGCTGCT CGAGCCGCTG
GCGCGCAACG GCATCGACAT GACCCGGATC GAGTCGCGTC CGGCACGCCA GCGCGGGGTT
TGGGAGTACG TCTTCTTCAT CGATATCCTC GGGCACGCCG AGGACGAGAG CCTGCGCGGT
CCCCTTGCGG AGATGCGCGA GCGTGCCAGC CTCTTCCGTA TCCTCGGCTC CTATCCAAGG
GCGATCTGA
 
Protein sequence
MSVDNDDLAA LRQRIDGIDD QILELVSERA RLAEEVARAK ARRGEALKVY RPEREAEILR 
RLTEHNRGPL TREQVTVLFR EVMSACRALQ QPLTVAFLGP QGTFTEEAAS KHFGHDAGMS
PQATIGGVFR EVESGAAHYG VVPVENSSEG VVSHTLDRFL DSELAIVGEV ELRIHHALAS
HAGGLTSIDR VYSHQQGLSQ CRAWLETHLP QAERHPVSST AEAARLAALE PSAAAIASEA
AAERYGVPLL QERIEDYHGN TTRFLVLGYQ SPPPSGHDKT SLVVSSANRS GLLFQLLEPL
ARNGIDMTRI ESRPARQRGV WEYVFFIDIL GHAEDESLRG PLAEMRERAS LFRILGSYPR
AI
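
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline that produced this record: the names CODON_TABLE and translate are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). Only the first 60 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Amino-acid assignments are those of the standard code, which
# translation table 11 shares; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGTCCGTCG ACAATGACGA CCTCGCCGCG CTGCGGCAAC GGATCGACGG GATCGACGAC"
print(translate(first_line))
```

Applied to the first line of the gene sequence, the routine yields the first 20 residues of the protein sequence (MSVDNDDLAA LRQRIDGIDD). Pasting in the full 1089-base sequence in the same way should reproduce the 362-residue protein, with the final TGA codon read as the stop.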