Gene HS_0365 details


Gene Information

Locus tag: HS_0365
Symbol: pheA
ID: 4239841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 373254
End bp: 374411
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 35%
IMG OID: 638103908
Product: chorismate mutase / prephenate dehydratase
Protein accession: YP_718575
Protein GI: 113460511
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1
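
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that arithmetic, not part of the original record. The function names are hypothetical, and the code assumes the final codon is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(373254, 374411)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```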


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0036667
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATTAG ATTTAAGCGA AATTCGTCAA CAAATTACAC AAATTGACCG CAGTTTATTA 
AAGTTGCTTT CGGAGCGTCA TCGTTTAGCA TTTGATGTCG TACGTAGTAA AGAAATCTCG
CAGAAAGCAT TACGTGATGT TGAACGTGAG CAGCAATTAT TGAAAGAGTT GGTTCAATTT
GCGGAAAATG AAAATTATCA ATTGGAACCA CAATATATTA CTCAGATTTT CCAAAAAATT
ATCGAAGATT CCGTGTTGAC TCAGCAAGTA TATTTGCAAA AGAAATTGAA TGAGCAAAGA
GAGAAGAATA TTCATATTGC TTTTTTAGGT AAAAGAGGTT CTTACTCGCA TTTAGCCGCC
AGAAATTATG CAACTCGTTA TCAGGAGCAA CTAATTGAGC TAAGTTGTGC GTCTTTTGAT
GAAGTATTTT CCTCTGTGCA AAATGAGGAG GCAAGTTATG GCATTTTACC GTTGGAGAAT
ACAACCTCAG GAGCGATTAA TGAAGTGTAT GATTTATTAC AGCATACAGA TCTTTCTTTA
GTAGGTGAAT TGGCTTATCC AATTAAACAT TGTGTTCTGG TAAATGCTCA AGATGATTTG
GATAAGATTG ATACTTTATA CAGTCATCCT CAAGTGATTC AGCAATGTAG CCAATTTATT
CGTACTTTAG CGCGAGTTCA TATTGAATAT TGTGAAAGCA GCTCACATGC AATGCAACTT
GTTGCCAGCT TAAATAAACC TAATATTGCA GCTTTAGGCA ATGAAGATGG TGGGAATTTA
TATGGTTTAA AAGTATTAAA GTCCGGTATA GCAAACCAAG AAAACAATAT TACGAGATTT
ATTGTTCTTG CTAAGAATCC GATTGCAGTA TCACCGCAAA TTCATACAAA GACATTATTA
TTAATGAGTA CTGCACAAAA AGCGGGGGCA TTAGTTGATG CTTTATTGGT CTTCAAAAAA
TATAACATCA ATATGACGAA GTTAGAGTCA CGTCCAATTT ATGGTAAACC ATGGGAAGAG
ATGTTTTATT TAGAAATTGA GGCTAATATT AATAACCCTA TCGCTCAGCA AGCTTTTACT
GAACTAAAAG CATTCAGTAA CTACTTGAAA ATCTTAGGTT GTTATCCAAG TGAAATTGTG
AAACCTGCCG AAGTCTAA
 
Protein sequence
MSLDLSEIRQ QITQIDRSLL KLLSERHRLA FDVVRSKEIS QKALRDVERE QQLLKELVQF 
AENENYQLEP QYITQIFQKI IEDSVLTQQV YLQKKLNEQR EKNIHIAFLG KRGSYSHLAA
RNYATRYQEQ LIELSCASFD EVFSSVQNEE ASYGILPLEN TTSGAINEVY DLLQHTDLSL
VGELAYPIKH CVLVNAQDDL DKIDTLYSHP QVIQQCSQFI RTLARVHIEY CESSSHAMQL
VASLNKPNIA ALGNEDGGNL YGLKVLKSGI ANQENNITRF IVLAKNPIAV SPQIHTKTLL
LMSTAQKAGA LVDALLVFKK YNINMTKLES RPIYGKPWEE MFYLEIEANI NNPIAQQAFT
ELKAFSNYLK ILGCYPSEIV KPAEV
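
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that with the Python standard library only; it is an illustration, not the method used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace and dropping one trailing stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 18 bases of the gene sequence shown above.
print(translate("ATGTCATTAG ATTTAAGCGA AATTCGTCAA"))  # MSLDLSEIRQ
print(translate("AAACCTGCCG AAGTCTAA"))               # KPAEV
```

Passing the full gene sequence, with or without the spacing between 10-base groups, to `translate` gives a string that can be compared directly with the protein sequence once its spaces are removed.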