Gene Pars_2038 details


Gene Information

Locus tag: Pars_2038
Symbol:
ID: 5055909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1822178
End bp: 1823065
Gene Length: 888 bp
Protein Length: 295 aa
Translation table: 11
GC content: 66%
IMG OID: 640469587
Product: chorismate mutase
Protein accession: YP_001154236
Protein GI: 145592234
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0710] 3-dehydroquinate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01093] 3-dehydroquinate dehydratase, type I
            [TIGR01808] monofunctional chorismate mutase, high GC gram positive type
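
The coordinate and length fields above are related by simple arithmetic. The following minimal Python sketch checks that they agree, assuming 1-based inclusive coordinates and a single terminal stop codon (both are assumptions, not stated in the record). The function name `check_lengths` is illustrative only.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    span = end_bp - start_bp + 1           # inclusive span in bp
    codons = gene_len_bp // 3              # whole codons in the CDS
    return (span == gene_len_bp
            and gene_len_bp % 3 == 0
            and codons - 1 == protein_len_aa)  # drop the stop codon


# Values taken from the record above.
print(check_lengths(1822178, 1823065, 888, 295))  # prints True
```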


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATATGCG GAGCGGTCCC CGTCAGGAGA CCGAGAGACG TGGAAAGGGC GCTGGAGGCG 
CCGCTTACGT GCCTTGAGCT TAGACTCGAC TACCTAGAGG CGCCGCTGTC TGAGGCATGG
CCTGTGCTGG AGGAGGCGGC GGCGCGCCGC ACGGTTATAG TCACGGTGAG GAGGAGGGAG
GAGGGCGGGC ACTGGCGGGG CGGCGAGGAG GAGAGAGAGG CGTTGTACAG AAAGCTCCTC
GACCTCAACC CCCACTACGT CGACGTCGAG GCGGAGTCCC CCATCGCCCC GAGAATTGCC
GAGGTAAAGG GCAGGGCCAA GCTCATAGCC AGCAGACACG ACTTCGGGGG GACGCCCCCG
CTGGAGGTTC TCAGAAGCTG GGCGGAGAAG GCGGCGGCGC TGGGCGACGT GGTAAAGGTG
GTTACCTACG CCCGGGAGCC GGCCGACGGG CTTAGAGTCC TCTCGCTTAT AGGAGCCGTG
GAGAAGCTTG TTGTCGCCTT CGCCATGGGC CCCGCCGGGA CGTACACCAG AGTGGCGGCG
GCGGCGCTGG GCAGCCCCAT TATGTACGTC TCGCTGGGCG AAGCCACGGC GCCGGGGCAG
ATCGCCGCAG ATGCCTACTT CGCCGCCCTC ACCGCGCTGG GCATCGCCCC GGCGGGGGAG
GGCCTCCCCT CGCTTAGAGA GGCGCTGGAC TGGATAGACG GCGGCCTCAT GTACCTGCTT
AGGAAGAGGC TAGAGATCTG CCGCGACATG GGCAAGCTCA AGAAGGCAGC CGGCCTGCCT
GTGTACGACG ACGTGAGGGA GGCCCAAGTG TTAAGACGAT CCGGCGACTT CAAGCAGATC
TTCGAGCTCG TCGTCCAGAT GTGCAAGGCT GTGCAACTGG TGGCATAG
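
A GC content figure can be recomputed directly from a gene sequence laid out in space-separated groups like the one above. The sketch below is one way to do it; the helper name `gc_percent` is hypothetical, and the exact rounding used by the source database is not known.

```python
def gc_percent(seq):
    """GC content (%) of a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()   # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence above, as a small example.
print(round(gc_percent("TTGATATGCG GAGCGGTCCC"), 1))
```

Passing the full gene sequence (all lines joined) gives a value that can be compared against the GC content field in the Gene Information section.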
 
Protein sequence
MICGAVPVRR PRDVERALEA PLTCLELRLD YLEAPLSEAW PVLEEAAARR TVIVTVRRRE 
EGGHWRGGEE EREALYRKLL DLNPHYVDVE AESPIAPRIA EVKGRAKLIA SRHDFGGTPP
LEVLRSWAEK AAALGDVVKV VTYAREPADG LRVLSLIGAV EKLVVAFAMG PAGTYTRVAA
AALGSPIMYV SLGEATAPGQ IAADAYFAAL TALGIAPAGE GLPSLREALD WIDGGLMYLL
RKRLEICRDM GKLKKAAGLP VYDDVREAQV LRRSGDFKQI FELVVQMCKA VQLVA