Gene Pars_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2119 
Symbol 
ID5055775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1894166 
End bp1895272 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content62% 
IMG OID640469671 
Productchorismate synthase 
Protein accessionYP_001154317 
Protein GI145592315 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCT TCGGCAGGGA ACTCCGCATC ACCACTTTCG GCGAGTCCCA CGGCCGGGCC 
ATAGGCGTAG TTATAGACGG GGTCCCCGCC GGGCTCCCCC TTACCGAGGA GGACATAAGG
AAGGAGCTGG ACAGGAGGAT GTTCTGCCAC ATCCACTGGC TAAACCCCCG GTGCGAGCCT
GAGGAGTTCG AAATACTGTC AGGCGTAAAA GACGGCCACA CCCAAGGCAC GCCCATCGCC
ATTGTGATAT GGAACAAGAA GGCCATATCC AGCTACTACG ACGAGCTCTG GATGAAGCCC
AGGCCTGGCC ACGCCGACCT CGCTTACTAC CTCAAGTATG GCAAGTTCTA CGACCACAGA
GGCGGCGGAC GGGCCTCTGG CCGCACCACA GCGGCAATCG TGGCGGCGGG GGCCGTGGCC
AAGAAGCTCT TGGCGCTGGT GGGAGCCGAG GTGGCAGGCC ACATAGTGGA GCTGGGGGGC
GTAGAGGTGA AGCGGCCGTA CACCTTTGAA GACGTGAAGA AGAGCTGGGA GAAGCCCCTT
CCAGTGGTCG ACGACGATGC CCTAGCCGCC ATGCTTGAGG TGTTGCGCAA AAACGCCGCC
GAGGGAGACA GCGTGGGGGG CGGCGTTGAG ATCTGGGCGG TGGGCGTCCC GCAGGGCCTG
GGCGAACCTC ACTTTGGGAA AATAAGGGCA GATCTCGCTC ACGCTGCCTT CTCGGTGCCC
GCCGTGGTGG CCCTAGACTG GGGCGCCGGG AGGCAACTCG CCAAGATGCG CGGCTCAGAG
GCCAACGACC CCATAGTGGT GAAGGGCGGC AAGCCGGGGC TGGAGACTAA CAAGATAGGG
GGGGTCCTCG GCGGCATAAC GATAGGCGAG CCCTTATACT TCAGGGTGTG GTTAAAGCCT
ACACCATCGG TGAGGAAGCC GCAGAGGACT GTGGACTTGG CAAAGATGGA GCCGGCCACG
TTGCAGTTCA AGGGCCGATA CGACGTATCT GTAGTGCCCA AGGCCCTCGT GGCGCTGGAG
GCGATGACGG CAATAACGCT AGCCGATCAC CTCCTCCGCG CGGGGGTGAT CAGAAGAGAC
CGGCCGCTGA AAGATCCTGT GGTTTAA
 
Protein sequence
MNTFGRELRI TTFGESHGRA IGVVIDGVPA GLPLTEEDIR KELDRRMFCH IHWLNPRCEP 
EEFEILSGVK DGHTQGTPIA IVIWNKKAIS SYYDELWMKP RPGHADLAYY LKYGKFYDHR
GGGRASGRTT AAIVAAGAVA KKLLALVGAE VAGHIVELGG VEVKRPYTFE DVKKSWEKPL
PVVDDDALAA MLEVLRKNAA EGDSVGGGVE IWAVGVPQGL GEPHFGKIRA DLAHAAFSVP
AVVALDWGAG RQLAKMRGSE ANDPIVVKGG KPGLETNKIG GVLGGITIGE PLYFRVWLKP
TPSVRKPQRT VDLAKMEPAT LQFKGRYDVS VVPKALVALE AMTAITLADH LLRAGVIRRD
RPLKDPVV