Gene Pars_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2112 
Symbol 
ID5054716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1887509 
End bp1888705 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content62% 
IMG OID640469664 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001154310 
Protein GI145592308 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.308639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTGTA TTGAGGCGGG GCGCCTCGAG GGGAGATTCC CCCTGCCCCC TTCTAAGCCA 
TACTCACAGC GCCTACTGCT GGCAAGCGCG TTGGCCGAGG GGGAGACCGT AGTGAGGGGT
CTTGAGTTAA GCGACGACGT GGTGGCGATG GTTAGGGCTA TACAGCCAAT TGCCTCTATA
ACGCTGAGGG CGGACACGGC GGTTGTCTCG AAGAGGGAGC CCGACAAGTA CAGGGCCTTC
AACGTGATGG AGAGCGGCTT CACCCTGAGG ACCGCGGTGG CTGTATACGC CGGCATCCCA
GGACTCACGG CAGTGTACTT CGGCGGCACC CTCAGGGGGA GGCCCATCGA CGAGCTGGTG
GAGGTGTTGA GGAGGCTCGT CTCCGTGTCT AAGTTGCCGG GTGCGGTGGT GATTGATGGG
AGGCGGCTGG GGCGGTTTCG GGTTGAGATC AGGGCCGACG TCTCTTCGCA GTATATCTCA
GGCCTTATGT TCCTCGCCGC GGCTGGTGAC GGCGGCGTTG TTGTGCCGAA AGGGGAGAGG
AAGTCCTGGA GCTTCGTCGA GGCTACCGCA GATGTGTTGA GGCTCTTCGG CGCAGAGGTT
TCGATGGGCG ACGAAGTGGT TGTCGAGGGC GGGCTGAGAA GCCCCGGCAC TGTGGACGTG
CCGGGTGATC TAAGCCTTGC CTCCTTCCTC CTAGTGGCCA GTCTCGCCAC TGGCGGGAAG
GTCCGCCTCG AAGGCGCTGT CACGAAGCTC GACGCCGTTG TCCTAGACAT ATTCAAGTTT
ATGGGCGCCG ATATTGCCTA CGGCGATGGC TACGTCGAGG CGCGGGGCGG ATTCACCAAG
GGAGTGGACG TGGATCTAGG CGGCAACCCC GACCTCGTCA TGCCGGTGGC GTTGGCTGCG
GCGATGGTGG AGGAGCAGTC GGCCATACGG GGGGTTGAGC ACTTGCGTTT CAAGGAGAGC
GACAGGGTAG CCACTGTGCT TGACGTGTTG TGGAGGCTGG GGGTCGACGC GAGATATGAG
GGCGGCGTCC TGTACATAAA GGGCCCGCCT AAGCGCCGCG ATGTCCGCTT CTCCTCTAGC
GGAGACCACA GGATTGGCCT CATGGCCATG GCCGCGGCTA AGGCCGTCGG CGGTTGTGTA
GACGACATTA GCCCAGTTGC CAAGAGTTGG CCGTCGGCGA TTTTATACTT TAAATAA
 
Protein sequence
MLCIEAGRLE GRFPLPPSKP YSQRLLLASA LAEGETVVRG LELSDDVVAM VRAIQPIASI 
TLRADTAVVS KREPDKYRAF NVMESGFTLR TAVAVYAGIP GLTAVYFGGT LRGRPIDELV
EVLRRLVSVS KLPGAVVIDG RRLGRFRVEI RADVSSQYIS GLMFLAAAGD GGVVVPKGER
KSWSFVEATA DVLRLFGAEV SMGDEVVVEG GLRSPGTVDV PGDLSLASFL LVASLATGGK
VRLEGAVTKL DAVVLDIFKF MGADIAYGDG YVEARGGFTK GVDVDLGGNP DLVMPVALAA
AMVEEQSAIR GVEHLRFKES DRVATVLDVL WRLGVDARYE GGVLYIKGPP KRRDVRFSSS
GDHRIGLMAM AAAKAVGGCV DDISPVAKSW PSAILYFK