Gene Information |
Locus tag | Pars_2112 |
Symbol | |
ID | 5054716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1887509 |
End bp | 1888705 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640469664 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001154310 |
Protein GI | 145592308 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
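The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete, unspliced CDS whose final codon is a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Illustrative helpers only; these names are assumptions, not a database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1887509, 1888705)
print(length, protein_length(length))  # prints: 1197 398
```

Both values match the Gene Length and Protein Length fields of the record.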
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.308639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTGTGTA TTGAGGCGGG GCGCCTCGAG GGGAGATTCC CCCTGCCCCC TTCTAAGCCA TACTCACAGC GCCTACTGCT GGCAAGCGCG TTGGCCGAGG GGGAGACCGT AGTGAGGGGT CTTGAGTTAA GCGACGACGT GGTGGCGATG GTTAGGGCTA TACAGCCAAT TGCCTCTATA ACGCTGAGGG CGGACACGGC GGTTGTCTCG AAGAGGGAGC CCGACAAGTA CAGGGCCTTC AACGTGATGG AGAGCGGCTT CACCCTGAGG ACCGCGGTGG CTGTATACGC CGGCATCCCA GGACTCACGG CAGTGTACTT CGGCGGCACC CTCAGGGGGA GGCCCATCGA CGAGCTGGTG GAGGTGTTGA GGAGGCTCGT CTCCGTGTCT AAGTTGCCGG GTGCGGTGGT GATTGATGGG AGGCGGCTGG GGCGGTTTCG GGTTGAGATC AGGGCCGACG TCTCTTCGCA GTATATCTCA GGCCTTATGT TCCTCGCCGC GGCTGGTGAC GGCGGCGTTG TTGTGCCGAA AGGGGAGAGG AAGTCCTGGA GCTTCGTCGA GGCTACCGCA GATGTGTTGA GGCTCTTCGG CGCAGAGGTT TCGATGGGCG ACGAAGTGGT TGTCGAGGGC GGGCTGAGAA GCCCCGGCAC TGTGGACGTG CCGGGTGATC TAAGCCTTGC CTCCTTCCTC CTAGTGGCCA GTCTCGCCAC TGGCGGGAAG GTCCGCCTCG AAGGCGCTGT CACGAAGCTC GACGCCGTTG TCCTAGACAT ATTCAAGTTT ATGGGCGCCG ATATTGCCTA CGGCGATGGC TACGTCGAGG CGCGGGGCGG ATTCACCAAG GGAGTGGACG TGGATCTAGG CGGCAACCCC GACCTCGTCA TGCCGGTGGC GTTGGCTGCG GCGATGGTGG AGGAGCAGTC GGCCATACGG GGGGTTGAGC ACTTGCGTTT CAAGGAGAGC GACAGGGTAG CCACTGTGCT TGACGTGTTG TGGAGGCTGG GGGTCGACGC GAGATATGAG GGCGGCGTCC TGTACATAAA GGGCCCGCCT AAGCGCCGCG ATGTCCGCTT CTCCTCTAGC GGAGACCACA GGATTGGCCT CATGGCCATG GCCGCGGCTA AGGCCGTCGG CGGTTGTGTA GACGACATTA GCCCAGTTGC CAAGAGTTGG CCGTCGGCGA TTTTATACTT TAAATAA
Protein sequence | MLCIEAGRLE GRFPLPPSKP YSQRLLLASA LAEGETVVRG LELSDDVVAM VRAIQPIASI TLRADTAVVS KREPDKYRAF NVMESGFTLR TAVAVYAGIP GLTAVYFGGT LRGRPIDELV EVLRRLVSVS KLPGAVVIDG RRLGRFRVEI RADVSSQYIS GLMFLAAAGD GGVVVPKGER KSWSFVEATA DVLRLFGAEV SMGDEVVVEG GLRSPGTVDV PGDLSLASFL LVASLATGGK VRLEGAVTKL DAVVLDIFKF MGADIAYGDG YVEARGGFTK GVDVDLGGNP DLVMPVALAA AMVEEQSAIR GVEHLRFKES DRVATVLDVL WRLGVDARYE GGVLYIKGPP KRRDVRFSSS GDHRIGLMAM AAAKAVGGCV DDISPVAKSW PSAILYFK
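The two sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how they could be cleaned and cross-checked, assuming the codon assignments of NCBI translation table 11, which for elongation codons are the same as the standard code. The helper names are hypothetical. Alternative start codons such as GTG or TTG are not treated specially here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGTTGTGTA TTGAGGCGGG GCGCCTCGAG"))  # prints: MLCIEAGRLE
```

Applied to the full gene sequence, `translate` can be compared against the cleaned protein sequence, and `gc_content` against the GC content field of the record.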