Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2119 |
Symbol | |
ID | 5055775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1894166 |
End bp | 1895272 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640469671 |
Product | chorismate synthase |
Protein accession | YP_001154317 |
Protein GI | 145592315 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCT TCGGCAGGGA ACTCCGCATC ACCACTTTCG GCGAGTCCCA CGGCCGGGCC ATAGGCGTAG TTATAGACGG GGTCCCCGCC GGGCTCCCCC TTACCGAGGA GGACATAAGG AAGGAGCTGG ACAGGAGGAT GTTCTGCCAC ATCCACTGGC TAAACCCCCG GTGCGAGCCT GAGGAGTTCG AAATACTGTC AGGCGTAAAA GACGGCCACA CCCAAGGCAC GCCCATCGCC ATTGTGATAT GGAACAAGAA GGCCATATCC AGCTACTACG ACGAGCTCTG GATGAAGCCC AGGCCTGGCC ACGCCGACCT CGCTTACTAC CTCAAGTATG GCAAGTTCTA CGACCACAGA GGCGGCGGAC GGGCCTCTGG CCGCACCACA GCGGCAATCG TGGCGGCGGG GGCCGTGGCC AAGAAGCTCT TGGCGCTGGT GGGAGCCGAG GTGGCAGGCC ACATAGTGGA GCTGGGGGGC GTAGAGGTGA AGCGGCCGTA CACCTTTGAA GACGTGAAGA AGAGCTGGGA GAAGCCCCTT CCAGTGGTCG ACGACGATGC CCTAGCCGCC ATGCTTGAGG TGTTGCGCAA AAACGCCGCC GAGGGAGACA GCGTGGGGGG CGGCGTTGAG ATCTGGGCGG TGGGCGTCCC GCAGGGCCTG GGCGAACCTC ACTTTGGGAA AATAAGGGCA GATCTCGCTC ACGCTGCCTT CTCGGTGCCC GCCGTGGTGG CCCTAGACTG GGGCGCCGGG AGGCAACTCG CCAAGATGCG CGGCTCAGAG GCCAACGACC CCATAGTGGT GAAGGGCGGC AAGCCGGGGC TGGAGACTAA CAAGATAGGG GGGGTCCTCG GCGGCATAAC GATAGGCGAG CCCTTATACT TCAGGGTGTG GTTAAAGCCT ACACCATCGG TGAGGAAGCC GCAGAGGACT GTGGACTTGG CAAAGATGGA GCCGGCCACG TTGCAGTTCA AGGGCCGATA CGACGTATCT GTAGTGCCCA AGGCCCTCGT GGCGCTGGAG GCGATGACGG CAATAACGCT AGCCGATCAC CTCCTCCGCG CGGGGGTGAT CAGAAGAGAC CGGCCGCTGA AAGATCCTGT GGTTTAA
|
Protein sequence | MNTFGRELRI TTFGESHGRA IGVVIDGVPA GLPLTEEDIR KELDRRMFCH IHWLNPRCEP EEFEILSGVK DGHTQGTPIA IVIWNKKAIS SYYDELWMKP RPGHADLAYY LKYGKFYDHR GGGRASGRTT AAIVAAGAVA KKLLALVGAE VAGHIVELGG VEVKRPYTFE DVKKSWEKPL PVVDDDALAA MLEVLRKNAA EGDSVGGGVE IWAVGVPQGL GEPHFGKIRA DLAHAAFSVP AVVALDWGAG RQLAKMRGSE ANDPIVVKGG KPGLETNKIG GVLGGITIGE PLYFRVWLKP TPSVRKPQRT VDLAKMEPAT LQFKGRYDVS VVPKALVALE AMTAITLADH LLRAGVIRRD RPLKDPVV
|
| |