Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0846 |
Symbol | |
ID | 5054681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 751213 |
End bp | 752187 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640468406 |
Product | anthranilate synthase |
Protein accession | YP_001153083 |
Protein GI | 145591081 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.481482 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGAGGC GGGGCCCGTA CGCGGTGGGG CTACTGCCCT TCCACGCCGT GTCGCCGTTC GATTCAGCAG AGGCGAGGCG CAGGGAGCCT TGGCCCGAGG CCCTCTTCGT GGTGGGCCCG CCCTCGGCCC CGAGCTTGAG GGGAGGGGGG ATCTCGCTGA GGTTGGAGGA GGAGGTGCCG TGCGGGGAGT ACGAGGAGGC TGTGGAGGAG GCCAAGAGGG CGCTGGCTCG GGGGGAGCTG TTCCAGCTGG TGCTCTCCCG CTTCAAGAAA TTTAAAGGGT GGGCAACCCC CGACGCGGTT TTGAAGAGGC TTGCCGCCGT CATGGATGGG AAGTACTACT TCTTCTTGGA GGCTGGCGAT TTGTGGGTGG CCGGCATCTC GCCCGAGACC TTGGTCTCAG TCGAGGAGGG ACGGGCATGG AGTTCTCCTA TAGGTGGCAC TAGGCCTAGG GGGGCTACTT CTGAGGAGGA CTTGGCGCTG GAGGCAGAGC TGGTAAACAG CGTGAAGGAC AGGGCTGAGC ACATAATGCT AGTGGACAGC GTCAGAAATG ACCTGGGCCG AGTATGCGCG TGGGGCACTG TTTCGGCCAG CCGCGTGGCT GTCGTCGAGA AGTTCAGCTA CGTCCAGCAC CTAGTCTCGT ATGTGGGGTG CCGGCTGGCG AGGGGCGTAA CGCCGCTGAG AGCCGCCGCC GCGTTGAACC CAACAACGAC GGTGACAGGC GTGCCGAAGC CAAGGGCAAT AGAATACATA AACGCCCTTG AGAGGGAGCC GCGCGGCCCA TTCGCCGGAT CTTTCGGGGC TGTTTGGCCA GGTGGCGGCG ACTTCGCAGT GGTCATCCGG TCGCTGTACG GCGAGGGGGA CACAGTCTAT CTCTGGGGCG GCGCCGGGAT TGTGATGGAC TCCGATCCAA AAGGGGAGTG CCGTGAGACG GAGGTTAAGA TGGGGCCAAT CGCCCGGGCG CTCACAAGCC CATAG
|
Protein sequence | MERRGPYAVG LLPFHAVSPF DSAEARRREP WPEALFVVGP PSAPSLRGGG ISLRLEEEVP CGEYEEAVEE AKRALARGEL FQLVLSRFKK FKGWATPDAV LKRLAAVMDG KYYFFLEAGD LWVAGISPET LVSVEEGRAW SSPIGGTRPR GATSEEDLAL EAELVNSVKD RAEHIMLVDS VRNDLGRVCA WGTVSASRVA VVEKFSYVQH LVSYVGCRLA RGVTPLRAAA ALNPTTTVTG VPKPRAIEYI NALEREPRGP FAGSFGAVWP GGGDFAVVIR SLYGEGDTVY LWGGAGIVMD SDPKGECRET EVKMGPIARA LTSP
|
| |