Gene Information |
Locus tag | Pars_1419 |
Symbol | trpD |
ID | 5054451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1280094 |
End bp | 1281095 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468960 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_001153629 |
Protein GI | 145591627 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
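The coordinate and length fields in this table are linked by simple arithmetic. The sketch below checks them, assuming the Start and End positions are 1-based and inclusive (the record does not state the convention, but it is the one that reproduces the listed lengths) and that the CDS ends in a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the terminal stop codon encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(1280094, 1281095)
print(length, protein_length(length))  # prints "1002 333"
```

Both values agree with the Gene Length (1002 bp) and Protein Length (333 aa) fields listed for Pars_1419.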
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.195691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCTAA AAGCCCTATT GAGAAAGCTC GGCAACGGCC TTAGGCTTAC GGCCGACGAG GCTTACCTCC TCGGGAGGGG GATCCTCTCA GGTTCCCTTT GCGACGTCGA GGTGGCGGCT TCCCTCACTG CTATGAGGGT GCGGGGTGAA TCGCCGGAGG AGGTGGCTGG GTTTGTGAAG ATGGCTAGGG AGTTCGCGGT TAGGGTGCCC CTCCGCATAG AGGCGGTGGA CACGGCGGGG ACTGGCGGGG ACGGCGCCGG TACCATAAAC CTCTCGACGG CAGCGGCCAT CGTCGCCGCG GCGGCTGGGG CTAAGGTGTT GAAGCACGGG AATAGGTCAG CCTCAGGCCT CTTCGGCAGC GCCGACTTTA TGGAGGCGGT TGGCTACAAC TTAGAAGTAG GGCCTGAGAA GGCGGCTGAG CTTGTGGAAA AGGTGGGCTT CGCCTTCGTC TTCGCGCCTA GATACCACCC GGCCTTCGCC AAAGTGGCGC CTGTGCGCCG CGCCCTGCCG TTCCGAACTA TTTTCAACAT CGTCGGCCCC TTGGCCAACC CGGGGCTGGT GAAGAGGCAA CTCATCGGCG TTGCTGAAGA GAGGCTTCTC GAAGTTGTGG CGGCCGCCGC GGCTGAGCTG GGTTTTGAAC ACGCAGTGGT TGTCCACGGC TCTGGAGTAG ACGAGGTGTC CAGTGAGGGG GCGACGACGG TGTACGAGGT GAAGAGGGGG TCGTTGGAGA GGTACCAAAT AGCGCCGGAG GATTTAGGCG CCCCCCGCGT CCCCATACCG CGTGCCTCTG ACAAAGAAGA GGCGGTGGCT AAGGCCTTGG CTGGGCTTCG GGGAGAGCTG AGGGAGGCCT CCGTGGCCAT TGCCCTAAAC GCCGCCTTTG CCCTCTACGT AGCAGGAGTG GTTGGAGATC CGAGAGATGG CTTTGAGCTT GCCATGAGGG CGATACAAGA GGGGGTAGCG TATCGGAAGC TGGTAGAGGC GGTGGAGGCA TCTCGGACAT GA
|
Protein sequence | MDLKALLRKL GNGLRLTADE AYLLGRGILS GSLCDVEVAA SLTAMRVRGE SPEEVAGFVK MAREFAVRVP LRIEAVDTAG TGGDGAGTIN LSTAAAIVAA AAGAKVLKHG NRSASGLFGS ADFMEAVGYN LEVGPEKAAE LVEKVGFAFV FAPRYHPAFA KVAPVRRALP FRTIFNIVGP LANPGLVKRQ LIGVAEERLL EVVAAAAAEL GFEHAVVVHG SGVDEVSSEG ATTVYEVKRG SLERYQIAPE DLGAPRVPIP RASDKEEAVA KALAGLRGEL REASVAIALN AAFALYVAGV VGDPRDGFEL AMRAIQEGVA YRKLVEAVEA SRT
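The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own method, for stripping that grouping and computing a GC fraction that can be compared with the GC content field. The helper names are hypothetical, and the example call uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, for illustration only.
print(f"{gc_content('GTGGATCTAA AAGCCCTATT'):.2f}")
```

Passing the full Gene sequence string to `gc_content` gives the value for the whole gene.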