Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2208 |
Symbol | trpD |
ID | 4445269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2484310 |
End bp | 2485368 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639690017 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_831688 |
Protein GI | 116670755 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00486869 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACTTCAC AGGCATCTGC ACAGGCGGCG GGCAACACCT GGCCGCGGCT CATTTCGGCA CTGATCAACG GCGCGGACCT GACGGCCGAC AATACGGAAT GGGCGATGGA CACCATCATG TCCGGGGAGG CCACCCCTTC GCAGATCGCC GGGTTCCTGG TGGCGCTGCG GTCAAAGGGC GAGACGGTGG ACGAGATAGC CGGGCTTGTC GAGGCCATGC TGGCGCACGC GAACCCTGTC GACATCGCCG GCGAGAAGCT GGATATTGTG GGCACCGGCG GTGACCAGCT GAACACCGTC AACATTTCCA CCATGGCCGC CCTCGTCTGT GCGGGAGCGG GCGCAAAGGT GGTTAAGCAT GGCAACCGTG CGTCGTCGTC GTCGTCCGGG TCGGCGGACG TGCTGGAAGC GCTCGGCGTC CGGCTGGACC TGCCCATCGA ACGCGTTGCG CGCAACGCAG AAGAAGCGGG GATCACGTTC TGCTTCGCCC AGGTGTTCCA CCCCTCCTTC CGGCACACTG CCGTCCCGCG CCGGGAGCTT GCCATACCCA CGGCGTTCAA CTTCCTCGGC CCGCTGACCA ACCCCGCGCG GGTTCAGGCC TCGGCGGTTG GTGTCGCCAA CGAAAGGATG GCCCCCTTGG TTGCCGGGGT CCTGGCCAAG CGCGGAAGCC GCGGCCTGGT TTTCCGGGGG AGTGACGGAC TGGACGAACT GACAACCACC GGACCGTCCA CTGTCTGGGA GATCCGGAAC GGCGAAGTGA CGGAACAGAC GTTTGATCCC CAGGCGCTGG GCATCCGTGC CGCAACAGTG GAGGAGCTCC GCGGCGGCGA CGCGACAGCC AATGCCGCCG TCGTCCGTGA TGTCCTGAGC GGCGTGGCGG GCCCGGCCCG TGACGCCGTC CTTCTGAATG CCGCCGCGGG CCTGGTGGCA TTCGACGTCG ATGCCGAGGG CACGCTCACG GACCGGATGG CGGCCGCGTT GAAGCGAGCG GAAGAGTCCA TTGACTCCGG TGCCGCGGCG GCCGTGCTGG AAAAGTGGGT CGCCCTTACC CGGGGCTAG
|
Protein sequence | MTSQASAQAA GNTWPRLISA LINGADLTAD NTEWAMDTIM SGEATPSQIA GFLVALRSKG ETVDEIAGLV EAMLAHANPV DIAGEKLDIV GTGGDQLNTV NISTMAALVC AGAGAKVVKH GNRASSSSSG SADVLEALGV RLDLPIERVA RNAEEAGITF CFAQVFHPSF RHTAVPRREL AIPTAFNFLG PLTNPARVQA SAVGVANERM APLVAGVLAK RGSRGLVFRG SDGLDELTTT GPSTVWEIRN GEVTEQTFDP QALGIRAATV EELRGGDATA NAAVVRDVLS GVAGPARDAV LLNAAAGLVA FDVDAEGTLT DRMAAALKRA EESIDSGAAA AVLEKWVALT RG
|
| |