Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0603 |
Symbol | trpD |
ID | 6374267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 635498 |
End bp | 636535 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642683116 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_001959043 |
Protein GI | 189499573 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.421903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0114034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTACA AGGAGTTACT TCACAAGCTG CTGACCGGCA CCGATCTTTC AGGCAAAGAG ATGGAAGAGT GTTTCTCGGG TATCATGCTG GGAGAGTACC CGGACAGTGT CATAGCGGCA ATTCTCGCCT TGCTTCAGAA AAAAGGAGTA ACCCCCGAAG AAGTTGCCGG AGCGTATTTC GCAATTATCT CAAAAGCCCT TCCGGTTCAG CTTGGTGATA ATGCCGTCGA TACGTGCGGT ACCGGAGGTG ATCAGGCAGG CACCTTCAAC ATCTCAACGG TAGCGGCAAT TATTGCAAAC GGCGCAGGAG TACCGATAGC CAAACATGGC AACAGGTCTG TGACGAGCCG GTGCGGCAGC GCCGATGTGC TTGAACAGCT TGGCTACCGA ATTCTTCTTC CTCCTGACAA AACCGAAATG CTTTTCCGTG AGACCGGATT CGCCTTTCTT TTCGCGCCCC TCTATCATCC GGCGATGAAA GCTGTCGCGC ATATACGCAG AGAACTCGGC ATAAAAACCA TTTTCAACAT GCTGGGGCCT CTGGTCAATC CGGCTAAAGT GCACAGGCAG GTGGTAGGCG TGTTCGATAT GCGCGTCATG GAAATCTACG CTCAGTCACT CATCAGGACA GGATGCAGCC ATGCCCTTGT CGTTCACGGC AAAACCGAAA ATGGGGACGG ACTTGATGAA GCAAGCATAT GCGGTCCGAC CCGTATTGTA GAAATTCAGA ACGGAGAAAT CACCTGTCAC GACGTAGAAC CTGAAACCTT CAGCCTGTCA CGGTGTACCA TAGCCGAACT TCAAGGAGGC GACAGCAGCC GGAATGCAGA CATACTTCTC AGGATTCTCG ACGGAAGCGC AACAAAAGCC CAGACAGATG CCGCGCTATT CAGTGCGGCT ATGGCATGTT ACGTATCCGG TAGAGCAACA TGCATTGACG ACGGCCTGAG CAAAGCAAAA GGCTCTCTGG AAAGCGGAAA CGCCTCGAAA CAATTCTCAC GCATCCTTGC CCTCAATGCA GAACTTGCCG GCAAATAG
|
Protein sequence | MQYKELLHKL LTGTDLSGKE MEECFSGIML GEYPDSVIAA ILALLQKKGV TPEEVAGAYF AIISKALPVQ LGDNAVDTCG TGGDQAGTFN ISTVAAIIAN GAGVPIAKHG NRSVTSRCGS ADVLEQLGYR ILLPPDKTEM LFRETGFAFL FAPLYHPAMK AVAHIRRELG IKTIFNMLGP LVNPAKVHRQ VVGVFDMRVM EIYAQSLIRT GCSHALVVHG KTENGDGLDE ASICGPTRIV EIQNGEITCH DVEPETFSLS RCTIAELQGG DSSRNADILL RILDGSATKA QTDAALFSAA MACYVSGRAT CIDDGLSKAK GSLESGNASK QFSRILALNA ELAGK
|
| |