Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1821 |
Symbol | trpD |
ID | 6980559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1868091 |
End bp | 1869107 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643396543 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_002281332 |
Protein GI | 209549415 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0027923 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGATC TGAAGCCGTT CCTGGCCAAG GCCGCAAGCC GCGAGCCGCT GACGCGTGAC GAGGCGCGCG CCGCCTTCGA CATTCTGATG TCGGGCCAGG CGACGCCCTC GCAGATCGGC GGCTTCCTGA TGGCGCTGCG CGTGCGTGGC GAAACCGTCG ACGAGATCGT CGGCGCCGTC ACCGCGATGC GCTCGAAAAT GCTGACCGTC GAAGCGCCTG AAGATGCGAT CGACATTGTC GGCACTGGCG GCGATGCCAG CGGTACCTAC AATATCTCGA CGCTGGCGGC GCTCATCGTC GCCGGCGCCG GTGTCCCCGT CGCAAAACAC GGCAACCGGG CGCTGAGCTC GAAATCGGGT GCGGCGGATA ATCTCGCCGC ACTCGGCGTC AAGCTCGATG TCGGCCCTGA GATCATCTCC CGCTGCATCG CCGAGGCCGG CGTCGGCTTC ATGTTCGCGC AGCTTCATCA TTCCGCCATG CGCCATGTCG GCCCGTCCCG CGTCGAACTC GGCACCCGCA CCATCTTCAA CCTGTTAGGC CCGCTTTCCA GTCCGGCCGG CGTCCGCCGC CAGCTGCTTG GCGTCTTCTC GCCGCAATGG CTGGTGCCGC TCGCCGAGGT CATGCGCGAT CTCGGCTCCG AATGCGTCTG GGTGGTCCAT GGCGACGGCC TCGACGAGAT CACCACGACA GGCATCACCA AGGTGGCCGC GCTCGAAGAC GGCAAGATCC GCACCTTCGA GCTATCGCCC GCCGATTTCG GCGTTAGCCC TTGCGTGCTC GCCGACATCA GGGGCGGCGA TGGCGTCGCC AATGCCGCAG CGCTTCGCGA GGTGCTCAGC GGCGCGAAGA ACGCCTATCG TGATATTTCG CTCGCCAATG CCGCGGCATC GCTCGTCATC GCCGGCAAGG TCGAAACGAT TCGCGATGGC ATGACGCTTG CCGCGCAGTC GCTCGATAGC GGTGCCACCG CGCTTGCCCT CGACAAACTC ATCGCCGTTT CCAACGATAT CGACTAG
|
Protein sequence | MTDLKPFLAK AASREPLTRD EARAAFDILM SGQATPSQIG GFLMALRVRG ETVDEIVGAV TAMRSKMLTV EAPEDAIDIV GTGGDASGTY NISTLAALIV AGAGVPVAKH GNRALSSKSG AADNLAALGV KLDVGPEIIS RCIAEAGVGF MFAQLHHSAM RHVGPSRVEL GTRTIFNLLG PLSSPAGVRR QLLGVFSPQW LVPLAEVMRD LGSECVWVVH GDGLDEITTT GITKVAALED GKIRTFELSP ADFGVSPCVL ADIRGGDGVA NAAALREVLS GAKNAYRDIS LANAAASLVI AGKVETIRDG MTLAAQSLDS GATALALDKL IAVSNDID
|
| |