Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2016 |
Symbol | trpD |
ID | 8013049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2008297 |
End bp | 2009313 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644824603 |
Product | anthranilate phosphoribosyltransferase |
Protein accession | YP_002975834 |
Protein GI | 241204738 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0352471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.432808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TGAAGCCGTT CCTGGCCAAG GCCGCAAGCC GCGAGCCGCT GACGCGTGAC GAGGCCCGCG CTGCCTTCGA CATCCTGATG TCGGGCCAGG CGACACCCTC GCAGATCGGT GGCTTCCTGA TGGCGCTGCG CGTGCGCGGC GAAACCGTCG ACGAGATCGT CGGCGCCGTC ACCGCAATGC GCTCGAAAAT GCTGACCGTC GAGGCGCCGG CCGATGCGAT CGACATTGTC GGCACCGGCG GCGATGCCAG CGGCACCTAC AATATCTCGA CGCTGGCGGC GCTAATCGTC GCCGGCGCTG GTGTTCCCGT CGCCAAACAC GGCAATCGGG CGCTGAGTTC GAGATCGGGC GCGGCCGACA ATCTGGCCGC ACTCGGCGTC AAGCTCGACG TCGGCCCCGA GATCATCTCC CGCTGCATTG CCGAGGCCGG CGTCGGATTC ATGTTCGCGC AGATGCATCA TTCCGCCATG CGCCATGTCG GCCCCTCAAG GGTCGAGCTC GGCACGCGGA CGATCTTCAA CTTGCTCGGG CCGCTCTCCA ATCCGGCCGG CGTTCGCCGC CAACTGCTCG GCGTCTTCTC GCCGCAATGG CTGGTGCCGC TTGCCGAAGT CATGCGCGAT CTCGGCTCCG AATGCGTCTG GGTCGTCCAT GGCGACGGCC TCGACGAGAT CACCACCACC GGCATCACAC AAGTCGCGGC ACTCGAAGGC GGCAAGATTC GCACCTTCGA GCTCTCGCCG GCCGATTTCG GCGTCAGCCC TTGCCTGCTC GCCGACATCA AGGGCGGTGA CGGTGTCGCC AATGCTGCAG CCCTTCGCGA GGTGCTCGGC GGCGCCAAGA ATGCCTATCG CGATGTCTCG CTCGCCAATG CCGCCGCCTC GCTCGTCATC GCCGGCAAGG TCGAGACGAT CCGCGACGGC ATGACGCTGG CCACGCAGTC GCTGGATAGC GGCTCCACCG CGCTTGCCCT CGACAAACTC ATCGCCGTTT CCAACGATAT CGACTAG
|
Protein sequence | MTDLKPFLAK AASREPLTRD EARAAFDILM SGQATPSQIG GFLMALRVRG ETVDEIVGAV TAMRSKMLTV EAPADAIDIV GTGGDASGTY NISTLAALIV AGAGVPVAKH GNRALSSRSG AADNLAALGV KLDVGPEIIS RCIAEAGVGF MFAQMHHSAM RHVGPSRVEL GTRTIFNLLG PLSNPAGVRR QLLGVFSPQW LVPLAEVMRD LGSECVWVVH GDGLDEITTT GITQVAALEG GKIRTFELSP ADFGVSPCLL ADIKGGDGVA NAAALREVLG GAKNAYRDVS LANAAASLVI AGKVETIRDG MTLATQSLDS GSTALALDKL IAVSNDID
|
| |