Gene Rleg_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2016 
SymboltrpD 
ID8013049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2008297 
End bp2009313 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content66% 
IMG OID644824603 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_002975834 
Protein GI241204738 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0352471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.432808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC TGAAGCCGTT CCTGGCCAAG GCCGCAAGCC GCGAGCCGCT GACGCGTGAC 
GAGGCCCGCG CTGCCTTCGA CATCCTGATG TCGGGCCAGG CGACACCCTC GCAGATCGGT
GGCTTCCTGA TGGCGCTGCG CGTGCGCGGC GAAACCGTCG ACGAGATCGT CGGCGCCGTC
ACCGCAATGC GCTCGAAAAT GCTGACCGTC GAGGCGCCGG CCGATGCGAT CGACATTGTC
GGCACCGGCG GCGATGCCAG CGGCACCTAC AATATCTCGA CGCTGGCGGC GCTAATCGTC
GCCGGCGCTG GTGTTCCCGT CGCCAAACAC GGCAATCGGG CGCTGAGTTC GAGATCGGGC
GCGGCCGACA ATCTGGCCGC ACTCGGCGTC AAGCTCGACG TCGGCCCCGA GATCATCTCC
CGCTGCATTG CCGAGGCCGG CGTCGGATTC ATGTTCGCGC AGATGCATCA TTCCGCCATG
CGCCATGTCG GCCCCTCAAG GGTCGAGCTC GGCACGCGGA CGATCTTCAA CTTGCTCGGG
CCGCTCTCCA ATCCGGCCGG CGTTCGCCGC CAACTGCTCG GCGTCTTCTC GCCGCAATGG
CTGGTGCCGC TTGCCGAAGT CATGCGCGAT CTCGGCTCCG AATGCGTCTG GGTCGTCCAT
GGCGACGGCC TCGACGAGAT CACCACCACC GGCATCACAC AAGTCGCGGC ACTCGAAGGC
GGCAAGATTC GCACCTTCGA GCTCTCGCCG GCCGATTTCG GCGTCAGCCC TTGCCTGCTC
GCCGACATCA AGGGCGGTGA CGGTGTCGCC AATGCTGCAG CCCTTCGCGA GGTGCTCGGC
GGCGCCAAGA ATGCCTATCG CGATGTCTCG CTCGCCAATG CCGCCGCCTC GCTCGTCATC
GCCGGCAAGG TCGAGACGAT CCGCGACGGC ATGACGCTGG CCACGCAGTC GCTGGATAGC
GGCTCCACCG CGCTTGCCCT CGACAAACTC ATCGCCGTTT CCAACGATAT CGACTAG
 
Protein sequence
MTDLKPFLAK AASREPLTRD EARAAFDILM SGQATPSQIG GFLMALRVRG ETVDEIVGAV 
TAMRSKMLTV EAPADAIDIV GTGGDASGTY NISTLAALIV AGAGVPVAKH GNRALSSRSG
AADNLAALGV KLDVGPEIIS RCIAEAGVGF MFAQMHHSAM RHVGPSRVEL GTRTIFNLLG
PLSNPAGVRR QLLGVFSPQW LVPLAEVMRD LGSECVWVVH GDGLDEITTT GITQVAALEG
GKIRTFELSP ADFGVSPCLL ADIKGGDGVA NAAALREVLG GAKNAYRDVS LANAAASLVI
AGKVETIRDG MTLATQSLDS GSTALALDKL IAVSNDID