Gene Rleg2_3830 details


Gene Information

Locus tag: Rleg2_3830
Symbol:
ID: 6982593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3962210
End bp: 3963271
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 62%
IMG OID: 643398552
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002283318
Protein GI: 209551401
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
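
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one terminal stop codon; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3962210
end_bp = 3963271
gene_length_bp = 1062
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon, which is not counted
# as an amino acid in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three values agree: 1062 bp is 354 codons, of which 353 encode amino acids.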


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCT ACTGGTCTGA TATCGTCAGC AAGCTCCGAC CCTATGTCGC CGGGGAGCAA 
CCCCGTATCC CCGGTCTGGT CAAGCTCAAC ACCAACGAGA ACCCCTACGG CCCGTCTCCA
GCGGCGCTGG AAGCGATCGG CCAAGCGGCG GATGATCGTC TGCGGCTCTA TCCCGATCCG
GCGGCGACGG AATTGCGCGA GACGATCGCT GCCCGTCACG GCCTGACGGC GGATGAAGTC
TTCGTCGGCA ACGGCTCAGA CGAGGTCCTC GCCCATGCGT TTCAGGCGCT GCTGAGACAT
GAACTGCCGC TTCTCTATCC CGACATAAGC TACAGCTTCT ATCCGACTTA TAGCCTGCTA
TACGACATCG AAGCGATCGA AGCGCCGGTC GATGATACGT TCCAGATCCG GCTGGCGGAT
TACGACAGGC CGTGCGGGGC GATCATCATC CCCAATCCGA ATGCGCCGAC CGGCATCGGC
TTGCCGCTTG CCGACATAGA GGCGCTTGTC GCCACCCATC CGGACGCGGT CGTGGTGATC
GACGAGGCCT ATGTCGATTT CGGCGGTGAC AGTGCCATCC CGCTCATTTC CAAATATCCC
AACCTGCTTG TCGTTCAGAC CTTGTCGAAA TCCCGCTCCT TTGCCGGCCT GCGCGTCGGT
TTCGCGCTTG GGCAGCGGGA GCTGATCGAG GCGCTGGTGC GCGTCAAGGA CAGCTTCAAT
TCCTATCCGC TCGATCGCCT GGCGCAGGTT GCCGCAACGG CGGCGATCAA GGACGAGGCG
TGGTTCGAGG CATGCCGGAC GAAGCTCATC GCCAGCCGGG ACGGTCTCGT CCGGGACCTC
GAAGCGCTGG AATTCGAAGT GCTGCCGTCT CAGGCGAATT TCGTTTTCGC ACGGCATGAA
AGCCGGTCGG GTGCCGCGCT GCAAGCCGCT CTGCGGGAGC GAGGTGTTCT CGTTCGGCAT
TTCGCCAAGC CGCGCATTTC GGATTTCCTG CGCATCAGCA TCGGCACGAA CGAGGAGTGC
GCCCGTCTGG TTTCCGCTCT CAAGGAAATA CTGGCAGCCT GA
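
The Gene Length and GC content fields can be recomputed from the block-formatted sequence above. This is a minimal sketch, assuming the sequence is pasted as plain text with spaces and line breaks between 10-base blocks; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(seq_text):
    """Remove spaces and line breaks between blocks and upper-case the bases."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text):
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAGCCGCT ACTGGTCTGA TATCGTCAGC AAGCTCCGAC CCTATGTCGC CGGGGAGCAA"

print(len(clean_sequence(first_row)))   # 60
print(round(gc_percent(first_row), 1))  # GC percentage of this row only
```

Passing the full sequence instead of `first_row` allows a comparison with the 1062 bp and 62% reported in the record.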
 
Protein sequence
MSRYWSDIVS KLRPYVAGEQ PRIPGLVKLN TNENPYGPSP AALEAIGQAA DDRLRLYPDP 
AATELRETIA ARHGLTADEV FVGNGSDEVL AHAFQALLRH ELPLLYPDIS YSFYPTYSLL
YDIEAIEAPV DDTFQIRLAD YDRPCGAIII PNPNAPTGIG LPLADIEALV ATHPDAVVVI
DEAYVDFGGD SAIPLISKYP NLLVVQTLSK SRSFAGLRVG FALGQRELIE ALVRVKDSFN
SYPLDRLAQV AATAAIKDEA WFEACRTKLI ASRDGLVRDL EALEFEVLPS QANFVFARHE
SRSGAALQAA LRERGVLVRH FAKPRISDFL RISIGTNEEC ARLVSALKEI LAA
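
The protein sequence corresponds to reading the gene sequence in frame from its first base and stopping at the terminal TGA codon. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code; it does not model the alternative start codons that table 11 permits, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence; both begin on a codon boundary.
first_row = "ATGAGCCGCT ACTGGTCTGA TATCGTCAGC AAGCTCCGAC CCTATGTCGC CGGGGAGCAA"
last_row = "GCCCGTCTGG TTTCCGCTCT CAAGGAAATA CTGGCAGCCT GA"

print(translate(first_row))  # MSRYWSDIVSKLRPYVAGEQ
print(translate(last_row))   # ARLVSALKEILAA
```

The two outputs match the first 20 and the last 13 residues of the protein sequence above.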