Gene Rleg2_3580 details


Gene Information

Locus tag: Rleg2_3580
Symbol:
ID: 6982341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3707227
End bp: 3708336
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 63%
IMG OID: 643398305
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002283073
Protein GI: 209551156
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
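
The start, end, gene length, and protein length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function and variable names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 3707227
END_BP = 3708336


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1110 369
```

Under these assumptions the computed values agree with the reported gene length of 1110 bp and protein length of 369 aa.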


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0639524
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTTG AGATGAGCAA GCCCGTTCCG CGTCCCGGTA TTCTCGATAT CGCATCCTAT 
GTGCCGGGCA AGGAACATGC GCCGGGGGTT GCCCGCGTCT ACAAGCTTTC GTCCAACGAA
ACGCCGCTCG GCGCCAGCCC GAAGGCGATC GAAGCCTGCA AGGCGGCCGC CGACCATCTG
GGACGTTATC CCGACGGGCA GGCGGTGGAA TTGCGCGAGG CGATCGCCGC GGTGCACGGC
CTCAACCCGG CAAACATCCT CTGCGGTAAC GGTTCCGACG AGCTACTCGG CCTGCTCTGC
CATGTCTATC TCGGCGCCGG CGACGAGGGC ATCATCACCG AGCACGGTTT CCTGGTCTAC
AAGATCCAGA TCCAGGGCGC CGGGGCCACG CCCGTCGTCG TCAGGGAAAA GGATCATACC
GTCGATGTCG ATGCGATCCT TGCTGCGGTG ACGGAGAAGA CGAAGATCGT CTTCATCGCC
AATCCCGGCA ATCCGACAGG AACCTATGTC CCGGTCAGCG AGATCCGCCG CCTACAGGCC
GGGCTGCCGA AACATGTCGT CTTGGTTCTC GATGCCGCCT ATGCCGAATA TGTGCGCCGC
AACGATTATG AAGCCGGCAT CGAGGTCGTG TCCTCCAATG CCAATGTGGT GATGACCCGC
ACCTTCTCGA AGGCCTACGG ACTTGCAGCG CTACGCGTCG GCTGGATGTA TGCGCCCGCC
GAGATCGTCG ACGCGGTGAA CCGCGTGCGC GGCCCCTTCA ATCTGAACGC GCCGGCAATC
GCCGCCGGGG CCGCGGCCAT CCGCGACCAG GCCTTCGTCC AGCAGGCCGT CTCCTTCAAC
CAGACGTGGG TAGAAACGCT CACCCAGGCC CTCGAAGCGA TCGGTCTGAA GGTGACGCCG
TCGGTCGCCA ATTTCGTCCT CATCCATTTC CCCGAGATTG ACGGCAAGCG CGCCGCCGAT
GCCGACGATC TGCTGACGAG CCGGGGCTAC ATCCTGCGCG CCGTGCGCAG CTACGGCTTT
TCCAATGCGC TGCGCATGAG CATCGGCCCG GAAGAGGCCA ATCGCGGCGT CATCGCCGCC
CTCACCGAAT TCATGGGACA TAAGGCATGA
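
The GC content field reported earlier in this record can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown above; `gc_percent` is an illustrative helper, not a database function.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# The first 10-base block of the gene sequence, as a small example.
print(gc_percent("ATGAGCGTTG"))  # 50.0
```

Passing the full gene sequence to the same function would give the value to compare against the rounded percentage in the record.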
 
Protein sequence
MSVEMSKPVP RPGILDIASY VPGKEHAPGV ARVYKLSSNE TPLGASPKAI EACKAAADHL 
GRYPDGQAVE LREAIAAVHG LNPANILCGN GSDELLGLLC HVYLGAGDEG IITEHGFLVY
KIQIQGAGAT PVVVREKDHT VDVDAILAAV TEKTKIVFIA NPGNPTGTYV PVSEIRRLQA
GLPKHVVLVL DAAYAEYVRR NDYEAGIEVV SSNANVVMTR TFSKAYGLAA LRVGWMYAPA
EIVDAVNRVR GPFNLNAPAI AAGAAAIRDQ AFVQQAVSFN QTWVETLTQA LEAIGLKVTP
SVANFVLIHF PEIDGKRAAD ADDLLTSRGY ILRAVRSYGF SNALRMSIGP EEANRGVIAA
LTEFMGHKA
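
The protein sequence corresponds to the translation of the gene sequence under translation table 11. The following is a minimal Python sketch, assuming that the codon-to-amino-acid assignments of table 11 are the same as those of the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not handle); `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codons in TCAG order, paired with one-letter amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA up to the first stop codon, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGCGTTG AGATGAGCAA GCCCGTTCCG"))  # MSVEMSKPVP
print(translate("CATAAGGCAT GA"))  # HKA
```

The two outputs match the first ten and last three residues of the protein sequence listed in the record.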