Gene Rleg_3873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3873 
Symbol 
ID8014696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3943617 
End bp3944726 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content62% 
IMG OID644826443 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_002977655 
Protein GI241206559 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0584274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTTG AGATGAGCAA GCCCGTTCCG CGTCCCGGTA TTCTCGATAT CGCAGCCTAT 
GTGCCGGGCA AGGAACATGC GCCGGGTGTT GCCCGCGTCT ACAAGCTTTC GTCCAACGAA
ACGCCGCTCG GCGCCAGCCC GAAGGCAATC GAAGCCTTCA AGACGGTTGC CGACAATCTG
GGGCGTTATC CTGACGGGCA GGCGATCGAA CTGCGTGAGG CGATTGCCGC CGTGCACGGC
CTCAATCCGG CAAACATTCT CTGCGGCAAC GGTTCCGACG AACTGCTCGG CTTGCTCTGC
CATGTCTATC TCGGTGCCGG CGACGAGGGC ATCATCACCG AGCACGGCTT CCTCGTCTAC
AAGATCCAGA TCCTGGGCGC CGGCGCCACG CCTGTTGTCG TCAAGGAGAA AGACTATACC
GTCGATGTCG ATGCGATCCT TGCCGCGGTG ACCGAGAAGA CGAAGATCGT CTTCATCGCC
AATCCCGGCA ATCCAACCGG CACCTATGTT TCCGTCAGCG AGATCCGCCG CCTTCAGGCC
GGACTGCCGA AACATGTCGT CCTCGTGCTC GATGCCGCCT ATGCCGAATA TGTGCGCCGC
AACGATTATG AAGCCGGCAT CGAGGTCGTC TCCTCCAATG CCAACGTGGT GATGACCCGC
ACCTTCTCGA AGGCTTATGG CCTTGCGGCG CTGCGCGTCG GCTGGATGTA TGCGCCCGCC
GAGATCGTCG ATGCGCTGAA TCGCGTGCGC GCGCCGTTCA ACTTGAACGC GCCGGCAATC
GCCGCCGCTG CCGCTGCCAT CCGCGACCAG GCCTTCATCC AGCAGGCCGT CTCCTTCAAT
CAGATGTGGG TCGAGACGCT CACCCAGGCA CTCGAAGCGA TCGGGTTGAA GGTGACGCCG
TCCGTCGCCA ATTTCGTCCT CATTCATTTC CCCGAGATCG ACGGCAAGCG CGCCGCGGAT
GCCGATGATT TGTTGACGAG CCGCGGCTAC ATCCTGCGCG CCGTGCGCGG CTATGGTTTC
GCCAATGCGC TGCGCATGAG CATCGGCCCC GAAGAGGCCA ACCGCGGCGT GATTGCCGCG
CTCACCGAAT TCATGGGTCA TCAGGCATGA
 
Protein sequence
MSVEMSKPVP RPGILDIAAY VPGKEHAPGV ARVYKLSSNE TPLGASPKAI EAFKTVADNL 
GRYPDGQAIE LREAIAAVHG LNPANILCGN GSDELLGLLC HVYLGAGDEG IITEHGFLVY
KIQILGAGAT PVVVKEKDYT VDVDAILAAV TEKTKIVFIA NPGNPTGTYV SVSEIRRLQA
GLPKHVVLVL DAAYAEYVRR NDYEAGIEVV SSNANVVMTR TFSKAYGLAA LRVGWMYAPA
EIVDALNRVR APFNLNAPAI AAAAAAIRDQ AFIQQAVSFN QMWVETLTQA LEAIGLKVTP
SVANFVLIHF PEIDGKRAAD ADDLLTSRGY ILRAVRGYGF ANALRMSIGP EEANRGVIAA
LTEFMGHQA