Gene Rleg_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1810 
Symbol 
ID8012868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1801297 
End bp1802313 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content64% 
IMG OID644824401 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_002975634 
Protein GI241204538 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.752321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00592071 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTGCCCGA AAGATAATCA TTTGATTTCC AAGGACCTCG CAGCGCCTTT CCAAATCGGA 
CCCGTGTCCG TGCGGAACCG CGTTGTACTG GCGCCGATGT CCGGCGTCAC GGATATGCCC
TTCCGCGAGC TTGCCTGGCG CTTCGGCGCT GGCCTCGTCG TCACCGAGAT GGTGGCGAGC
CGTGAACTGG TCAACGACAC GGCCGAATCC TGGTCGCGGC TTAGCGCTGC GGGCTTCCGG
CCGCATATGG TGCAGCTTGC CGGGCGCGAG GCGCACTGGA TGGCGGAGGC GGCCAAGATC
GCCGCCGATC ACGGCGCCGA TATCATCGAC ATCAACATGG GTTGCCCGGC AAAGAAAGTG
ATCGGCGGTT ATTCCGGCTC GGCGCTGATG CGCGATCCCG ATCACGCGCT CGGCCTCATC
GAGGCGACGG TCAAGGCCGT CGACATTCCG GTGACGCTGA AGATGCGCCT TGGCTGGGAT
GAGAATTCGA TCAACGCGCC TGATATCGCC CGCCGCGCCG AGGCGGCCGG CATCCAGCTT
GTGACCATTC ATGGGCGCAC CCGCATGCAA TTCTATGAAG GCCGCGCCGA TTGGGATGCG
ATCCGCGCCG TCCGCGAGGT GATCTCCATT CCGCTGATCG CCAACGGTGA TGTCGAAACG
GCAAGCGATG CGCAGGAAAT ATTGCGCCGC TCCGGCGCCG ATGCCGTGAT GATCGGCAGG
GGCTGCCAGG GCAGGCCATG GCATGCCGGC GTCATATCGG GGGCGCCCGC ACCGCAATCC
CTGAAGATCG CCGATATCGC CGTCGAGCAT TACCGGATGA TGCTGGATTT CTACGGCGAG
GCGGTGGCGA TCCGCCATGC CCGCAAGCAC CTTGGCTGGT ATCTCCAGCG TTTCGCGCCT
GATCTGTCAG GCCCTGAAAA GGCTGAGATC ATGACCTCGC GCGACCCGCG CGAGGTGGCC
GCGCGCCTTT ACGATGCATT GGCGGCCAGT GTTGTCGACA GCCGGGAGGC GGCATGA
 
Protein sequence
MCPKDNHLIS KDLAAPFQIG PVSVRNRVVL APMSGVTDMP FRELAWRFGA GLVVTEMVAS 
RELVNDTAES WSRLSAAGFR PHMVQLAGRE AHWMAEAAKI AADHGADIID INMGCPAKKV
IGGYSGSALM RDPDHALGLI EATVKAVDIP VTLKMRLGWD ENSINAPDIA RRAEAAGIQL
VTIHGRTRMQ FYEGRADWDA IRAVREVISI PLIANGDVET ASDAQEILRR SGADAVMIGR
GCQGRPWHAG VISGAPAPQS LKIADIAVEH YRMMLDFYGE AVAIRHARKH LGWYLQRFAP
DLSGPEKAEI MTSRDPREVA ARLYDALAAS VVDSREAA