Gene Rleg_5047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5047 
Symbol 
ID8007640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp429264 
End bp431114 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content63% 
IMG OID644821962 
Producthypothetical protein 
Protein accessionYP_002973222 
Protein GI241113387 
COG category[S] Function unknown 
COG ID[COG4289] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.193591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATATATG ATCCCGCCCG GGCCAATCCG CTTCTCGGCA ATCCCCTGAA GACACGCGAC 
GACCTCGCCA AGGCAGTCAC CGATCTCTTC GAGCCGCTGC TGCCGTATTT TTCCGAAGGC
GGCGCCCGTG TGCGCCTCGG CGCAGCCGGC GCGATTTTCG ATCGGGCGGC GGCGGATCTG
GAGGGATTTG CGCGGCCGCT CTGGGGGATC GTTCCGCTCG TTGCCGGCGG CGGCGCGTTT
CCGCATTGGG ACCTCTATCG CCGCGGGCTG GCAAACGGCA CCAATCCTGC TCATCCCGAA
TATTGGGGCG ATCTTGCCGA CCGCAATCAG CGGCTGGTCG AGCTCGCCGC GGTCGGCTTC
GCCCTGGCGC TCGTGCCCGA GCACATCTGG GAACCGCTCA ACGACGGCGA GAAGAAGACG
GTCGCTGCCT ATCTTCTTCG AGCGCGCGAG TTGGAATTCA TCGACAATAA CTGGAAATTC
TTCCGTGTGC TCATCGATCT CGGGTTGGAA CGCGTGGGCG TGGCGTTCGA TCACCGGAAA
ACCCTCGCCT ATCTCGAAGA ACTGGAGGCC TTCGACCTCG GAGAAGGCTG GTATCGCGAC
GGGCCGGTTC GGCGGGTCGA TCATTACATT CCCTTTGCCA TGCATTTCTA CGGAATGATC
TATGCCGTCC TGGCCAAGGG CGACGAGGCG CGCAAGGATC GCTTCCGCGA TCGCGCCGAG
ATCTTCGCCA GCGATATCCG CCACTGGTTC GGCCCGGATG GGGCAGCCCT TGCCTTCGGC
CGCAGCCAGA CCTACCGCTT CGCGGCCGGA GGTTTCTGGG GCGCGCTTGC CTTTGCCGGT
GTCGAAGCCC TGCCCTGGGC CGAGATCAAG GGCTATTACA TGCGCCATAT CCGCTGGTGG
GCGGCGATGC CGATTGCCGA TCGCGACGGC GTTCTTTCGG TCGGCTATGG CTATCCGAAC
CTCTTCATGA GCGAGAGCTA CAACTCTCCC GGCTCGCCCT ATTGGGCGCT GAAATTCTTC
CTGCCGCTCG CCCTTCCGGG GGATCATCGC TTCTGGGCGG CCGAGGAGGC GTCGCAGCCG
GAATTTCCGG AGCCGGTTGC GTTGAAGCCG GCGGGAATGG TCGCCATGCA CACGCCGGGA
AACGTGGTCG TGCTCTCCTC AGGGCAGCAG CACGACAAGA TGCTCGGTGC AAACGAGAAA
TATTCGAAAT TCGTCTATTC CACCCGCTAC GCCTTCAACG TCGAAGCCGA CGACCGGAAT
TTCTCCGCCG CAAGCTTCGA CGGCATGCTC GGCCTCTCCG ACGACGGCGT CCATTTCCGC
ATGCGCGAAA CCCTCGAAGA GGCGTTGATC GCAGGCGACC TGCTCTATTC GCGCTGGCGC
CCCTGGAGCG ATGTCACTAT CGAAACCTGG CTGCTTCCTG AAAATCCGTG GCACATCCGC
ATTCACCGCA TCGCCACGCC ACGCACACTC AGCACCATCG AGGGCGGTTT TGCGATCGAG
CGCGCGGATT TCAATGCCGA CCGCTCCGAT GCAAGGGATG GCCGGGCTGT CTGGTACGGG
CAGACCGACG TCAGCGCCAT CGTCGATCTA TCGCCCAATC CAAGGGCCGG CCATGCGATG
AGCCCGATCC CGAACACCAA TCTCATCCAC GCCAAGACCC TACTGCCGCA GCTGCGCGGC
AACATCGGCG CAGGCACCAT CGTGCTGGTG ACCGCCGCGA TGGCCCTGCC CAGCCGTGAG
AACTGGGCAA AAGCGCTCGA TAATCCGCCA GCCCGTCCCC GCCTCGACGA GGTAGAGCGG
CTCTTCCGCG AGAAAGGCGT ACAGGTGCCG GCATTCGCCC TCGGGATGTA G
 
Protein sequence
MIYDPARANP LLGNPLKTRD DLAKAVTDLF EPLLPYFSEG GARVRLGAAG AIFDRAAADL 
EGFARPLWGI VPLVAGGGAF PHWDLYRRGL ANGTNPAHPE YWGDLADRNQ RLVELAAVGF
ALALVPEHIW EPLNDGEKKT VAAYLLRARE LEFIDNNWKF FRVLIDLGLE RVGVAFDHRK
TLAYLEELEA FDLGEGWYRD GPVRRVDHYI PFAMHFYGMI YAVLAKGDEA RKDRFRDRAE
IFASDIRHWF GPDGAALAFG RSQTYRFAAG GFWGALAFAG VEALPWAEIK GYYMRHIRWW
AAMPIADRDG VLSVGYGYPN LFMSESYNSP GSPYWALKFF LPLALPGDHR FWAAEEASQP
EFPEPVALKP AGMVAMHTPG NVVVLSSGQQ HDKMLGANEK YSKFVYSTRY AFNVEADDRN
FSAASFDGML GLSDDGVHFR MRETLEEALI AGDLLYSRWR PWSDVTIETW LLPENPWHIR
IHRIATPRTL STIEGGFAIE RADFNADRSD ARDGRAVWYG QTDVSAIVDL SPNPRAGHAM
SPIPNTNLIH AKTLLPQLRG NIGAGTIVLV TAAMALPSRE NWAKALDNPP ARPRLDEVER
LFREKGVQVP AFALGM