Gene Rleg_6017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6017 
Symbol 
ID8016282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp45725 
End bp46795 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content59% 
IMG OID644827328 
Producthypothetical protein 
Protein accessionYP_002978528 
Protein GI241258644 
COG category[R] General function prediction only 
COG ID[COG5621] Predicted secreted hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGTA GAAAGTCGGC ATCTGTCTTG ATAGCGGCGG CTATGATCTG CGCGAGCGGT 
CAGGCATTCG CGCAAGGCTT CGCCGGATTG GGATCGGATG CGCAAGGTTT TGCGATCCCG
GAGCGGGGTT CTGTTCTTTC TTTCCCCGCC GATCATGGCG CTCATCCTGA TTATCGCATT
GAGTGGTGGT ATGTGACTGC CAATCTCAAA GACGAGGATG GCAGGCAATA TGGAGCGCAG
TGGACGCTGT TTCGCTCTGC GCTGGCTCCG GGAGACAAGG CAGGTTTCGC GGATCCGCAG
ATCTGGGCTG GGCACGCGGC GATCACCACC CAAGGTCATC AGTACGTCAC TGAGCGCTTG
GGGCGGGGAG GCGTCGGGCA GGCAGGCGTT GCCGCAAGAC CATTTCGGGC TTGGATAGAC
GACTGGCGGT TGGAGGGTAG CGAGCGAACC GGCTCGGACG CCTTTGGCAA CCTTTCAGTC
TCTGCGGGCG GGCTGGACTT CAGCTACACC TTAGACCTCA AGGCCGACGG CCCGCTCGTT
CTTCAGGGGG AAAACGGCTT TTCCGTGAAG TCGGCGAACG GGCAGGCGAG CTACTATTAC
TCGCAGCCTT TCTATGAGGT GGCGGGAACG ATCACGACAT CCGGAGCACC GGTTAAGGTC
ACTGGCAAAG CCTGGCTGGA TCGGGAGTGG TCGTCGCAGC CGCTTGCGTC CAATCAGACG
GGGTGGGATT GGTTCTCACT GCATCTGAAT TCCGGCGACA AGCTGATGGC TTTTCGCCTT
CGTGATGACA AGGACGGGTT TATCTCCGCG AACTGGATAT CGGCGGATGG ACGAACGACA
CCTTTGTCGA AAGACGACGT CCAACTGGAG CCGACGCGGA AGGCAACGGT CGATGGGCGC
CGGATGCCGG TTGAGTGGCG CATACGCGTG CCGAGTAAGT CACTTGATAT TACGACGAAA
CCGCTGAACG AGCAGTCCTG GATGGCGACC TCTACGCCTT ATTGGGAGGG GCCGATCAAC
TTCACAGGCT CCACGTCAGG TGTTGGATAT CTTGAAATGA CCGGCTATTA G
 
Protein sequence
MNGRKSASVL IAAAMICASG QAFAQGFAGL GSDAQGFAIP ERGSVLSFPA DHGAHPDYRI 
EWWYVTANLK DEDGRQYGAQ WTLFRSALAP GDKAGFADPQ IWAGHAAITT QGHQYVTERL
GRGGVGQAGV AARPFRAWID DWRLEGSERT GSDAFGNLSV SAGGLDFSYT LDLKADGPLV
LQGENGFSVK SANGQASYYY SQPFYEVAGT ITTSGAPVKV TGKAWLDREW SSQPLASNQT
GWDWFSLHLN SGDKLMAFRL RDDKDGFISA NWISADGRTT PLSKDDVQLE PTRKATVDGR
RMPVEWRIRV PSKSLDITTK PLNEQSWMAT STPYWEGPIN FTGSTSGVGY LEMTGY