Gene Rleg_5717 details


Gene Information

Locus tag: Rleg_5717
Symbol:
ID: 8016680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 298869
End bp: 300149
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 55%
IMG OID: 644827866
Product: hypothetical protein
Protein accession: YP_002979066
Protein GI: 241518438
COG category: [S] Function unknown
COG ID: [COG4949] Uncharacterized membrane-anchored protein conserved in bacteria
TIGRFAM ID:
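
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 298869
end_bp = 300149
gene_length_bp = 1281
protein_length_aa = 426

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the span is 1281 bp, which is 427 codons, giving 426 residues once the stop codon is excluded, in agreement with the listed gene and protein lengths.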


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0570912
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.514525
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCAA AGGGCGCAGA GGAAGCATTC TTCCCAACGG GGTCTGCTTC CGCACCACAG 
GTGTCTCGTC CAGCTGCTTT TGAACAGAAG CCCGCAGACT TCAACTCGGA GCTTCACGCT
CGACCGTCGA TTTATTTCAC CGGTCCGGCG ATCGTCGAAC ACTTTGCTTT CATGCCGTCG
GATGGCGTGA TCAAGGAGTT CCACGATAGT CTCCAAGCTG ATGGTGGAAT TTCCGTCAGA
GTGGAACGGC ACACTGAGTT CGTGACTGTC ACGCGGGTCC GAAAATTGGC CAGCGAGCCC
GAGGATTGGC CGGAAACTGA CCTTTGTGAA GGTGATTTTG CGCGGCTAGC AGGGTTGAGC
TCTCCTCTCC TCGTTTGCCA CGTGAGTATC CTTGTCCTCG GAAACCCTCC GGACCAGCTG
GGAACGGTTC TAAAATCCCT CGACTTCGGC GACACCGCCG CGTCATCAAT CGGCGGCGGG
GCGGCGCAAG TTTGCTCCGA TTTTCGCGTT CGAGGGGACA ATTCAAGCAG GATCATCCTG
TTCAACAAGG ACCTGAATGC ACATCGGCTG GGGCGCATGG TACGGCGGAT CTTTGAGATC
GAAACCTATA GGTCAATGGC GCTGCTCGGA TTGCCGGAGG CGCGTCGTCT TGCCCCGCTT
CTGGGCGGAT ATGACGCGGA GCTCGTTCGG CTGACCAATC GAAACTTGAG TACGCCTGCA
CATCAGCACA AACAATTGCT CGAGGAAATT ACTGTTCTCT CCTCGCATAT CATTTCAGCC
ACCGCGGAAA CCAGAAACAG GTTTGGCGCA ACCGCTGCCT ATGCCAAAAT CGTTGAAGAA
AGGATCGCCC TTTTACGGGA AACCCATGTC CCCGGCTTTC AACGTTTCGG TACCTTCGTG
GAGCGCCGGT TCAAGCCTGC GGTGCGTACC TGCGAAGCAA CCGCGTTGAG GCTTGAGCAC
CTATCAAGGG CTGCGATGCA CCTGCTCGAC CTGCTACAAA CCCGAATCCA GGTCGAGATT
GAGTTCCAGA ACTCTACACA GATCCAGGCG ATGGCTGATC GGGCCGCGAC GCAGGTCAAG
ATCCAGCGCG CGGTCGAAGG CTTTTCGATG ATCGCGATTA GCTACTACTT GCTGAGCTTG
CTGAAATTTA TATTTGAGAC AGCAGACCAC GCAGGATTCC ATTTCGATCC GATGATCATG
CTCGTCGCTG TTCCGGTGGT TGTAGGATCT GTTGTGATTA CCATCCTCCG CGTCAAGCAT
GCCTTAAAGG CAGAGAGCTA G
 
Protein sequence
MNSKGAEEAF FPTGSASAPQ VSRPAAFEQK PADFNSELHA RPSIYFTGPA IVEHFAFMPS 
DGVIKEFHDS LQADGGISVR VERHTEFVTV TRVRKLASEP EDWPETDLCE GDFARLAGLS
SPLLVCHVSI LVLGNPPDQL GTVLKSLDFG DTAASSIGGG AAQVCSDFRV RGDNSSRIIL
FNKDLNAHRL GRMVRRIFEI ETYRSMALLG LPEARRLAPL LGGYDAELVR LTNRNLSTPA
HQHKQLLEEI TVLSSHIISA TAETRNRFGA TAAYAKIVEE RIALLRETHV PGFQRFGTFV
ERRFKPAVRT CEATALRLEH LSRAAMHLLD LLQTRIQVEI EFQNSTQIQA MADRAATQVK
IQRAVEGFSM IAISYYLLSL LKFIFETADH AGFHFDPMIM LVAVPVVVGS VVITILRVKH
ALKAES