Gene Rleg_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1888 
Symbol 
ID8012938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1875234 
End bp1876313 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content63% 
IMG OID644824477 
ProductABC transporter related 
Protein accessionYP_002975709 
Protein GI241204613 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.2512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC TCGAGCTCAG GAACATCGTC AAGAATTTCG GCGCCGTCGA GGTCATTCGC 
GATGTCTCGC TTCATGTCAA TGACGGCGAG TTCGTCGCTT TCGTCGGCCC TTCCGGTTGC
GGGAAATCGA CGCTCTTGCG CCTGATTGCC GGCCTCGATA AGCCGACTGA CGGCAGCATC
GCCATCGACG GCAAGGATGT TACCGCTATC AGCGCTGCCG ATCGCGGCCT GGCCATGGTC
TTCCAGTCCT ATGCGCTCTA TCCGCATATG AGTGTCAGGG AGAACCTCGC CTTTGGTCTC
GAGAACACCA AGGTGGCGAA AGCCGAGATC GAAGCGCGCA TTACCGACGC CGCGCGCATG
CTGGAGATCG AGCCTTTCCT GCAACGCCGT CCGGGCCAAC TCTCCGGCGG CCAGCGCCAG
CGCGTCGCCA TCGGCCGCGC CATCGTGCGG CGGCCGGATG CCTTTCTGCT CGACGAGCCG
CTATCCAATC TCGACGCCGA ACTCAGGGTC AGCATGCGGG CCGAACTGGC GGCCCTTCAC
GCCCGCCTGA AGGCGACGAT GATCTACGTC ACCCACGATC AGGTCGAGGC AATGACACTG
GCCGACCGCA TCGTCGTGCT GAGAGGCGGC AGGATCGAGC AGGTGGGAAC ACCGCTGGAA
CTCTACAACA AGCCGGCCAA CCGCTTCGTC GCCGGCTTCA TCGGCGCGCC GCACATGAAT
TTCCTCGAAG GTGCGATTGT CGGTCACGAG GGCGGTTTCG CTGAAGTCGA AACCGTCGGC
GGCCATCGCC TTTCCGTCAT TGCCAAGGAG GCCCCCCCGG CGGGCGAAAG GGTCAGCATC
GGCATTCGGC CGCAGCATAT CACCCTCGCC GAAGCGGGCT CAGCGGGCAG ACTGGATACA
AGCGTTACCC TTGTCGAGGA ATTGGGCTCG GAGACTGTCG TCCACGCCGA CGCAGGCGGG
AAGAAGCTGA TTGCGGTTTT TGCCGGCCAG CAGCGGATGA AATCGGGTGA CAGCCTGCCG
CTGCATCTCG ACCCCGATGT GCTGCACCTC TTCGGCGAGG ACGGCAGGCG CTTGTCCTAA
 
Protein sequence
MSGLELRNIV KNFGAVEVIR DVSLHVNDGE FVAFVGPSGC GKSTLLRLIA GLDKPTDGSI 
AIDGKDVTAI SAADRGLAMV FQSYALYPHM SVRENLAFGL ENTKVAKAEI EARITDAARM
LEIEPFLQRR PGQLSGGQRQ RVAIGRAIVR RPDAFLLDEP LSNLDAELRV SMRAELAALH
ARLKATMIYV THDQVEAMTL ADRIVVLRGG RIEQVGTPLE LYNKPANRFV AGFIGAPHMN
FLEGAIVGHE GGFAEVETVG GHRLSVIAKE APPAGERVSI GIRPQHITLA EAGSAGRLDT
SVTLVEELGS ETVVHADAGG KKLIAVFAGQ QRMKSGDSLP LHLDPDVLHL FGEDGRRLS