Gene Rleg_5080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5080 
Symbol 
ID8007673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp466887 
End bp467993 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID644821995 
ProductABC transporter related 
Protein accessionYP_002973255 
Protein GI241113420 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.199927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAC TGAAACTTTC CAACGTCAAC AAATCGTATG GCTCGGTCAA AGTCCTGCAT 
GACGTCGAAC TCGATATCAC GGACGGCGAG TTCGTCGTCT TCGTCGGGCC GTCGGGATGT
GGCAAGTCGA CTCTGCTGCG TGTCATTGCC GGCCTCGAAG AGGTGACGGA GGGCGCAATC
GCGATCGGGG GCCGCGATGT CAGCGCGCTC TCGCCGGCCG AGCGCAAGAT CGCAATGGTC
TTCCAGTCCT ACGCGCTCTA TCCGCATATG AGCGTTCGCA AAAACCTCGC TTTCGGCCTG
GAGAACCTGA AGTTCAAGCG TGCCGAGATC GAGGCGCGGA TTGCCGAGGC CGCCAGGATG
CTGGCGATCG AGCCCTACCT GGACCGACGC CCGAAGCAGC TTTCGGGCGG CCAGCGCCAG
CGCGTGGCGA TCGGCCGGGC TATCGTGCGC GAACCGGACA TTTTTCTCTT CGACGAGCCG
CTGTCGAATC TCGACGCGGC GCTGCGCGTT CAGACCCGCG CCGAGATCAC CAAGCTCCAC
CGCGAGATCA AGACGACGAT GATTTATGTC ACGCATGACC AGGTCGAGGC GATGACGATG
GCCGACAAGA TCGTCGTGCT GCGCGCCGGG CGGGTCGAGC AGGTCGGCGC GCCGCTGGAC
CTGTTCGACA GCCCACGCAA TCTCTTCGTC GCCGGCTTCC TCGGCTCGCC GCGCATGAAC
ATCATCAAGG GCAAGGTCGC TGGCATCGAG GAAGGCGGCG TCGTCATCGA TGTCGGCAAT
GGTGGCAAGG TCGTCAGCGA TGTCGATCCC GCCGGAGTTG CGGTCGGACA GGCTGTTCTC
GCCGGCATCC GGCCCGCGCA TTTTTCACGC TCCAGCGAGC AGGGCCTGCC GTTCATCGTC
CAGTATCACG AGGGCCTCGG TACGGAGACC TATGTCTATG GCAATCTTGC AGGCCATGAC
GAGCAGATCA TCATTCACGA GGCCGGCCAT TTCGCGCCGG CGCCTGGTGA TCGCATCCTG
ATCGATGCCG CCCCGGGGCG GGTTCATCTG TTCGATCCCG AAAGCGGCCT GGCTTTTGCC
CGGCGGCCCG GACAGGGGAG GCGCTGA
 
Protein sequence
MAELKLSNVN KSYGSVKVLH DVELDITDGE FVVFVGPSGC GKSTLLRVIA GLEEVTEGAI 
AIGGRDVSAL SPAERKIAMV FQSYALYPHM SVRKNLAFGL ENLKFKRAEI EARIAEAARM
LAIEPYLDRR PKQLSGGQRQ RVAIGRAIVR EPDIFLFDEP LSNLDAALRV QTRAEITKLH
REIKTTMIYV THDQVEAMTM ADKIVVLRAG RVEQVGAPLD LFDSPRNLFV AGFLGSPRMN
IIKGKVAGIE EGGVVIDVGN GGKVVSDVDP AGVAVGQAVL AGIRPAHFSR SSEQGLPFIV
QYHEGLGTET YVYGNLAGHD EQIIIHEAGH FAPAPGDRIL IDAAPGRVHL FDPESGLAFA
RRPGQGRR