Gene Rleg2_5333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5333 
Symbol 
ID6978427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp958100 
End bp959209 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content62% 
IMG OID643394435 
ProductABC transporter related 
Protein accessionYP_002279253 
Protein GI209547335 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA TTTCGCTTAA AGAGCTGAAC AAATCCTACG GCGCGCTCAC CGTCGTCCAC 
GATATCGATC TTGAGATCGC CGATAAGGAA TTCATCATCC TGGTCGGCCC CTCCGGCTGC
GGCAAATCGA CGACGCTCAG GATGATCGCC GGCCTCGAGG AGATCTCCGG AGGAGAACTC
AAGATCGGCG GCGACGTCAT GAACGACGTC CCCTCCAAGG ACCGGGATAT TGCCATGGTC
TTTCAGAACT ATGCGCTCTA CCCACATATG ACCGTCTACA AGAACATGGC CTTCGGCCTG
CAGCTCAGGA AGGTGTCGCG CGACTTCATC GATGCCCAGG TGCAGGACGC CGCCAAGATC
CTCGACATCA CCCATCTCCT GAACCGCAAG CCGAAGGCGC TTTCGGGCGG TCAGCGTCAG
CGGGTGGCGC TCGGCCGCGC CATGGTGCGC AATCCGGCCG TCTTCCTCCT CGACGAGCCG
CTTTCCAACC TCGACGCCAA GCTGCGCGGC ACAATGCGCT CCGAAATCAC CAAGCTGCAC
AAGCGCCTCA ACGCCACCTT CATCTATGTC ACCCACGACC AGGTGGAGGC CATGACCATG
GCCGACCGGA TCGTCGTCAT GAAGGATGGC CACATCCAGC AGGTCGACAC GCCGCAGAAC
CTCTATGACC GTCCCGTCAA CATGTTCGTC GCCGGCTTCA TCGGCGCACC GCAGATGAAC
ATGCTGCCCT CGACCATTCA GCGCCGCGGC GATGGCTATG TCGCCGTCTT CGACGGCCGG
GAACTGCCGC TGCCCGATCA TTTCGACAAG AGCAGGATCG CACCCTATGA GGGCCGCGAA
CTGGTGCTCG GGCTTCGTCC GGAGAATTTC CACGAACTGC CGCCGGCCGA TATCCCGGCC
GAGAACCTGG CGCCCCTCAA GGCAGTGGTC GAACTTGCCG AACCGATGGG CTCGGAGGTG
CATCTGAACA TGGTGGCCGG CGGACGCAAT CTCATCGCCC GTGTGTCGCC GCGCTTCCGG
CCAGCAATCG GCGACGAGGC GACGCTCACC GCCGATATGA GCAACGCGCA GCTGTTCGAC
AAGGAAACGG AACGCTCGAT TCTTTACTGA
 
Protein sequence
MASISLKELN KSYGALTVVH DIDLEIADKE FIILVGPSGC GKSTTLRMIA GLEEISGGEL 
KIGGDVMNDV PSKDRDIAMV FQNYALYPHM TVYKNMAFGL QLRKVSRDFI DAQVQDAAKI
LDITHLLNRK PKALSGGQRQ RVALGRAMVR NPAVFLLDEP LSNLDAKLRG TMRSEITKLH
KRLNATFIYV THDQVEAMTM ADRIVVMKDG HIQQVDTPQN LYDRPVNMFV AGFIGAPQMN
MLPSTIQRRG DGYVAVFDGR ELPLPDHFDK SRIAPYEGRE LVLGLRPENF HELPPADIPA
ENLAPLKAVV ELAEPMGSEV HLNMVAGGRN LIARVSPRFR PAIGDEATLT ADMSNAQLFD
KETERSILY