Gene Rleg2_4593 details


Gene Information

Locus tag: Rleg2_4593
Symbol:
ID: 6977687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 226205
End bp: 227269
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 62%
IMG OID: 643393769
Product: putative sugar ABC transporter, substrate-binding protein
Protein accession: YP_002278587
Protein GI: 209546669
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4213] ABC-type xylose transport system, periplasmic component
TIGRFAM ID:
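
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(226205, 227269)
print(length, protein_length_aa(length))  # prints "1065 354"
```

Both results match the Gene Length and Protein Length fields listed in the record.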


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCT CGACAAAAGC GGCAGGCGCC GCGGCACTTG CCCTTTCCCT GCTTGGCGGG 
ACGTCCGCCT TTGCGCAAAA CGCCGTTGCC GATGCGACGG TTGCATTCCT CATGCCCGAC
CAGGGCTCGA CCCGATACGA AGAGCATGAC CATCCGGGAT TCGTCGCGGA AATGAAGAAG
CTTTGCGCAT CGTGCAAGGT CCTCTATCAG AATGCCGATG CCGATATCGC CAAGCAGCAG
CAGCAGTTCA ACTCGGCTAT CACCCAGGGC GCCAAGGTCA TCGTGCTCGA CCCGGTGGAT
TCGGCAGCGG CCGCCTCTCT CGTTCAGCTC GCCCAGAGCC AGGGGGTCAA GGTCATCGCC
TATGACCGTC CGATCCCGAA GGGCAAGGCC GATTTCTACG TCTCCTTCGA CAACAAGGCG
ATCGGCAAGG CGATCGCGGA ATCGCTCGTC CAGCATCTGA AGGCGAAGAA CGTGCCGGCC
GATGGCGGCG GCATTCTGCA GATCAATGGT TCGCCGACCG ATGCGGCTGC CGGGTTGATC
AAGGACGGTA TTCACGAGGG GCTCGCCAGC GGCGGCTACA AGACGCTTGC CGAATTCGAC
ACGCCGAACT GGCAGCCGGC GAATGCGCAG CAATGGGCGG CCGGCCAGAT CACCCGCTTC
GGCAAACAGA TCGTCGGCGT CGTCGCCGCC AATGACGGCA CCGGTGGCGG CGCCATTGCC
GCCTTCAAGG CAGCCGGCGT CGATCCCGTA CCGCCGGTGA CCGGCAATGA CGCGACGATC
GCCGCGCTGC AGCTGATCAT ATCTGGGGAT CAGTACAACA CGATCTCCAA GCCGAGCGAA
ATCGTCGCCG CGGCTGCCGC CGACGTCGCC GTCAAGCTTT TGGCAGGAGA AACGATCAAG
GCCGAAATGA CGCTTTACGA CACGCCGGCA CAGCTCTTCG TCCCTGCCGT CGTCACCGCC
GAAAACCTCA AGGCCGAGAT CATCGACAAG AAGATCAACA CGGCGGAAGA ACTCTGCACC
GGCCGTTATG CCGACGGCTG CAAGAAGCTC GGCATCACCA AGTAA
 
Protein sequence
MKFSTKAAGA AALALSLLGG TSAFAQNAVA DATVAFLMPD QGSTRYEEHD HPGFVAEMKK 
LCASCKVLYQ NADADIAKQQ QQFNSAITQG AKVIVLDPVD SAAAASLVQL AQSQGVKVIA
YDRPIPKGKA DFYVSFDNKA IGKAIAESLV QHLKAKNVPA DGGGILQING SPTDAAAGLI
KDGIHEGLAS GGYKTLAEFD TPNWQPANAQ QWAAGQITRF GKQIVGVVAA NDGTGGGAIA
AFKAAGVDPV PPVTGNDATI AALQLIISGD QYNTISKPSE IVAAAAADVA VKLLAGETIK
AEMTLYDTPA QLFVPAVVTA ENLKAEIIDK KINTAEELCT GRYADGCKKL GITK
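
The sequences above are listed in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup plus a GC percentage calculation, using only the Python standard library. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is just the first three groups of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three groups of the gene sequence, split across two lines as a demo.
example = "ATGAAATTCT CGACAAAAGC\nGGCAGGCGCC"
dna = clean_sequence(example)
print(len(dna), round(gc_percent(dna), 1))
```

Pasting the full gene sequence block into `clean_sequence` should yield a string whose length can be compared with the Gene Length field, and `gc_percent` gives a value to compare with the GC content field.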