Gene Rleg2_4978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4978 
Symbol 
ID6978072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp620839 
End bp622152 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content56% 
IMG OID643394124 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002278942 
Protein GI209547024 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.346425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCT TTGCTGCATT GCTGCTGGGA GCGACGGCGC TCGTCGGTAC TTCGGCCTTG 
GCCCAAACCA CGCTGACGAT TGCGACAGTC AATAACAACG ACATGATTGT GATGCAGAAG
CTGTCGAAGG ACTTTGAAGA GAAGAATCCT GACATCAAGC TCAACTGGGT CACCCTGGAA
GAGAACGTCC TTCGTCAGAA GATCACGACC GACATCGCCA CCCAGGGCGG TCAGTACGAC
ATCATGACGA TTGGCATGTT CGAGACCCCG CTGTTCGGCG AAAAGGGTTG GCTGTCCGAA
TTCAAGGACG TTCCGGCCGA CTACAAGCTC GACGACGTTC TGAAGTCGGT CCGTGACGGC
TTGTCCTTCG ATGGAAAGCT CTATGCTCTG CCTTTCTATG CTGAAAGCCA GATGACCTTC
TACCGCAAGG ACCTGTTCGA CAAGGCTGGG ATAACGATGC CTGATCAGCC GACCTGGGAA
CAGATCGGCC AGTTCGCAGA GAAGATCACC GACAAGGACA AGGAAATCTA TGGTGTGTGC
CTGCGTGGCA AGCCGGGCTG GGGAGAAAAT ATGGGTCAGA TCGGCCCAGT CGTAAACAGC
TACGGTGGCC GCTGGTTCGA TATGGACTGG AAGCCGCAGC TGACGACCGA GCCTTGGAAG
GAAGGCGTCA CTACCTACGT CGATCTCCTA AAGAAGTATG GCCCTCCCGG CGCATCGTCC
AACGGCTTCA ACGAGACCCT GTCGCTTTTC GCCAGCGGCA AATGCGGCAT GTGGGTTGAC
GCAACCGTCG CCGCGGGCTT CCTGACCGAC AAGAAGCAGA GCCAGGTTGC TGACAAGATG
GGTTACGCCC ATCCGCCGAT TGGCAAGTTC GATAAGGGAA ACCATTATCT GTGGTCCTGG
GCACTGGCAG TTCCGGTTTC GTCGAACGAA CCTGACGCAG CGAAAAAGTT CATCTACTGG
GCGACCTCGC AGGACTATAT CAAGCTGGTA GCCAAGGAAA ATGGATGGGC GGCGGTACCT
CCCGGCACCC GCACCTCCAC ATATGACACA CCGGAATACA TCAGCGCTGC TCCGTTCGCA
AAGTTGACCC TGGAGACGAT CCAGACGGCA AACCCGACCG ACGCTACGCA GGAGAAGGTT
CCGTACCGTG GCATTTCCTA CGTTGGTATT CCGGAGTTCC AGAGCTTCGG TACTGCCGTC
GGTCAGAAGA TGTCCGCTGT TATTGCTGGC CAGAGCACCG TCGATGAAGC GCTGAACGAA
TCTCAGAAGC TTGTCGAGCG CACGATGAAG CAGGCCGGTT ACCCGAAGAA ATAA
 
Protein sequence
MKTFAALLLG ATALVGTSAL AQTTLTIATV NNNDMIVMQK LSKDFEEKNP DIKLNWVTLE 
ENVLRQKITT DIATQGGQYD IMTIGMFETP LFGEKGWLSE FKDVPADYKL DDVLKSVRDG
LSFDGKLYAL PFYAESQMTF YRKDLFDKAG ITMPDQPTWE QIGQFAEKIT DKDKEIYGVC
LRGKPGWGEN MGQIGPVVNS YGGRWFDMDW KPQLTTEPWK EGVTTYVDLL KKYGPPGASS
NGFNETLSLF ASGKCGMWVD ATVAAGFLTD KKQSQVADKM GYAHPPIGKF DKGNHYLWSW
ALAVPVSSNE PDAAKKFIYW ATSQDYIKLV AKENGWAAVP PGTRTSTYDT PEYISAAPFA
KLTLETIQTA NPTDATQEKV PYRGISYVGI PEFQSFGTAV GQKMSAVIAG QSTVDEALNE
SQKLVERTMK QAGYPKK