Gene Rleg_4940 details


Gene Information

Locus tag: Rleg_4940
Symbol:
ID: 8007533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 318427
End bp: 319881
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 56%
IMG OID: 644821857
Product: extracellular solute-binding protein family 1
Protein accession: YP_002973117
Protein GI: 241113282
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
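
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 318427
end_bp = 319881

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1455 bp
print(protein_length, "aa")  # 484 aa
```

Both results agree with the reported Gene Length (1455 bp) and Protein Length (484 aa).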


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCG ATATCGGAAA TATGTCGCGA CGTGACCTGC TGAAAAGCGC TTCCGTCGCA 
GCTCTTGTGG CTGGCGCGGG ATCACTGGCC ATTCCACGGC GAGGAGCCGC GCAGGATGCG
AACACGGTTC GCGTCCTGTC CGTTGAAGAC CCATTCTTCT TCTCAATGAA GGAGTTGGTG
CCGGAATACG AGAAAGAAAC CGGTATCAAG GTCGAACTGG AGAGTCTCTC CTATGACGCC
CTTCAATCGC GCCTCGTGTC TGCCTTTGTC GCCAAGACCT CAGACGCCGA CGTCATCGTT
GTCGATCAGA TGTGGCTCGG GCAGTACCTC GACAATGGCT GGATCATCTC GCTGAACGAT
TTTATTGCCA AGGACAGCGA ATTCGACCTC TCAGACTTCA TCCCGGAGGT CCTTTACTCC
TCGAACATGT GGCGCGGCCA GATCGGGACA TTGCCGGTCG CCGCCTATGC CCAAGGGGTT
ATGTACCGCA AGGACGTGTT TGACAAACTC GCCATTGAAG CGCCGCCGAC CAAGACCTCG
GAAGACTGGA CCTGGACCAA ATATGTCGAC ACACTGAAGT TGATGGAAGG CAAGTCATTT
GGCGGCAAAC CGTTGTTTCC CACGGTTGTC TGCGGCTCCC AACCGTCGCC GATCGTCCAC
ATGTTTACGC AGGTGTCGGC AAGCCACGGT GCCAACTGGT TCAAATCATT CCCTGCCGAC
CCGTGGGATT TCTCTCCGCA GTTGACAAGC CCCGCCTGGG TCAAATCTGT TGAAGTCTAT
AGGCAGCTCT ACAAGCTGTC TCCGCCTGAA GCGATCAACT ATGTCTGGTT CGACGCCGGC
ACCCGTTTTG CCAAAGGTGA CATCGGGATG TTCTACTGGT GGACGCCGTA CTTCTACCTG
ATCAAAAACT CGGGCTACAT GACCGGCAAG AAGTCGGACG TGATGGACAA GTACGCGACG
GCGGCCTTGC CAAAGGCTGA GGGCGTGCCT CAGACGGTCA GTCTCGGCGG ATGGAGCCTT
GGCATCCCAT CCAGTTCCGA AAGGCAAGAA GCAGGCTACG CCTTCATCAA ATGGGCGACC
TCGAAAACCA CGCAGAAGAA AATGGCTCTT TGGCCGGACC TTAACTACCA ATTCTCCGAC
TTTGCGCGCG TTTCACTCTA CGAAGACGAG GAAGTCAAAG CGCTCTACCC GTACCTCGAT
GTGCAGTATG CGATGATGAA GCAGGGTAAC GGCAAGGTCA CACGCCCGCC TGTACCTGGT
TACACGGCCA TTGAAAGCGT GCTGGGCCTC ACATTGAACC AGCTTTTGAC CGGTAGCGAA
GAGCCGAAGA CCGGCCTTGA ACGTGCCAAC AGCCTGTTCG AGAGCATCCT GAAGGGTAAT
CTCATGATCC CTTATCAAAA AGCCAGCTAC GCAGACACTC TTGACGGGGC CAAAGCCCAG
ATCGCCAAGA GGTAA
 
Protein sequence
MTIDIGNMSR RDLLKSASVA ALVAGAGSLA IPRRGAAQDA NTVRVLSVED PFFFSMKELV 
PEYEKETGIK VELESLSYDA LQSRLVSAFV AKTSDADVIV VDQMWLGQYL DNGWIISLND
FIAKDSEFDL SDFIPEVLYS SNMWRGQIGT LPVAAYAQGV MYRKDVFDKL AIEAPPTKTS
EDWTWTKYVD TLKLMEGKSF GGKPLFPTVV CGSQPSPIVH MFTQVSASHG ANWFKSFPAD
PWDFSPQLTS PAWVKSVEVY RQLYKLSPPE AINYVWFDAG TRFAKGDIGM FYWWTPYFYL
IKNSGYMTGK KSDVMDKYAT AALPKAEGVP QTVSLGGWSL GIPSSSERQE AGYAFIKWAT
SKTTQKKMAL WPDLNYQFSD FARVSLYEDE EVKALYPYLD VQYAMMKQGN GKVTRPPVPG
YTAIESVLGL TLNQLLTGSE EPKTGLERAN SLFESILKGN LMIPYQKASY ADTLDGAKAQ
IAKR
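
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last blocks of the gene. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon assignments in plain Python, on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 15 nt of the gene sequence listed above.
first_block = "ATGACAATCG ATATCGGAAA TATGTCGCGA"
last_block = "ATCGCCAAGA GGTAA"

print(translate(first_block))  # MTIDIGNMSR
print(translate(last_block))   # IAKR*
```

The translated ends match the first ten residues (MTIDIGNMSR) and the last four residues (IAKR) of the protein sequence, with the final TAA read as the stop codon.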