Gene Rleg_5204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5204 
Symbol 
ID8007099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp615295 
End bp616617 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content56% 
IMG OID644822113 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002973373 
Protein GI241113538 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.306263 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGA TCGCTTTTTT CCTGTCCGCA GGATTGACCA GCCTATCCTT ATCCGTTCCG 
GCAATGGCGG CCACGAAGAT CCAGTGGTGG CACGCAATGG GCGGCGAGAA TGGCGCGAAG
CTTGAACAGA TCGCCAAGGG GTTCAACGCA TCTCAATCTG ACTATGAGAT CGTCCCGGTA
TTCAAAGGCA CCTATGACGA GACGCTGACG GGCGCCATTG CTGCGTTCCG CGCCAACCAG
CAGCCGGCAA TCGTGCAGGT CTATGAAGTC GGTACCGGCA CGATGATGGC AGCACAGGGC
GCGATCTATC CCGTCTACCA GTTGATGAAG GACCAAGGGG AGGCCTGGGA CCAGAGCAAA
TTCATTGCTC CGGTCGTCGG TTACTACTCA GACACCAGCG GCAACGTTCT GTCGTTGCCG
TTCAATTCCT CGACGCCGAT CATGTATTAC AACAAGGATG TCTTCAAAAA GGCGGGGCTT
GATCCGGAAA CACCGCCGAA AACATGGGCG GACGTTGAAG CCTTTTCGCG GACGATCATG
AAGTCCGGCG CTGCGAAGTG TGGCTTTACC AGCGCCTGGA TCTCCTGGAT CCAGACTGAA
AACCTCAATG CTTTGCACGA CAAGCCCTAC TCCACCAAGG CCAACGGCTT TGGCGGCTTG
GATGCGGAGT TCACCTTCAA CAACGATCTC ACGATCCGCC ATTGGGGCAA CTTGAAGAAG
TGGCAGGACG AGGGGCTCTT CAAATTCGGC GGGCCTGGCG GCGGCGATAA TGCTCCTCCG
ATGTTCTATT CGCAGGAATG CGCGATGTAC ATGAACTCGT CGGCCGGCCG GGCAGGCGTT
ATCAATAACG CAAAGGCTTT CAAGGTCGGG TTTGCGCCGC TTCCCTACTA TGACGACGTC
ATTACGCAGC CGCTCAACTC GATTATTGGC GGCGCCACGC TCTGGACACT GAAAGGTCGC
CCAGAAGAGG AATACAAGGG TGTCGCGAAG TTCTTCACCT ACCTGCAGAA GCCGGAAGTG
CAAGCCGATT GGCATCAGTT CTCCGGCTAC CTGCCGATAA CCGAGGCTGC CTATAAGCTG
GGCCAGGATC AGGGCTATTA CGAGAAGAAT CCTGGAGCAG ATATCGGCAT CAAGCAGCTG
ACGCGGGTGA CACCCACCGA TAATTCCAAG GGTATCCGGT TCGGCAACTA CGTCCAGGTG
CGTGGCATCA TCGACGATGA GTTTGCAGCA TTGCTGGGCG GGAAGAAGAC GGCGAAGGAA
GCGGTTGATT CCGTGGTCGC ACGAGGCAAC GAACAGCTTC GCGATTTCCA GTCCGCCAAC
TAA
 
Protein sequence
MNKIAFFLSA GLTSLSLSVP AMAATKIQWW HAMGGENGAK LEQIAKGFNA SQSDYEIVPV 
FKGTYDETLT GAIAAFRANQ QPAIVQVYEV GTGTMMAAQG AIYPVYQLMK DQGEAWDQSK
FIAPVVGYYS DTSGNVLSLP FNSSTPIMYY NKDVFKKAGL DPETPPKTWA DVEAFSRTIM
KSGAAKCGFT SAWISWIQTE NLNALHDKPY STKANGFGGL DAEFTFNNDL TIRHWGNLKK
WQDEGLFKFG GPGGGDNAPP MFYSQECAMY MNSSAGRAGV INNAKAFKVG FAPLPYYDDV
ITQPLNSIIG GATLWTLKGR PEEEYKGVAK FFTYLQKPEV QADWHQFSGY LPITEAAYKL
GQDQGYYEKN PGADIGIKQL TRVTPTDNSK GIRFGNYVQV RGIIDDEFAA LLGGKKTAKE
AVDSVVARGN EQLRDFQSAN