Gene Rleg_5365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5365 
Symbol 
ID8007323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp774533 
End bp775789 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content59% 
IMG OID644822269 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002973529 
Protein GI241113694 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.559287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAAAT TGATCTTGCC TCTACTTGCA TCGGCGGCCT TGGCAATCGC CGCACCCGCT 
CTGGGCCAGG ACGCCAAGCC GCTTGCCGGT CAGTCGATCA CGGTGCTCAT GCCATCGCCG
CAGGGACCGA ACATCGCCTC TGACTTCGAG GCCGAAACCG GCATCCACGT CGACCTGCAG
ACCCTGTCGT GGGACGACAT CCGCCCGAAG CTAGTGACAG CCTTGGTTGC CGGTACAGCG
CCAGCGGATG TGACCGAATT CGATTGGTCC TGGACCGGTC AGTTCAGCGC GGCCGGCTGG
TATATGCCGC TCAACGACGT GATCGACGCC GACACGTTGA AAGATATCGG CGTTGCCAAG
ATCTTCACGG TTGACGGCAA ATTATTGGGA ATTCCCTACA CCAACGACTT CCGGGTGATG
CTCGTCAACA AGAAGCATTT CGCGGATGCA GGCATCACCG AGATGCCGAA AACATTGGAC
GCCCTGGTCG CTGCAGCTAA GAAGATCAAG GAAAAGGGTA TTGTCGAGTA TCCGGTCGGC
CTGCCGGTTT CAGCGACCGA AGGCGCGTCA ACGAGCTGGT ATCTGCTGAC CAAGGCCTTC
GGTGGCGAAC TCTTCGACAA GGACTTCAAT CCGCTCTTCA CCTCGCCGGA TTCGGCTGGT
TACAAGGCGC TTGCTTTCGA GTTGATGCTC CTCAAGGAAG GGCTGGTCGA TCCGGCGTCT
ACCGGCCTGA AGGACAGCCA AATCAATGAG AGCATGTTCG CGCAGGGCAT CACCAGCATC
ATGATCTCCG GCGAGCCTGG TCGTCTTGGC CAGATGAACG ATCCCAAGCA GTCGAAAGTC
GCCGGCCAGG TAGAGGCGAT CCTCGTGCCG ACGGCAAGCG GCGAAACGCG CAGTTTCGGC
CTTCCGGAAG CGCTGGCTAT TCCAAACGTC TCTCCCAACA AGGAGGCGGC GATTGCCTTC
GTCAAATGGT TTACCAGCAA GGACTTCCAG AAGAAGAATG CCGTGAACGG CTTCCTGCCG
ACACGCACGT CGGCGCTTTC CGAACTCAAC GAGTCCGGAA AGCTGAACAG CGGCGACGCT
CTCGTGGCGC AATCGAAGAC GGTTGAGCCG CTGTTCCCGC AAGGCACGCC GCCTTGGTAC
CCGCAATTTT CCAGCGGCGT GAACACAGCG ATCAATAGCG CTGCCAAGGG ACAGATGAGC
GTCGACCAGG CGATGGAGGC CATCGCTTCC GCCGCCAAGC AGGCAATGGC GCAATGA
 
Protein sequence
MHKLILPLLA SAALAIAAPA LGQDAKPLAG QSITVLMPSP QGPNIASDFE AETGIHVDLQ 
TLSWDDIRPK LVTALVAGTA PADVTEFDWS WTGQFSAAGW YMPLNDVIDA DTLKDIGVAK
IFTVDGKLLG IPYTNDFRVM LVNKKHFADA GITEMPKTLD ALVAAAKKIK EKGIVEYPVG
LPVSATEGAS TSWYLLTKAF GGELFDKDFN PLFTSPDSAG YKALAFELML LKEGLVDPAS
TGLKDSQINE SMFAQGITSI MISGEPGRLG QMNDPKQSKV AGQVEAILVP TASGETRSFG
LPEALAIPNV SPNKEAAIAF VKWFTSKDFQ KKNAVNGFLP TRTSALSELN ESGKLNSGDA
LVAQSKTVEP LFPQGTPPWY PQFSSGVNTA INSAAKGQMS VDQAMEAIAS AAKQAMAQ