Gene Rleg_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3752 
Symbol 
ID8014585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3803164 
End bp3804474 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content61% 
IMG OID644826315 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002977534 
Protein GI241206438 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.561914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTGA GAACTTTTCT GCTGGGCGCC TGCTCAGCAC TGGCGTTTGC CGGTATGGCT 
TCGGCCGAAA CGCTGACAAT CGCGACCGTG AATAACGGCG ACATGATCCG GATGCAGAAG
CTGACGGATG ATTTCAAGGC GAAGAACCCC GGTATCGACC TTGAATGGGT AACCCTGGAA
GAAAACGTGC TGCGCCAGAA GGTCACGACC GATATCGCGA CCAAGGGCGG CCAGTACGAC
GTTCTGACGA TCGGCACCTA TGAAGTTCCG ATCTGGGCAA AACAGGACTG GTTGCTGCCG
CTCGACAATC TCGGCGCCAA TTACGACGTC GACGACTTGC TGCCGGCAAT TCGCAGCGGC
CTGACTGTGG ACGGCAAGCT CTATGCTTCG CCGTTCTATG GTGAAAGCTC GATGGTCATG
TACCGTAAGG ACCTGTTCGA AGCTGCCGGC CTGAAAATGC CCGACGCGCC GACCTGGGAC
TTCGTTGCCG ACGCTGCCCG CAAGATCACC AACAAGGACA AGGAAATCTA CGGCATCTGC
CTTCGCGGCA AGGCCGGCTG GGGCGAGAAC ATGGCCTTCT TGACGGCCAT GTCCAATTCC
TTCGGCGCAC GCTGGTTTGA CGAGAAGTGG AAGCCGCAGT TCGATCAGCC GGAATGGAAG
GACACGCTCG ACTTCTACGT CAAGCTGATG AAGGACGCCG GCCCTCCGGG CGCCTCCTCC
AACGGCTTCA ACGAGAACCT GGCGCTCTTC CAGACCGGTA AGTGCGGCAT GTGGATCGAT
GCAACGGTTG CCGCTTCCTT CGTCGCCGAT CCGAAGCAGT CGCAGGTCGC CGACAAGGTC
GGCTTCGCGC TCGCCCCGGA CAAGGGCCTC GGCAAGCGCG GCAACTGGCT CTGGGCCTGG
AGCCTCGCCA TCCCGGCAGG GACCCAGAAG GCCGAAGCTG CTGAGAAGTT CGTTGCCTGG
GCAACCAGCA AGGAATACAG CAACCTCGTC GCCGAGAAGG AAGGTTGGCT GAACGCACCT
CCGGGCACCC GCAAATCGCT CTATGCGAAT GCGGACTACC AGAAGGCGGC TTCGTTCGCC
AAGATGACGC TCGACTCGAT CGAGTCGGCC GATCCGACCA AGCCGACCGT CAAGCCGGTT
CCCTATGTCG GCGTCCAGTT CGTGGCGATC CCGGAATTCC AGGGCATCGG CACGGCGGTG
GGCCAGCAGT TCTCCGCAGC TCTTGCCGGC CAGCTCTCGG TCGACCAGGC CCTGCAGGCA
GCGCAGCAAC TGACCACTCG CGAAATGACC AAGGCCGGCT ACATAAAATA A
 
Protein sequence
MTLRTFLLGA CSALAFAGMA SAETLTIATV NNGDMIRMQK LTDDFKAKNP GIDLEWVTLE 
ENVLRQKVTT DIATKGGQYD VLTIGTYEVP IWAKQDWLLP LDNLGANYDV DDLLPAIRSG
LTVDGKLYAS PFYGESSMVM YRKDLFEAAG LKMPDAPTWD FVADAARKIT NKDKEIYGIC
LRGKAGWGEN MAFLTAMSNS FGARWFDEKW KPQFDQPEWK DTLDFYVKLM KDAGPPGASS
NGFNENLALF QTGKCGMWID ATVAASFVAD PKQSQVADKV GFALAPDKGL GKRGNWLWAW
SLAIPAGTQK AEAAEKFVAW ATSKEYSNLV AEKEGWLNAP PGTRKSLYAN ADYQKAASFA
KMTLDSIESA DPTKPTVKPV PYVGVQFVAI PEFQGIGTAV GQQFSAALAG QLSVDQALQA
AQQLTTREMT KAGYIK