Gene Rleg2_3450 details

Gene Information

Locus tag: Rleg2_3450
Symbol:
ID: 6982204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3565387
End bp: 3566697
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 61%
IMG OID: 643398168
Product: extracellular solute-binding protein family 1
Protein accession: YP_002282943
Protein GI: 209551026
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
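
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record, although both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(3565387, 3566697)
print(length)                  # 1311
print(protein_length(length))  # 436
```

Both results match the Gene Length and Protein Length fields of the record.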


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTGA GAACTTTTCT GCTGGGCGCC TGCTCAGCAC TGGCGTTTGC CGGCATGGCT 
TCGGCTGAGA CGCTGACAAT CGCAACCGTC AACAACGGCG ACATGATCCG GATGCAAAAG
CTGACGGATG ATTTCAAGGC GAAGAATCCC GGCATCGACC TTGAATGGGT CACCCTCGAA
GAGAACGTGC TGCGCCAGAA GGTCACGACC GACATCGCGA CCAAGGGCGG CCAGTACGAC
GTTTTGACGA TCGGCACTTA CGAAGTTCCG ATCTGGGCAA AGCAGGGTTG GCTGCTGCCG
CTCGACAATC TCGGCGCCAA TTATGACGTC GACGACCTGC TGCCGGCGAT CCGCAGTGGC
CTGACCGTGG ACGGCAAGCT CTATGCTGCG CCGTTCTACG GCGAAAGCTC GATGGTCATG
TATCGCAAGG ACCTGTTTGA CGCCGCCGGC CTGAAGATGC CCGACGCGCC GACCTGGGAT
TTCGTTGCCG ACGCTGCCCG CAAGATCACT AACAAGGACA AGGAAATCTA CGGCATCTGC
CTGCGTGGGA AGGCCGGCTG GGGCGAGAAC ATGGCCTTCC TGACGGCCAT GTCCAACTCC
TTCGGCGCTC GCTGGTTCGA TGAAAAGTGG AAGCCGCAGT TCGATCAGCC GGAGTGGAAG
GACACGCTCG ACTTCTACGT CAAGCTGATG AAGGATGCCG GCCCTCCGGG CGCTTCCTCC
AACGGCTTCA ACGAGAACCT GGCGCTGTTC CAGACCGGCA AGTGCGGCAT GTGGATCGAC
GCAACGGTTG CCGCTTCCTT CGTCGCCGAT CCGAAGCAGT CGCAGGTCGC CGACAAGGTC
GGCTTTGCGC TCGCCCCGGA CAAGGGCCTC GGCAAGCGCG GCAACTGGCT CTGGGCCTGG
AGCCTCGCCG TCCCGGCAGG TACGCAGAAG GCGGAAGCTG CCGAGAAGTT CGTCGCCTGG
GCGACGAGCA AGGAATACAG CAATCTCGTC GCTGAGAAGG AAGGCTGGCT GAACGCACCT
CCGGGCACCC GCAAGTCGCT CTATGCGAAT GCGGACTACC AGAAGGCGGC GTCGTTTGCC
AAGATGACGC TCGACTCGAT CGAGGCGGCC GATCCGACCA AGCCGACCGT CAAGCCGGTT
CCTTATGTCG GCGTCCAGTT CGTGGCGATC CCGGAATTCC AGGGTATCGG TACGGCGGTG
GGTCAGCAGT TCTCGGCAGC CCTTGCCGGC CAGATTTCGG TCGACCAGGC GTTGAAGAGC
GCACAGCAGC TGGCGACGCG CGAAATGACC AAAGCCGGCT ACATTAAGTA A
 
Protein sequence
MTLRTFLLGA CSALAFAGMA SAETLTIATV NNGDMIRMQK LTDDFKAKNP GIDLEWVTLE 
ENVLRQKVTT DIATKGGQYD VLTIGTYEVP IWAKQGWLLP LDNLGANYDV DDLLPAIRSG
LTVDGKLYAA PFYGESSMVM YRKDLFDAAG LKMPDAPTWD FVADAARKIT NKDKEIYGIC
LRGKAGWGEN MAFLTAMSNS FGARWFDEKW KPQFDQPEWK DTLDFYVKLM KDAGPPGASS
NGFNENLALF QTGKCGMWID ATVAASFVAD PKQSQVADKV GFALAPDKGL GKRGNWLWAW
SLAVPAGTQK AEAAEKFVAW ATSKEYSNLV AEKEGWLNAP PGTRKSLYAN ADYQKAASFA
KMTLDSIEAA DPTKPTVKPV PYVGVQFVAI PEFQGIGTAV GQQFSAALAG QISVDQALKS
AQQLATREMT KAGYIK
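
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The Python sketch below shows one way to do that, to translate the DNA, and to compute GC content. It is a minimal illustration, not the method used to build this record. The helper names (clean, gc_content, translate) are hypothetical. The codon table uses the amino-acid assignments of the standard genetic code, which NCBI translation table 11 shares (table 11 differs only in its permitted start codons). Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (bases cycle through T, C, A, G with the third position changing fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGACATTGA GAACTTTTCT GCTGGGCGCC TGCTCAGCAC TGGCGTTTGC CGGCATGGCT"
print(translate(first_row))  # MTLRTFLLGACSALAFAGMA
```

The output corresponds to the first two blocks of the protein sequence. Passing the complete gene sequence to translate and gc_content allows the same comparison against the full protein sequence and the reported GC content.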