Gene Rleg2_3669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3669 
Symbol 
ID6982431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3797646 
End bp3799124 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content59% 
IMG OID643398391 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002283158 
Protein GI209551241 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGGA TTTCACTCGG TGGTGCCATC CTGCGCGGCA CTGTTTCCAC TGCTTTGATG 
GTATCGTTGA TGTCCGCGTC GGCGCTGGGA GCGCCGGTCG ACCTGAGCAA GTGGTCGCCG
GAATATGTGC GCTCCATTGC CGGCACGCAG GATTTCGACA CGGCGGCCGA TTGCGGCAAG
GTCACCCCGC TCGACTACAA GGGGCGACTC ACTTTCTGGT ATCAGGGTGT GTTCGAGGGC
GACCCCGATC TCCTGCGCCA GTATTACAAG GAGTTCTTCG AGACCTTCCG CAAAACCTAT
CCGAACATCC AGCTTGAGGA GCAGGCCCTC ACCTATAACG ACCTGCTGGA TAAGTTCCGG
ACCGCGCTCC TTGGCAATGC AGCGCCCATG GCGGTCCGCC TGCAGATCCT GGGTGGCACG
GAGTTCGCCT CAAAGGGCTA TTTGCAGCCG CTCAAACCCG AGGATGTAGG CTATTCGACC
GAGGATTTCT GGCCCGGCGC AATGAAGGCT GTAACCTGGG ATGGGGTAAC TTACGGCATC
CCGACCAATA ACGAGACGAT GGCGTTCATC TGGAACGCCG ACATCTTCAA GCGTGCAGGC
GTCGATCCGG ATAAGGCTCC GGCAACATGG GACGACGTCG TCAAGGATTC CAAGCAGATC
CACGACAAGC TCGGCATTGC CGGTTACGGC CTCGTGGCTC GCAAGAATGC CGGCAATACG
CCGTACCGCT TCATGCCGCA GCTGTGGGCC TATGGCGGCG GCGTCTTCGA CGAAGCGACC
GCCAACCCGA CCTATAAGGA GGTCGAGCTC AACAGTCCGC AGAGCAAGGC GGCATTGCAA
GCCTCCTACG ATATGTATGT TCGCGACAAG TCGGTTCCGG TTTCGGCGCT CACCAACCAG
CAGGCCGACA ACCAGCCCCT CTTCCTCGCT GGCCAGCTCG GCATGATGAT CTCGCACCCG
TCCGACTATA ACGTCATGCT CGACTTGCAG AAAAAGGCGA CGGATACCGA CAAGGACAAG
GCGCAGACCG TCATCGACAA TATGCGCTAC GGCCTCATTC CGACTGGGCC CGATGGCAAG
CGTGCCGTCG TGTTCGGCGG CTCCAACATT CACATCCTGA AGCCAGAATA TGTCGAGGGC
GGCAAGGTAG ACGAGCCGGC TGCAAAGGCG ATCATCTGCA TGTGGACGAG CCCGGAATGG
TCGCTGAAGA TGGCCTATGC CGGCTCGAAC CCGGGAAACC TCAACGGCTT CAAGACAAAA
TGGATGAAGG AACGTCTTGA TAAAATCAAG TTCCTCGATG TCACGACCTC GATGCTGCCA
TACGGCATTC CGTTCCCGGC GCTGCCACAG TCTCCCGAGA TCATGAACAT CATCGTCCCG
GACATGCTGC AGAATGCCCT GACCGGGGCC ATGACCGTCG ACCAAGCCGC CGACGACGCA
GCCAAGAAGG TAAAAGACCT GATGGACGGC GGACTCTAG
 
Protein sequence
MTRISLGGAI LRGTVSTALM VSLMSASALG APVDLSKWSP EYVRSIAGTQ DFDTAADCGK 
VTPLDYKGRL TFWYQGVFEG DPDLLRQYYK EFFETFRKTY PNIQLEEQAL TYNDLLDKFR
TALLGNAAPM AVRLQILGGT EFASKGYLQP LKPEDVGYST EDFWPGAMKA VTWDGVTYGI
PTNNETMAFI WNADIFKRAG VDPDKAPATW DDVVKDSKQI HDKLGIAGYG LVARKNAGNT
PYRFMPQLWA YGGGVFDEAT ANPTYKEVEL NSPQSKAALQ ASYDMYVRDK SVPVSALTNQ
QADNQPLFLA GQLGMMISHP SDYNVMLDLQ KKATDTDKDK AQTVIDNMRY GLIPTGPDGK
RAVVFGGSNI HILKPEYVEG GKVDEPAAKA IICMWTSPEW SLKMAYAGSN PGNLNGFKTK
WMKERLDKIK FLDVTTSMLP YGIPFPALPQ SPEIMNIIVP DMLQNALTGA MTVDQAADDA
AKKVKDLMDG GL