Gene Rleg_3430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3430 
Symbol 
ID8014303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3452102 
End bp3453349 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content58% 
IMG OID644825988 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002977215 
Protein GI241206119 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCG CATTAGCAGG TATCGCAGCA TCCGCGTTGA CATTGTGCAT ATCGACGTCT 
GCCGTCTTCG CGACGGATCT CCGCATGACG GTCTGGACCG GGAGCGAGGC GCATCTGAAG
ATGCTGAATG GCATTGCGGA GAGCTTCAAA GCCACACACC CCGATGTGAA CGTGAAGTTC
GAGACCGTGC CGGTCAGCGA CTACACGCAG AAACTGACCT TCCAGATCGC CGGCGGCAAT
GCTCCCGACA TAGCCTGGAT GATGGAGGAT GCCGCTCCGG CTTTCGAAAA CGCCAATCTT
CTGATGGATC TCGGCCCGAC GCTCAAGGCG GCGGAAGGCT ATGATTTCGA CGATTTTTCG
AAGCCGGCCA TGGGCCTCTG GCAGAAGGAT GAAACGGTCT ACGGCATTCC GTTCTCCACC
TCGCCTTTCA TGATCTACTA CAACAAGGAC ATGTTCGACA AAGCCGGGCT CGAAGATCCG
CTGACGCTCG CCACCAAGGG CGAATGGAAC ATGGACAAGT TCCAGGAAGT CTCCAAGAAG
CTCGCGGAAA CCAATCCCGG CAAATGGGGC TTCGAGTTCA AGGATGGGGA AGGCTATGCC
TCTCGCATGA CCCATGCCCT TCTGCCGCCA ATCCGCGCCT ATGGTGGCGA TATCTGGTCG
AACAAGGAAT GCGGCTTCGA CAAGCCCGAA GCGGTCAAGG CGGTCAAGCA GCTGCATGAC
ATGGTCTTCA AGGACAAGTC CATCGTTCCG CCGGGCGAAC AGGGCGATTA CTTTTCCGGC
AATTCGGCGA TGACGGTCAA CCAGATTTCC CGCGCTTCGA AGATGGCGGA AGCCGGCTTC
AAGTGGGGCA TCGCACCGTT GCCCACCGGC CCAGGTGGTG AGTCACCCGT TATCGGCCAG
GCCGGTCTTG TTGTGTTCGC CCAAGGCAAG AATACGGAAA TCGCCGCGGA ATTCGTGGCG
CATATGACCA ACAAGGAAAA CGTCGCCACC ATGGCGCAGT TCTTCCCGCC CGCCCGCAAG
AGCGTTCTGC AGGCCGATGC ATTCATCAAC GGCAACAAGC TCGTGCCGCC CGAGATGATG
AAGAATGTGG CTGCCGCCAT AGAAAAGGGC CGGGTGGTTT CGGCTAACGA AAAAGCGCCA
CAGATCCTTG CCGCCATGGC GCCTCGCGTC GATGCCTTGT GGAAGCCGGA TGCCGATGTC
GATGCCGCCA TCAAGGGCAT CTGCGCGGCA ATCCAACCGC TGCTTTGA
 
Protein sequence
MKAALAGIAA SALTLCISTS AVFATDLRMT VWTGSEAHLK MLNGIAESFK ATHPDVNVKF 
ETVPVSDYTQ KLTFQIAGGN APDIAWMMED AAPAFENANL LMDLGPTLKA AEGYDFDDFS
KPAMGLWQKD ETVYGIPFST SPFMIYYNKD MFDKAGLEDP LTLATKGEWN MDKFQEVSKK
LAETNPGKWG FEFKDGEGYA SRMTHALLPP IRAYGGDIWS NKECGFDKPE AVKAVKQLHD
MVFKDKSIVP PGEQGDYFSG NSAMTVNQIS RASKMAEAGF KWGIAPLPTG PGGESPVIGQ
AGLVVFAQGK NTEIAAEFVA HMTNKENVAT MAQFFPPARK SVLQADAFIN GNKLVPPEMM
KNVAAAIEKG RVVSANEKAP QILAAMAPRV DALWKPDADV DAAIKGICAA IQPLL