Gene Rleg_4335 details


Gene Information

Locus tag: Rleg_4335
Symbol:
ID: 8015114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4456289
End bp: 4457575
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 60%
IMG OID: 644826911
Product: extracellular solute-binding protein family 1
Protein accession: YP_002978114
Protein GI: 241207018
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
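
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed values) and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP = 4456289
END_BP = 4457575

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1287 428
```

Both results agree with the Gene Length and Protein Length fields of this record.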


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.000444207
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGTCA ATCGCCGTTC ATTTCTGATG GGTTCAGCCG GTGCGGCCGC CGGTCTCGCC 
CTTGGTGCCG GAAGCGCCAT TCCGGCTTTT GCCGAAGACG CACAGCTCCG CGCCATGTGG
TGGGGATCCA ACGACCGCGC CAAGCGCACG CTTGAGGTTG CCAAGCTCTA TCAGTCGAAA
TCGCCTGGCG TCACTCTCGT CGGTGAATCA CTTTCGGGCG ACGGCTACTG GACGAAGCTC
GCAACGCAGA TGGCCGGGCG CTCGATCGCC GACATCTTCC AGCTCGAGCC GGGAACAATT
TCAGACTACT CGAAGCGCGG CGCCTGCCTG CCACTCGATG AATTCGTGCC CTCCACGCTG
GACGTTCAGT CCTTTGGCGC CGACATGCTG AAACTGACCA CCATCGACGG CAAGCTCTAT
GGTGTTGGCC TCGGCCTCAA TTCCTTCTCG ATGTTCTTCG ACACAGTCGA ATTCGAAAAG
GCAGGCATCC CATTGCCGAC GCCCGACCTC ACCTGGGATG AGTACGCCAA GCTCGCCGTC
GAACTCGCCA AGTCTTCCGG CAAGGGCGGC GGCCCCTATG CGGCGCGCTA CGCCTATGTG
TTCGATGCCT GGCTGCGCCA GCGCGGCAAG AGCCTCTTTG CGAGGGAAAG CGTTGGGCTC
GGCTTCACGG CCGACGATGC CAAGGAATGG TTCGACTACT GGGAGAAACT GCGCAAAGCG
GGTGGCACCG TTGCCGCCGA CGTGCAGACG CTCGATCAAA ATACAATCGA CACCAATTGC
CTTGGTCTCG GCAAATCGGT GATTGGGATG GCCTATTCAA ACCAGATGGT CGGATATCAA
CTGATCATCA AGAACAAGCT TGGCATCACC ATGCTGCCAC GGGACAAGAA GGGCGGTCCG
TCCGGCCATT ATTACCGTCC GGCACTGATC TGGAGTGTGG GCGCGACGAG CAAGCATGGC
GAAGCTGCCG CGAAGTTTAT CAGCTTCTTC GTCAACGATC CCGAAGCCGG CAAGATCCTT
GGCGTGGAAC GCGGCGTGCC AATGTCGCCT ACCGTGCGCG AAGCCATCCT GCCGCAACTC
AACCCGACAG AGCAGGAAAC GGTCAAATAC GTGAATCTGC TCAAGGATCA GGTCGGCGAA
TATCCGCCAC CGGTGCCGAT GGGCGCAACC CAATTCGACC AGCGCGTGCT GCGCCCGCTT
TGTGACGAAC TCGCCTTCGA ACGGATTTCG CCCGCCGATG CGGCGACCCG GCTCATCGAA
GAGGGTAAGG CAACGATCAA GGGATGA
 
Protein sequence
MQVNRRSFLM GSAGAAAGLA LGAGSAIPAF AEDAQLRAMW WGSNDRAKRT LEVAKLYQSK 
SPGVTLVGES LSGDGYWTKL ATQMAGRSIA DIFQLEPGTI SDYSKRGACL PLDEFVPSTL
DVQSFGADML KLTTIDGKLY GVGLGLNSFS MFFDTVEFEK AGIPLPTPDL TWDEYAKLAV
ELAKSSGKGG GPYAARYAYV FDAWLRQRGK SLFARESVGL GFTADDAKEW FDYWEKLRKA
GGTVAADVQT LDQNTIDTNC LGLGKSVIGM AYSNQMVGYQ LIIKNKLGIT MLPRDKKGGP
SGHYYRPALI WSVGATSKHG EAAAKFISFF VNDPEAGKIL GVERGVPMSP TVREAILPQL
NPTEQETVKY VNLLKDQVGE YPPPVPMGAT QFDQRVLRPL CDELAFERIS PADAATRLIE
EGKATIKG
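
The relationship between the two sequences above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the method used to build this record: the helper names are hypothetical, the codon table is the standard one (translation table 11 uses the same amino acid assignments and differs in its permitted start codons), and alternative start codons are not handled because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in the NCBI compact order: first, second, then third base over TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for 10-letter blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(seq_text)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq_text: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(seq_text)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAGGTCA ATCGCCGTTC ATTTCTGATG"
print(translate(first_blocks))  # MQVNRRSFLM
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be expected to reproduce the listed protein and a GC fraction near the reported 60%, provided the record is internally consistent.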