Gene Rleg_5698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5698 
Symbol 
ID8016661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp281885 
End bp283123 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content58% 
IMG OID644827851 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002979051 
Protein GI241518423 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.145305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.141272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCA CCAAACGCGA ATTCCTTGTT GCGACAGCAG CGCTAGCGCT TGCCAGTGGT 
GTCCGCTCCG CAAGTGCTGC CACAGCGATC AATTATTGGC ATCACTTCGC CAGCCAGTCG
GAAATGGCCG GCCTGGTGAA AATCATCGAG CTGTTTGGGA AATCCCATCC AGGCATCACC
GTCACGCAAG AGAGCATTCC GAATTCGGAA TATATGGCCA AGGTTTCATC GGCTGTCGTG
GCCGGCGGGC GGCCCGACAC CGGAATGGTC ATTGCGGAAC GCTTTGCCGA TCTGACGGCG
ATGGGCGCGC TGACCGATAT CACCGAGCGG GTAAAAGGAT GGAAGGGAAA AGCCAATCTG
CCGGACAACC GGTGGGCTGG CATGTCTCAG GACGGCGCGA TATACGCGGT TCCGGCCTAT
GCTTTCGTCG ACTGGATGTA CTACCGGAAA GACTATTTCG AAGAAGCGGG CCTTTCGGGC
CCACCAAGGA CTTTTGATGA GTTTGTCACC GCTTGCCGGA AGCTCACCGA TCCGGCAAAG
GGACGCTACG CTTTCGGAAT GCGCGGGGGC GCGGGGGCGT TCAAATACGT CATCGACGTC
ATGGAGGCCT TTGGCTCGCC AATTGTAAAG GACGGTCAGG CGGCCATCGA TAAGGCTGCC
GCGGTGGAGG CGATCACCTT CTACTCAAGT CTGTTCTTGA AAGAGAAAGT CGTTCCTCCA
AGTGTGCCGA ACGACAGCTA TCGGCAGATC ATGGAAGGTT TCCGAACTGG CCAGACAGCC
ATGGTCTGGC ACCATACCGG ATCGCTGATC GAAATCTCGG CCGCCTTGAA GCCGGGAGAG
CAGTTCGCCA CCGCTCCAAT GCCCGCGGGA CCGAAGGCAC ATATCGCGCG TGTTGCCTAC
GCCGGCAACG GCATCATGAA GGACGACAAT ATCGACGCTG CCTGGGACTG GATCAGCTTC
TGGGGAGAAA AAGACGCGGC GATAGCCTTG TTGGAAGCGA CGGGTTATTT CCCGGCATCA
ACGGCAGCGC TTGAGGATGA GCGCATCAAG ACCAATCCAA TCTACCAAGC CGCTTCGCAG
ACGCTCGACT TCGGTCGTCT GCCGAACAGT TTCGTCGGCG CTGCGGGCTG GTCCGAAAAT
GTCGTCAATC CCACGTTCCA ATCCGTTCTG ACGGGTCAAC TCACCCCTGA GCAGGCCGTC
GACCGAATGA TCGAGGGTCT GGAGACCGCG CTCCGGTAG
 
Protein sequence
MQITKREFLV ATAALALASG VRSASAATAI NYWHHFASQS EMAGLVKIIE LFGKSHPGIT 
VTQESIPNSE YMAKVSSAVV AGGRPDTGMV IAERFADLTA MGALTDITER VKGWKGKANL
PDNRWAGMSQ DGAIYAVPAY AFVDWMYYRK DYFEEAGLSG PPRTFDEFVT ACRKLTDPAK
GRYAFGMRGG AGAFKYVIDV MEAFGSPIVK DGQAAIDKAA AVEAITFYSS LFLKEKVVPP
SVPNDSYRQI MEGFRTGQTA MVWHHTGSLI EISAALKPGE QFATAPMPAG PKAHIARVAY
AGNGIMKDDN IDAAWDWISF WGEKDAAIAL LEATGYFPAS TAALEDERIK TNPIYQAASQ
TLDFGRLPNS FVGAAGWSEN VVNPTFQSVL TGQLTPEQAV DRMIEGLETA LR