Gene Rleg_3910 details


Gene Information

Locus tag: Rleg_3910
Symbol:
ID: 8015868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3977137
End bp: 3978375
Gene length: 1239 bp
Protein length: 412 aa
Translation table: 11
GC content: 59%
IMG OID: 644826480
Product: extracellular solute-binding protein family 1
Protein accession: YP_002977691
Protein GI: 241206595
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
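
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the values in this record, but the page itself does not state them).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3977137, 3978375)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1239 412"
```

Both results agree with the listed gene length (1239 bp) and protein length (412 aa).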


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.198843
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGC GTCTATTGGC TGCGACCAGC ATCGCTACCT TATGTCTGGT GTCGGCCGCG 
TCAGCTGCCG AAAATGTCGA AATGTGGGTT CGCTCGGGGA TTGGCGATGC CTTCAAGAAG
GTCGTCGAAG CCTATAATTC CGGTCACGAG AACAAGGTCG TGATGACCGA GGTGCCGTTC
TCCGAGCTGG TACAGAAATA TGCGACGGCA ATCGCCGGCG GACAGGCGCC GGATGCCTTG
TCGATGGATC TCATTTATAA TCCCGCCTTT GCCGCGGCCG GCCAGCTGGA AGACCTGACG
GACTGGGCAA AATCCCTGCC CTATTTCAAT TCGCTTTCGC CATCGCATGT TCGCCTCGGC
ACCTATCAGG ACCGGATTTA CGGCCTGCCG CTTTCGGTCG AAACCTCCGT CTTCGCCTGG
AACAAGGATC TCTACAAGAA GGCCGGTCTC GACCCGGAAA AAGCGCCGGC GAATTGGGAC
GAAATTACCG CCAATGCTGA GAAGATCCGG GCGCTGGGTG ACGATACCTA CGGCTTCTAT
TTCTCCGGTG GCGGCTGCGG CGGCTGCATG ATCTTCACAT TCACGCCGCT TGTCTGGGGT
GCCGGCGCTG ATATTCTGTC GGCCGACAGC AAGACGGCGA CGCTCGATAC GCCTGAGATG
CGCAAGGCCG TCGATATCTA CCGCAACATG GTCAAGAAGG ACCTCGTACC GGCGGGTGCC
GCCAGCGACA ACGGCGCCAA CTTCCTGACC TTCACCAACG GCAAGATCGG CCAGCAAAGC
CTCGGCGCCT TTGCCATCGG CACGCTGGTA ACCGAGCATC CCGATATCAA CTTCGGCGTG
ACCCTCATTC CTGGCGTCGA CGGCAAGCCC TCGTCCTTTG CCGGCGGCGA CAACTTCGTC
ATCACCAAGG GCACGAAGAA GATCGACGCG GTGAAGGAGT TCCTCGAATA TATCTATTCG
ATGGACGGCC AGAAGATCAT GGCGAAGTAT GGCAGCCTGC CGACGCGCGG CGATATCGCC
GACAAAGTGC TTGAGGGCCT CGATCCGCGC ATGCAGGTCG GCCTGAAGGC GATCGGCGTC
GCCAAGACAC CCTATACGCT GCAGTTCAAC GACCTGATCA ACAGCGCCAA CGGGCCTTGG
GCCAGCTTCA CCAACGCCTC GATCTTCGGC GACGATGTCG ACGGGGCGTT TTCGAGCGCC
CAGTCGGAGA TGCAATCGAT CATCGATAGC GGCCAATAA
 
Protein sequence
MIKRLLAATS IATLCLVSAA SAAENVEMWV RSGIGDAFKK VVEAYNSGHE NKVVMTEVPF 
SELVQKYATA IAGGQAPDAL SMDLIYNPAF AAAGQLEDLT DWAKSLPYFN SLSPSHVRLG
TYQDRIYGLP LSVETSVFAW NKDLYKKAGL DPEKAPANWD EITANAEKIR ALGDDTYGFY
FSGGGCGGCM IFTFTPLVWG AGADILSADS KTATLDTPEM RKAVDIYRNM VKKDLVPAGA
ASDNGANFLT FTNGKIGQQS LGAFAIGTLV TEHPDINFGV TLIPGVDGKP SSFAGGDNFV
ITKGTKKIDA VKEFLEYIYS MDGQKIMAKY GSLPTRGDIA DKVLEGLDPR MQVGLKAIGV
AKTPYTLQFN DLINSANGPW ASFTNASIFG DDVDGAFSSA QSEMQSIIDS GQ
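
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, and only the first row of the gene sequence is embedded to keep the example short. It strips the whitespace, computes a GC fraction, and translates the DNA. For translation it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as alternative starts), which is sufficient here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGATCAAGC GTCTATTGGC TGCGACCAGC ATCGCTACCT TATGTCTGGT GTCGGCCGCG"
dna = clean(first_row)
print(len(dna))        # prints 60
print(translate(dna))  # prints MIKRLLAATSIATLCLVSAA
print(f"{gc_fraction(dna):.0%}")
```

The translated fragment matches the first twenty residues of the protein sequence above. Running the same helpers on the full gene sequence would allow the listed GC content and protein length to be checked in the same way.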