Gene Rleg2_4947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4947 
Symbol 
ID6978041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp589138 
End bp590175 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content57% 
IMG OID643394099 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002278917 
Protein GI209546999 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0229057 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTAA TATCCTCTAT CGCGTATATA TCCGCCGTGA TCGCTCTCGC CGGGGCGGCT 
GCCACATCGG CGCGCGCTTC CGTGCTCGAC AGCGTAAAAC AGCGCGGCGT GCTCAATTGC
GGCACTGACA ACACGACTCC GGGGTTCGGT TATCTCAACC CAAAAACCTC CAAGATGGAG
GGTGTGGATG TGGATCTTTG CCGCGCGACT GCAGCCGCCG TTCTTGGTGA TCCGGACAAG
GTTAACTTCG TGGTCGTCAC GGACAAAAGC CGGTTCAATG CCGTTCAGAC AGGGCAGGTA
GACATTGTCT ATGCCCATAC GTCTGTGTTC GCGTCGCGCG CTGCAGCGCT CGCAGTTGAC
TTTTTACCGT CGTATTTTTT TGATGGTGGC GGCGTGATGG TGACGGCCGC ATCTGGCGTG
AAGTCGATCA ACGATCTGTC GGGGGCTACG ATCTGCACCA CTCAGGGGTC TGGCAGCGAG
GCCACACTTG CCCAAGAGGT TAAGGCTAGG AATCTAACGA ACACAAAGAT TCTGACGTTC
GATACCAGCG AGAAACTGTT CTCGGCGCTG ACCAGCGGGC GGTGCAACGG TATGTACACC
GACAAGTCGG CCCTTGCCGC CTGGCGCGGT AACTCGCAGA AATCTGCCGA CTACGTGATC
CTACCAGAGA CGCTGGCAGT GGCTCCATTC GCTGGTATCA TCGTTCAAAA CGATCCAGAA
TGGCGAAAGC TGATGACGTG GACGCTCTAC GCCTTGTTTC AGGCCGAGGA ATGGGGCATC
ACCAGTGCTA ACCTGAGCGA GATGCAGAAA TCTGCCGACC CCGCAATTCA GAAGTTCTTG
GGTGTAAACG GCGGCTTCGG CGCGGACTTC CATGTGTCGG ACAGCTTCAT CGCCGACATG
ATCAAGGGCG TCGGCAATTA CGGCGAGATC TATGACCGGT CTCTGGGGCC GAAGACGCCG
CTCTATCTGG AGCGCGACAA GACGTCGAAC GCACTCTCGA AGAATGGCGG TCTGCTGTAC
TCGATCCTGT GGCTCTGA
 
Protein sequence
MRLISSIAYI SAVIALAGAA ATSARASVLD SVKQRGVLNC GTDNTTPGFG YLNPKTSKME 
GVDVDLCRAT AAAVLGDPDK VNFVVVTDKS RFNAVQTGQV DIVYAHTSVF ASRAAALAVD
FLPSYFFDGG GVMVTAASGV KSINDLSGAT ICTTQGSGSE ATLAQEVKAR NLTNTKILTF
DTSEKLFSAL TSGRCNGMYT DKSALAAWRG NSQKSADYVI LPETLAVAPF AGIIVQNDPE
WRKLMTWTLY ALFQAEEWGI TSANLSEMQK SADPAIQKFL GVNGGFGADF HVSDSFIADM
IKGVGNYGEI YDRSLGPKTP LYLERDKTSN ALSKNGGLLY SILWL