Gene Rleg_3723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3723 
Symbol 
ID8014561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3776798 
End bp3777958 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content61% 
IMG OID644826286 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002977505 
Protein GI241206409 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.415779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.44187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TGTTGAAATC CTGCACGGCA GCGCTCGCTT GCCTCAGCTT CGCGACGCAG 
GGAATCGCCG CCGAACCGCT GAAGGCTCTG GGCAAAGGCG AAGGTGCGGT CAGCATCGTC
GCCTGGGCCG GCTATATCGA ACGCGGCGAA ACCGACAAGA ACTACGACTG GGTCACCGAT
TTCGAAAAGG AGACCGGCTG CAAGGTTTCA GTCAAGACCG CCGCCACCTC GGATGAAATG
GTGTCGCTGA TGAACGAAGG CGGCTTCGAT CTCGTCACGG CATCGGGCGA CGCCTCGCTC
CGCCTTATCG CCGGCAAGCG TGTCCAGCCG ATCAACACCG ACCTGATCCC GAGCTTCAAG
ACCGTCGACG AGCGTCTGCA GAACGGACCG TGGTATACGG TCGGCGGCGT GCATTACGGC
GTGCCCTATC TCTGGGGGCC GAACGTGCTG ATGTACAATA CCGATGCCTT CAAGGACAAG
GCGCCGACCA GCTGGAACGT CGTCTTCGAA GAGCAGACGC TGCCCGACGG CAAGTCCAAC
AAGGGCCGCG TCCAGGCCTA TGACGGCGCT ATCTACATCG CCGATGCCGC TATGTATCTG
ATGGCCCATA AGCCGGATCT CGGCATCAAG GATCCCTACG AGCTGAACGA GGACCAGTAC
AAGGCCGCCC TCGACCTGCT GCGCGGCCAG CGCAAGCTCG TCTCCCGCTA CTGGCACGAC
GCGATGATCC AGATCGACGA TTTCAAGAAC GAAGGCGTCG TCGCCTCCGG CTCCTGGCCC
TTCCAGGTGA ACCTGCTGCA AGCCGACAAG CAGAAGATCG CCTCCACTTT TCCGGATGAA
GGCGTCACCG GCTGGGCCGA CACCACCATG CTGCATGCCG ACAGCGAACA TCCGAACTGC
GCCTATATGT GGATGGAACA TTCGCTGAAG GCCAAGGTCC AGGGCGACGC CGCCGCCTGG
TTCGGCGCCG TGCCCTCCGT TCCCGCTGCC TGCAAGGGCA ACGAGCTGAT AGGCGACAGC
GGTTGCGCCA CCAACGGCTT CGATCACTTC GACAAGATCA AGTTCTGGAA GACCCCGGTC
GCCAAATGCA CCACGCAGAG CGAATGCGTG CCGTACCATC GTTGGGTGTC TGATTATATC
GGCGTGATCG GCGGGCGGTA A
 
Protein sequence
MTNLLKSCTA ALACLSFATQ GIAAEPLKAL GKGEGAVSIV AWAGYIERGE TDKNYDWVTD 
FEKETGCKVS VKTAATSDEM VSLMNEGGFD LVTASGDASL RLIAGKRVQP INTDLIPSFK
TVDERLQNGP WYTVGGVHYG VPYLWGPNVL MYNTDAFKDK APTSWNVVFE EQTLPDGKSN
KGRVQAYDGA IYIADAAMYL MAHKPDLGIK DPYELNEDQY KAALDLLRGQ RKLVSRYWHD
AMIQIDDFKN EGVVASGSWP FQVNLLQADK QKIASTFPDE GVTGWADTTM LHADSEHPNC
AYMWMEHSLK AKVQGDAAAW FGAVPSVPAA CKGNELIGDS GCATNGFDHF DKIKFWKTPV
AKCTTQSECV PYHRWVSDYI GVIGGR