Gene Rleg2_0375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0375 
Symbol 
ID6979089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp383318 
End bp384415 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content60% 
IMG OID643395087 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002279900 
Protein GI209547983 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00751674 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000421612 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGGTCAA TCATTGCAAG TGTGACGGCC GCCGCCGTTG CCGCGCTGCT TACCGCCGCG 
CCGGCCTTTG CGCAGGAGCG CGTGGTCAAC GTCTACAACT GGTCGGATTA TATCGACGAC
AGCATTCTTG CCGACTTCAC CAAGGAAACC GGCATCAAAG TCGTCTACGA CACCTTCGAT
TCCAACGAGA CCGTGGAAAC CAAGCTGCTG GCCGGCGGCA CCGGTTATGA CGTCGTCGTT
CCCACAGCCG ACTTCCTGCA GCGCCAGATC CAGGCCGGCG TCTTCCAGAA GCTCGACAAG
TCGAAACTGC CGAACCTCTC CAACATGTGG GATGTGATCC AGCAGCGCAC CGCCGAATAC
GACCCGGGCA ACGAACATGC GGTCGATTAC ATGTGGGGCA CCGACGGCAT CGGCTACAAC
GTCAAGAAGG TCGCCGAAAT CCTCGGTCCC GATGCCAAGC CCGGCCTCGA AGTGATCTTC
GATCCGAAGG TCGCCGCAAA GTTCAAGGAT TGCGGCATCT ATATTCTCGA CACACCGAAG
GACGTCATTA CCACGGCGTT TCGCTATCTC GGCCTCGACC CGAACTCCAC CAAGGCCGAG
GATTTCAAGA AGGCCGAAGA GCTGCTGACG GCCGCCCGCC CCTATGTCCG CAAGTTCCAT
TCGTCCGAAT ACATCAATGC GCTTGCCAAC GGCGACATCT GCATCGCCTT CGGCTATTCC
GGAGACATGC TGCAGGCGCG CGACCGTGCG GCCGAAGCCA AGAACGGCGT CGAGGTCAAT
TATTCGGTTC CCCCGCAGGG CGCCCAGATG TGGTTCGACA TGATGGCCAT CCCCGCCGAT
GCGCCCCACG TCGCCGAAGC CCACGAATTC CTCAACTACA TGATGAAGCC CGAGGTCATC
GCCAAGGCGA GCGATCACAC CTTCTATGCC AACGGCAACA AGGCCTCGCA GCAGTTCGTC
AGCAAGGACA TTCTGGAAGA CCCTGCCGTC TATCCGACCG AGGCGGTGAT GAAGAACCTC
TTCACGGTCA AGCCGTGGGA TCCGAAAACG CAGCGCCTGG GGACGCGCCT CTGGACGAAG
GTCGTTACCG GCCAGTAA
 
Protein sequence
MRSIIASVTA AAVAALLTAA PAFAQERVVN VYNWSDYIDD SILADFTKET GIKVVYDTFD 
SNETVETKLL AGGTGYDVVV PTADFLQRQI QAGVFQKLDK SKLPNLSNMW DVIQQRTAEY
DPGNEHAVDY MWGTDGIGYN VKKVAEILGP DAKPGLEVIF DPKVAAKFKD CGIYILDTPK
DVITTAFRYL GLDPNSTKAE DFKKAEELLT AARPYVRKFH SSEYINALAN GDICIAFGYS
GDMLQARDRA AEAKNGVEVN YSVPPQGAQM WFDMMAIPAD APHVAEAHEF LNYMMKPEVI
AKASDHTFYA NGNKASQQFV SKDILEDPAV YPTEAVMKNL FTVKPWDPKT QRLGTRLWTK
VVTGQ