Gene Rleg_6036 details


Gene Information

Locus tag: Rleg_6036
Symbol:
ID: 8016298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012852
Strand:
Start bp: 64993
End bp: 66069
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 57%
IMG OID: 644827344
Product: extracellular solute-binding protein family 1
Protein accession: YP_002978544
Protein GI: 241258660
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
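
The coordinate and length fields in this record can be checked against each other with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 64993
end_bp = 66069

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; assume the last codon is a stop codon
# that does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1077
print(protein_length)  # 358
```

Under these assumptions the results agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.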


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.62779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAC TCACCAGACG CGACACAATG AAGTTGATGT CCGCAGCACT GATAGCTGGC 
GTCAGCATCT CCACCTTGCC GCGCAAAGCC TTTTCGGCCG GATCCATCAC GGTCCTCAAC
TGGCAGGGCT ACGGCACCGA CGAAGCCTGG TCGCTGAAGG CTTTCACCGA AAAGACGGGC
ATCACAGTCG TCCATGACTA CTACAGCTCG GAATCGGAAA TGCTGACCAA GATGCGCACG
AATCCGGGCG CCTACGATCT TGTTATCCTC AACGCTGCGC GTTGCGCCCA GGCGGTTGCC
GAAGACCTGC TGCAGCCGAT CGATTTCAGC AAGGTTCCAA ACGCGTCAAC AGTCGACGAG
ACCCTGCGCG CCAACCCGAA CTTCAGCAAG GATGGCAAGG GTTATGCCGT TCCGTGGGTC
TGGGGCATGA CCTCGCTTGC CATCCGTGAA GGCATGACCG TTCCCGACAG TTACGCGGTC
CTCGCCGACC CGGCCTATAA GGGCCGTGTT GCCATGGACG ACGACGCCAT CATCAATGTC
GGCGTTGGCG CGCTGATGAG CGGGCAGGAC ATCAACGACC CCAAAGATCT GGCTGCTGTT
ACGGCCGCGC TGAAATCGAT CAAGCCGAAC GTCAAGTTGC TGTGGTCGAC GGAAGATCAG
TGGAACAAGT CGTTCGCCGC CAAGGAATTC GACCTGTCGC TGTTCTGGTC GGGCGGCTCG
GTGCGATCCA AGCGCGTTTC TAAACTTCCG GTTCAGTTCG TAGTGCCTAA GGAAGGCGGC
GTCGGTTGGG TCGATGGTCT GGGTGTTCCG GCATCGGCTC CGAATCCAGA AGGTGCTCTT
GCCTTCGTCA ACTGGATGAT CGATCCGATA TTCTATGTCG AATGGGCAAC CAAGATTGGA
GCCCCCGCTT CTTCAAATTC GGCAGCCCTT TCAGCACTTC CTGCCGACGA TCTCACCCGC
CTGGTACACA AGACCGAATA TTTGAAGACC ACGTCCTTTG TCTCGGGCAT TCCCGACGAC
CGGCGCGAAG CCTTCAACAA TATCTGGCAG GAAGTGAAAG CCTTCTATGC AGAATGA
 
Protein sequence
MTELTRRDTM KLMSAALIAG VSISTLPRKA FSAGSITVLN WQGYGTDEAW SLKAFTEKTG 
ITVVHDYYSS ESEMLTKMRT NPGAYDLVIL NAARCAQAVA EDLLQPIDFS KVPNASTVDE
TLRANPNFSK DGKGYAVPWV WGMTSLAIRE GMTVPDSYAV LADPAYKGRV AMDDDAIINV
GVGALMSGQD INDPKDLAAV TAALKSIKPN VKLLWSTEDQ WNKSFAAKEF DLSLFWSGGS
VRSKRVSKLP VQFVVPKEGG VGWVDGLGVP ASAPNPEGAL AFVNWMIDPI FYVEWATKIG
APASSNSAAL SALPADDLTR LVHKTEYLKT TSFVSGIPDD RREAFNNIWQ EVKAFYAE
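
The gene and protein sequences above can be cross-checked by stripping the 10-character block spacing, computing the GC fraction, and translating the nucleotides codon by codon. The following is a minimal sketch rather than the method used to build this record. The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which is assumed here to match translation table 11 for every codon (table 11 is understood to differ only in which codons may act as start codons).

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean_sequence(raw)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
fragment = "ATGACTGAAC TCACCAGACG CGACACAATG"
print(translate(fragment))  # MTELTRRDTM
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence fields.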