Gene Rleg_6544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6544 
Symbol 
ID8017067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012854 
Strand
Start bp260827 
End bp261837 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content61% 
IMG OID644828331 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002979531 
Protein GI241554318 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.492931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CATTTGCTGT CGCGGCGGTT GCCGCATTCG TGGCTGTGTC CGTACCGGCG 
CACGCCGACA ACATGGTCTT CTCGAGCTGG GGAGGAACGA CCCAGGACGC GCAGAAGGCC
GCATGGGCCA GCCCGTTCAC GGAGAAGACC GGTATCACCG TCGTGCAGGA CGGGCCGACG
GATTACGGCA AGCTCAAGGC GATGGTCGAG GCTGGCCAGG TCACCTGGGA CGTCGTCGAC
GTCGAAGGCG ACTATGCCGC CCAGGCCGGC AAGAATGGCC AGCTCGAGAA ACTCGATTTC
TCGGTCATCG ACAAATCCAA GCTCGATCCG CGCTTCGTGA CTGATTACTC GGTCGGCAGC
TTCTATTATT CCTTCGTCAT CGGCTGCAAT GCCGATGCCG TCACCGCCTG CCCGAAGACA
TGGGCGGATC TGTTCGACAC GGCAAAGTTT CCGGGAAAGC GCACATTCTA CAAGTGGTCG
GCTCCCGGCG TGATCGAAGC GGCGCTGCTT GCCGACGGTG TGGCCGCCGA CAAGCTCTAT
CCACTTGATC TCGACCGCGC CTTCAAGAAG CTCGATACGA TCAAATCGGA TATCGTCTGG
TGGTCGGGCG GCGCACAGTC GCAGCAGCTT CTGGCATCCG CCGAGGCTCC CTTCGGCAGT
GTCTGGAACG GCCGCATGAC CGCGCTTGCG GCGAGCGGTA TCAAGACCGA GACCTCATGG
GAACAGAACA TCACCGCTGC GGATTCGCTC GTCGTGCCGA AGGGTTCGCC GAACGTGGAA
GCCGCGATGA AGTTCATTGC AATGGCGACT TCTGCCGAAC CGCAGGCCGC CCTTGCAAAA
GCCACCGGAT ATGCACCAAT CAACGTTGAC TCGGCCAAGC TGATGGATCC GGAGACGGCC
AAGACCCTGC CGGATCAGCA GACGGCCAGC CAGGTGAATG CCGATATGAA CTACTGGGCT
GATAATCGCG ATGCCATCGG CGAGAAGTGG TACGCCTGGC AGGCGAAATA G
 
Protein sequence
MKTTFAVAAV AAFVAVSVPA HADNMVFSSW GGTTQDAQKA AWASPFTEKT GITVVQDGPT 
DYGKLKAMVE AGQVTWDVVD VEGDYAAQAG KNGQLEKLDF SVIDKSKLDP RFVTDYSVGS
FYYSFVIGCN ADAVTACPKT WADLFDTAKF PGKRTFYKWS APGVIEAALL ADGVAADKLY
PLDLDRAFKK LDTIKSDIVW WSGGAQSQQL LASAEAPFGS VWNGRMTALA ASGIKTETSW
EQNITAADSL VVPKGSPNVE AAMKFIAMAT SAEPQAALAK ATGYAPINVD SAKLMDPETA
KTLPDQQTAS QVNADMNYWA DNRDAIGEKW YAWQAK