Gene Rleg_1878 details


Gene Information

Locus tag: Rleg_1878
Symbol:
ID: 8012930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1862826
End bp: 1864421
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 58%
IMG OID: 644824467
Product: extracellular solute-binding protein family 5
Protein accession: YP_002975699
Protein GI: 241204603
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
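
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1862826
end_bp = 1864421

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1596 bp
print(protein_length, "aa")   # 531 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.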


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.558624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.335683
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCATT TTTCTAAGAG CTTGTTCGTC GGTGCCGTCA TCGGCGCCCT GACGATATCG 
GCAGCGCAGC TGCAGGCCGC CACGCCGCAG GACCAGCTGG TGATTGGCAC TTCGCTGGCG
CAGGTTTTGT CGCTCGATCC GCAGCAGGCG ACCGAAGGCA AGGCGGTCGA AATCATGTCG
AATCTGTACG ATCGGCTGGT TGCCAGCACG GCTGATGGCA AGATCCTTCC GCAACTGGCG
GAAAGCTGGA AGATTGACGA CAAGGGCATC ACCTTTACGC TGCGCAAGGC CAATTTCGCC
TCCGGTAACC CGGTCACCTC GAAGGATGTC GTCTATTCAC TGGCGCGGCT CCTGAAAATG
GACCAGGCCG CCGCCGCTAA CCTCAAGCGC GTCGGCTACG ACAAGGACAA TGTCGAAAAG
CTCGTCAAGG CCGTCGACGA TCAGACGGTG CGGATCGATC TCTCCGACCA GGTGACGGCA
GAGCTTCTGC TCTATCGGCT GACAACGACG ACGACCAGCG TGGTCGACAG CGTCGAAGTC
GAGAGCCACG CCGTCGATAA CGACTACGGA AACGCGTGGA TGCGAACGCA TTCTGCCGGC
TCCGGTCCGT TTACCCTCAA TCGCTGGTCT CCGAACGAAC TGGTGATCCT CGACGCCAAC
AAGGACTATA TGGCAGGCAC GCCGAAGATG CGTCGCGTCA TCGTCCGGCA TGTGCCTGAA
AGCCAGGTCG AGCGGCTGAT GCTTGAACGC GGCGATATCG ATATTGCCAG CGCCTTGACC
GCATCGGATC TCGCGACGTT CCAGACCAAG AAAGGCTTTG CCATCCAGCG TATTCCGACG
GGCGGTTTCT ACGTGCTGTC GATGAATGCC GGCAACAAAT ACCTCGCCAA TCCGAAGGTT
CGCGAAGCCA TCGCCTATGG CATCGACTAC AAGGGCATCG AAAAGACGAT CATGGGCCCT
TACGGCCGGG CGAGAAACGT TCCCGTTCCG GAGAATTTCG AATATGCCAT CCCGAACCCC
GATTGGCATC TCGACGTCGA AAAGTCGAAA CAGCTGCTGA GCGAGGCAGG CTTCAAGGAC
GGCTTTTCGC TGACGCTGAA GACCATCGCG CAAACGCCGC GCATCGATCT TGCCACCGCC
ATCCAGGCAT CGCTTGCTCA AGTTGGCATC AAGATCGACA TCCAGCAGGG CAACGGCTCG
GAAATCATCG CCGCCCATCG CGCCAGGGAT TTCGATCTGC TGATCCCGCA GACCAGCGCC
TATATGCCGA ACGTGCTCGG CTCGATGGAG CAGTTTTCCT CCAATCCCGA CAATTCGAAG
GAAGCCAACA ATGCCGGCAA TTTCGTCTGG CGCTCGGCCT GGGATATTCC GGAACTCACG
GCGCTGACGG CGAAAGCATC GATGGAGCCG GACGCCAAGA AGCGTGGCGA ACTCTATGTT
CAGATGCAGA AGATGTTCGT CGAACAGAAG CCGGCGGTGC TTCCGCTCTT CGAGCGCTTT
GAGCCGATCG TCCTCAATAG CAAGGTCGAG GGATATGTGG GGCATCCGTC TCAGCTGACG
CGTCTCGAGA ACGTCACCAA GGTCGAAACC CAGTAA
 
Protein sequence
MKHFSKSLFV GAVIGALTIS AAQLQAATPQ DQLVIGTSLA QVLSLDPQQA TEGKAVEIMS 
NLYDRLVAST ADGKILPQLA ESWKIDDKGI TFTLRKANFA SGNPVTSKDV VYSLARLLKM
DQAAAANLKR VGYDKDNVEK LVKAVDDQTV RIDLSDQVTA ELLLYRLTTT TTSVVDSVEV
ESHAVDNDYG NAWMRTHSAG SGPFTLNRWS PNELVILDAN KDYMAGTPKM RRVIVRHVPE
SQVERLMLER GDIDIASALT ASDLATFQTK KGFAIQRIPT GGFYVLSMNA GNKYLANPKV
REAIAYGIDY KGIEKTIMGP YGRARNVPVP ENFEYAIPNP DWHLDVEKSK QLLSEAGFKD
GFSLTLKTIA QTPRIDLATA IQASLAQVGI KIDIQQGNGS EIIAAHRARD FDLLIPQTSA
YMPNVLGSME QFSSNPDNSK EANNAGNFVW RSAWDIPELT ALTAKASMEP DAKKRGELYV
QMQKMFVEQK PAVLPLFERF EPIVLNSKVE GYVGHPSQLT RLENVTKVET Q
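
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds a codon table from the standard amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The function name `translate` and the two test fragments, copied from the first and last lines of the gene sequence above, are illustrative choices and not part of the source record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: internal codons of table 11 use the standard assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAGCATT TTTCTAAGAG CTTGTTCGTC GGTGCCGTCA TCGGCGCCCT GACGATATCG"
last_line = "CGTCTCGAGA ACGTCACCAA GGTCGAAACC CAGTAA"

print(translate(first_line))  # MKHFSKSLFVGAVIGALTIS
print(translate(last_line))   # RLENVTKVETQ
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Passing the full gene sequence to the same function would be expected to give the complete 531-residue protein.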