Gene Rleg2_4796 details

Gene Information

Locus tag: Rleg2_4796
Symbol:
ID: 6977890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 433456
End bp: 435054
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 61%
IMG OID: 643393959
Product: extracellular solute-binding protein family 5
Protein accession: YP_002278777
Protein GI: 209546859
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
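
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 433456
end_bp = 435054
gene_length_bp = 1599
protein_length_aa = 532

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence of N bases holds N / 3 codons; the final stop codon
# does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1599 0 532
```

Under those assumptions the three fields agree: the span equals the gene length, the length is a multiple of three, and 533 codons minus one stop codon give 532 residues.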


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.478428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAC TGTCTAGATT ATCCGCTATC GCGCTTGGTG CCCTGCTGTC GACGGCCGCC 
GTTCCGGCTC TTGTCGTTTC GGGCGCGGCA ATCCAGGCTC AGGCAGCTAC GCTGTCCGGC
GGCTTCGATG TCGGCCCCGG AGGCTTCCAG GGCAACTTCA ATCCGCTGGC GGCGACCGCC
GGCTTCACCT GGCTCAGCAT CTACTACGAA CCGCTGATCA CTTATGACGA GAAGCTGCAG
AAGGTCGTCG GCGCGCTGGC GGACGCCTAC GAGGTCAGTC CGGATCAGAT GACCTACACA
TTCAAGCTCG CCGATGCCAA ATGGCATGAC GGCAAGCCCT TCACCGCCAA GGATGCAAAA
TTCACCGTCG GCCTTGCGAT AGATGCAAAA ACCGGCTCGG TGCTCGCTGC CCGGCTGAAG
GGCATATCAT CCGTCGAGAC GCCGGACGAT CACACCGTCG TCATCAAGCT CAGCGCACCC
AGCAGCAGTT TCCTCGACAC GATGACCAAG GTGATGATGC TGCCCGAGCA TGCTCTCGCC
TCGATACCGG CCGACCAGCT GGCAAAGAAC ACGTGGTGGT CCACCGCGCC GATCGGCACC
GGCCCGTTCA AATTCACCAA ATACGTCTCC GATCAGTATG TCGAGCTTGC CGCAAACACG
GACTATCGCG GCGGCAAACC GGCCCTGGAG CGCGTCATCA ACCGCTATTT CGCCAACCCG
GCCGCAGCGA TCGCGGCGCT GAGATCCGGC GAAATCCAGT TCACCTATGT CGATTCCAAC
GACGTGCCGA CCTTCAAGGA CAACAAGGAT TTCAAGGTCA TCGAAGGCAA CTCTTTCGTC
GTCAACTATC TGGGCTTCAA CCATGATTCC CCGATCTGGA AGGATGTGCG CGTCCGCCAG
GCGGTGATGT ATGCGATCAA TCGCGATACC ATCATCCAAA GCCTCTATGG CGGCGCGGCC
AAACCGGCCA ACTGCGCCTA TGTCGCCGAA CAGCTGATAC CCCAGGGCAT CGACACCTAC
GCCTACGATC CCGAAAAGGC CAAGAAACTG CTCAAGGAAG CCGGCTGGGA TCAGATCAAC
GGCGGCAAGC CGATCACGCT TCTGACCTAT TACACCACGC CGCTTGCCAC CAACGTCCTT
GCCGCCGTCC AGGCGATGCT TGCGGAGGTC GGCATCAACA TCGTGCCGCG CGCCGTCGAT
GCGCCGACCT ATAACAGCAT CGTGCTGAAT GCGACGCCGG ATATCGCCCA GTTCCAGATG
GTGTACGCCG GGCTGCAAAA CGGGCCGGAC GCCGGAAGCA TCAATGTCGG CCTCAACGAG
AAGCAGATCC CTCCGGCCGG GCCGAACGTC GCCAGAGTTC GCATGCCCGA TCTCACCAAG
GCACTCGATA GCGCGCTTGC CGAGCCCGAC AGCGCCAAGC GGGATGCGGC CTACCAGAAT
GTCTGCAAGG TGATGAACAC GAACCTGCCC TGGGCGACGC TTTGGGTGGC GAACCGTTAC
GGCATCGTCT CGACCAAAGC GAAGGATTTC GTCTGGACGC CGGCGCCGGG TGGCGGCCCC
TACCAGGCCG CCCCGCAGAA ATGGTCGCTC GCCGAATAG
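
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs to be joined before it can be analysed. The following is a minimal sketch of how the GC content field could be recomputed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGAAAAGAC TGTCTAGATT ATCCGCTATC GCGCTTGGTG CCCTGCTGTC GACGGCCGCC"
seq = clean_sequence(first_line)

print(len(seq))                         # number of bases after joining the blocks
print(f"{gc_fraction(seq):.0%}")        # GC content of this fragment only
```

To compare against the GC content reported for the gene, the whole sequence (all lines) would have to be passed through `clean_sequence` first; a single line is not representative.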
 
Protein sequence
MKRLSRLSAI ALGALLSTAA VPALVVSGAA IQAQAATLSG GFDVGPGGFQ GNFNPLAATA 
GFTWLSIYYE PLITYDEKLQ KVVGALADAY EVSPDQMTYT FKLADAKWHD GKPFTAKDAK
FTVGLAIDAK TGSVLAARLK GISSVETPDD HTVVIKLSAP SSSFLDTMTK VMMLPEHALA
SIPADQLAKN TWWSTAPIGT GPFKFTKYVS DQYVELAANT DYRGGKPALE RVINRYFANP
AAAIAALRSG EIQFTYVDSN DVPTFKDNKD FKVIEGNSFV VNYLGFNHDS PIWKDVRVRQ
AVMYAINRDT IIQSLYGGAA KPANCAYVAE QLIPQGIDTY AYDPEKAKKL LKEAGWDQIN
GGKPITLLTY YTTPLATNVL AAVQAMLAEV GINIVPRAVD APTYNSIVLN ATPDIAQFQM
VYAGLQNGPD AGSINVGLNE KQIPPAGPNV ARVRMPDLTK ALDSALAEPD SAKRDAAYQN
VCKVMNTNLP WATLWVANRY GIVSTKAKDF VWTPAPGGGP YQAAPQKWSL AE
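
The protein sequence follows from the gene sequence by codon translation. The sketch below builds a codon table from the compact amino acid string used for the standard genetic code, whose amino acid assignments translation table 11 shares (table 11 additionally permits alternative start codons, which does not matter here because this gene begins with ATG). The function name `translate` is illustrative, and only two short fragments of the gene are used as input.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the third base varying fastest,
# paired with their one-letter amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene and its last 9 bases, as shown in this record.
print(translate("ATGAAAAGAC TGTCTAGATT ATCCGCTATC"))  # MKRLSRLSAI
print(translate("GCCGAATAG"))                          # AE
```

The two outputs match the first ten and last two residues of the protein sequence listed above, which is a quick consistency check between the two sequence blocks.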