Gene Rleg2_6276 details


Gene Information

Locus tag: Rleg2_6276
Symbol:
ID: 6983349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011370
Strand:
Start bp: 225056
End bp: 226546
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 55%
IMG OID: 643399284
Product: extracellular solute-binding protein family 5
Protein accession: YP_002284040
Protein GI: 209552124
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
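
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python. It assumes that the start and end coordinates are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming a complete frame ending in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(225056, 226546)
print(length)                     # 1491
print(protein_length_aa(length))  # 496
```

Both values agree with the Gene Length and Protein Length fields of the record.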


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0695772
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCTG CATTCGGTGG AGCTGTAAAG GCTGCTCCCA AGAGAGGCGG TAACTTACGC 
ATTGGCATCG CCGGAGGCAC TTCCGGAGAC AGCCTCGATC CGACGTCAAC GCCGGTCGAC
GCGGGCTTTC TCACGCTCAA TACCATGCGC AGTACGCTTG TCGGGATGAA TGCGAAAGGC
GAGCCCACTC CTTTGTTGGC GGAGAGCTGG GAGCCATCGA ATGATCTGAC CAAATGGTAT
TTTAACATCA GGAAAGGCGC GACCTTCCAC AGCGGCAAAT CGGTGACGGC TGACGATGTG
GTCGCATCGC TCAATCTGCA TCGCGGTGAC AAGACAACTT CGCCGGCAAA AGCTCTCCTT
GATCCAGTTA CAAATGTCGA GGCCGATGGT CCTAACCGGG TCGTCATCAC GCTCAACCGT
GCGAACATTG AGTTTGTTAG CCTTTTCAAA ACGGATTTCC TGGTCATACT GCCTTCTAAG
GATGGCGTAA TCGACCGTGC TAGCAAGGAT GGGACAGGTC CATATGCACT TGAAAGCTTT
GAACCTGGTC AGCACCTCAG TTTCAAGCGA AATCCAAACT ATTGGGATCT TGACAATTAC
GGCTTCTTCG ACTCAGCGGA AGTCGTTGTC ATTGCGGACC CAGCCGCCCG CATGAACGCC
CTGCGCTCGG GTCGGGTCGA TCTTGTTAAC TCGGTAGATC TAAAGACCGC CGCGATGCTG
AAGCGCGTTG CAGGCCTTAA GCTAGAAAAC ATTCCGAGCG GATTGTACTA CGGCATGCCG
ATGCTTGTCG ACGTTGCTCC GTTCAATGAT AATAATGTCC GTATGGCTCT GAAGTACGCG
ATCAAACGGC AGGAAATCGT TGACAAGGTC CTTCTTGGCC ACGGAACTGT GGGCAACGAC
CAGCCAATCT TCAAGAACGT CAAGTTTGCC GCGACGGACC TCCCGCAACG GGAATACGAT
CCGGACAAAG CCCGTCATTA CCTGAAGCAG GCGGGCTATG ATTCAATCGA TCTACCGCTG
AACGTCGCGG AAATTGGCTT CCCGGGCGCG ACCGCTGTTA GTCAGCTGTT CGCCGCTTCG
GCCAAGGCAG CTGGGATCAA CCTCAACGTT ACGAGAGAGC CCGACGACGG TTACTTCGAA
CGCGTATGGA TGAAGCAACC GTTTACAACC GCCTATTGGC ATCAGGCCGT GACGGCTGAC
TCCCGCTTTA CCGAGGCTTT CCTGCCTGGT GCTGCTTGGA ACGAAACTCA CTTCAACAAC
CCACGCTTCA ACGAGCTTGC CGTAAAGGCG CGCGAGACCG TGGACGAGAA CACCCGAGCC
GGTATGTATC ATGAAATGCA GCGCATCATA TACGACGAAG GCGGCCTCTT GAACCCTGTG
TTCGCCAACT ACGTCTGGGC AATGAAGGAT AACGTTCACC GCCCCGACGA TGTGACCACT
CTTGGAGACT TGGACTCGTT CCAATGCATT TCTCGCTGGT GGATGGCTTA A
 
Protein sequence
MTAAFGGAVK AAPKRGGNLR IGIAGGTSGD SLDPTSTPVD AGFLTLNTMR STLVGMNAKG 
EPTPLLAESW EPSNDLTKWY FNIRKGATFH SGKSVTADDV VASLNLHRGD KTTSPAKALL
DPVTNVEADG PNRVVITLNR ANIEFVSLFK TDFLVILPSK DGVIDRASKD GTGPYALESF
EPGQHLSFKR NPNYWDLDNY GFFDSAEVVV IADPAARMNA LRSGRVDLVN SVDLKTAAML
KRVAGLKLEN IPSGLYYGMP MLVDVAPFND NNVRMALKYA IKRQEIVDKV LLGHGTVGND
QPIFKNVKFA ATDLPQREYD PDKARHYLKQ AGYDSIDLPL NVAEIGFPGA TAVSQLFAAS
AKAAGINLNV TREPDDGYFE RVWMKQPFTT AYWHQAVTAD SRFTEAFLPG AAWNETHFNN
PRFNELAVKA RETVDENTRA GMYHEMQRII YDEGGLLNPV FANYVWAMKD NVHRPDDVTT
LGDLDSFQCI SRWWMA