Gene Rleg2_6279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6279 
Symbol 
ID6983352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp229203 
End bp230741 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content57% 
IMG OID643399287 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002284043 
Protein GI209552127 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.334528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTCT CTCGTAGAAC TTTCTTGCAG GGAACGACCG CCGTTGCCTT GGCTTCGTCG 
TTCGGGGTTA AGGCGATCGC CGCTCCCAAG CGTGGGGGAC ATTTGCGTGT CGCAGTCGCG
ACCGGCTCGA CGACCGATAG CCTCGATCCA ACTAGCACTC CCGTAACGTG GGGGTTCATC
AATCTTGCCA CCTCGCGTAA CACGCTGGTC GGCACCGATC ATGCCGGCAC CCTCATTCCA
AAACTGGCGG AAAGCTGGGA GCCGTCGAGC GATCTCAAGA CCTGGGTGCT CAATCTCAGG
AAAGGAGTGA CCTTCCACAG CGGAAAGACG ATGACAGCAG ACGACGTTGT CAATTCGTTG
AACCTTCACA GGGGCGAAAA TACGGTCTCG CCCGCCAAAT CGCTCTTGAG TCCCGTCAGC
AATGTTAAAG CGGACGGTGC CAACAGGGTG GTGATATCGC TGGAAAGTCC GAATGTGAAC
TTTGTGAACC TTCTGAGGGA TGACTTCCTG GTTGTTGCGC CTTCCAAGGA CGGCGAGGTG
GATCGTACTT CGACGGACGG CACCGGACCC TACGTGCTGG AGAGCTGGGA AGCTGGGCGA
AGCCTCCGGT ATAAGCGCTA CGAGAATTTT TGGGACCTGA ACAATTACGG CTTCTTCGAC
TCCGCCGAGG TGGTGGTTAT CCAGGACAAC GCAGCCCGGA TGAACGCACT TCGTTCCGGA
CAGGTGGATC TCGTGAATTC AGTCGACCTC AAGACCGTCC CGATGCTGAA GCGCGTTCCG
AAAATACGGG TTGAGGATAC CCCGAGCGGG ATGTACTACG GCCTGCCCAT GCTTACCGAC
GTAGCGCCAT TCAACGACAA CAATGTTCGC CTAGCGCTTA AGTACGCGTT CAACCGGCAG
GAAGCCGTCA ACAAGGTCCT CCTCGGGCAC GGCACGGCGG GCAACGACCA TCCGATTTTC
GCGAATGACA AATTCAACGA CCCCACGATT CCGCAGCGCG AGTATGATCC AGACAAGGTT
CGGTTCTACC TCGACAAAGC CGGTCTCCAA TCGCTTGAAA TCCCTCTGAA CGTGGCAGAG
GCGGGTTTCC CAGGAGCCGT CGACACCGCT CAGCTTTACG CATCCTCGGC AGCAGCGGCA
GGGATCAAGA TCAACGTAAC ACGCGAACCC GACGACGACT ACTATGAGCG CGTATGGCTC
AAGAAGCCAT TCTGCGCAGC TTATTGGAAC CAAGCCATCA CCAACGACGC GCGGTTTACC
GAAGCCTTCC TGCCGGACGC TCCGTGGAAT GAAACGCACT ATAACAATCC ACGCGTCACG
GAGTTGGTCG TGAAGGCCAG GTCCACGCTG GACGAAGGGG CGCGCGCAAG CATCTATCAC
GAGTTGCAGC GTATCATCCA TGACGACGGC GGCCTTCTCA ACCCGATGTT CGTGAATTAT
GTCTGGGCAA TGAAAGACAA CGTGAAGAGG CCAGAGAAGG TGTCCACCTT GGGCGACCTC
GACGGCTACG AGTGCATCGC CCGTTGGTGG ATGGAGTAG
 
Protein sequence
MTFSRRTFLQ GTTAVALASS FGVKAIAAPK RGGHLRVAVA TGSTTDSLDP TSTPVTWGFI 
NLATSRNTLV GTDHAGTLIP KLAESWEPSS DLKTWVLNLR KGVTFHSGKT MTADDVVNSL
NLHRGENTVS PAKSLLSPVS NVKADGANRV VISLESPNVN FVNLLRDDFL VVAPSKDGEV
DRTSTDGTGP YVLESWEAGR SLRYKRYENF WDLNNYGFFD SAEVVVIQDN AARMNALRSG
QVDLVNSVDL KTVPMLKRVP KIRVEDTPSG MYYGLPMLTD VAPFNDNNVR LALKYAFNRQ
EAVNKVLLGH GTAGNDHPIF ANDKFNDPTI PQREYDPDKV RFYLDKAGLQ SLEIPLNVAE
AGFPGAVDTA QLYASSAAAA GIKINVTREP DDDYYERVWL KKPFCAAYWN QAITNDARFT
EAFLPDAPWN ETHYNNPRVT ELVVKARSTL DEGARASIYH ELQRIIHDDG GLLNPMFVNY
VWAMKDNVKR PEKVSTLGDL DGYECIARWW ME