Gene Rleg_5344 details


Gene Information

Locus tag: Rleg_5344
Symbol:
ID: 8007302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 752674
End bp: 754194
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 58%
IMG OID: 644822248
Product: extracellular solute-binding protein family 5
Protein accession: YP_002973508
Protein GI: 241113673
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
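
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 752674
end_bp = 754194
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span_bp // 3
implied_protein_aa = codons - 1

print(span_bp, span_bp % 3, implied_protein_aa)  # prints: 1521 0 506
```

Under these assumptions the reported gene length and protein length agree with the coordinates.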


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.297716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.516905
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTGC ATTTGTTAGC TGCCTGCTTT TCAACAACGG TGCTCGCGTT GAGCGGCGGT 
ACGGCACACG CTCAGGATGC CAAGAGCAAC GTCACTGTTG TGCTTGCCGA AACCGTCGAT
GTCGTCGAAC CCTGTATGGC AGCGCGCCAG GACGTCGGCC GGGTCATTTC TGAAAACGTC
AACGAGATGC TGGTGGAGTT CGACTACGTC AATGGCGGCC TCAAACCTCG CCTGGCGACG
GAATGGTCGA AGATCGATGA CGACACCTGG GAGTTCAAGC TGCGCCCGAA TGTCAAATGG
CACGATGGCA AGCCGTTCAC CGCCAAGGAT GTCCAATTCA CCATCGAGCG CAACAAGAAT
AAGAAGCTCA GCTGTGAGAC CGGCGGCAAA TATTTCGGCG GCACGGAGTT CAGCTTCGAA
ACGCCCGATG CCAACACGAT CCGCATTACA ACAAAACCGG CGCAGCCGAT TCTTCCGCTT
CTGATGACGG TGATGGCTGT GGAATCGGCC GAGGCGACAC CAGCCGACGA ATTCACCCGC
AAGCCGATTG GCACTGGCCC TTATACGTTC GACAAATGGG AAATCGGCCA GTCAATCGTG
CTGAAACGCA ATCCGGAATA TTGGGGGGAG AAACCCCAGG TGGAACAGGC GACATATCTG
TTCCGCTCAG ACAGCGCCGT TGCAGCCGCC ATGGTCGATG CTGGCGAAGC CGATATCGTT
CCGGCCGTAT CCGTACAGGA TGCCACCAAC AAGGAAACCG ATTTCGCCTA CCCGAATTCG
GAAACGACAT CGCTGCGCAT CGATACGCGC GCAGCACCCC TTAACGACCG GCGCATCCGC
GAAGCGATGA ACCTCGCCAT CGATCGTCAG GCGATGCTCG GAACGCTGTT CCCCGAACAG
GCAAAGATCG CGACACAGCT CGTTGTGCCG ACCACGATCG GTTACAATGC CGATATCCCC
GCTTGGCCCT ATGATCCCGA AAAAGCAAAG GAACTGGTCG CAGCAGCGAA AGCGGACGGC
GTCCCGGTCG ATCGTGAGAT CCGTATCATC GGTCGCAATG GACAATATCC AAACGCAACC
GAAGCGATGG AAGCGATGAT GGCGATGCTT CAGGAAGTCG GCTTGAACGT AAAGCTCGAC
ATGTATGACG TGTCCGTGTG GAACGGCTAC TTCGTTGCAC CCTTTGTCGC CGATTCCGGT
CCGACGCTGA CCCAGTCGCA GCACGACAAT GCGACCGGCG ACCCCGTCTT CACCGCATTC
GTGAAATACG CGACCGACGG TTCCCACTCC ATGGTTCGCG ATCCGGCCGT TGACGCGCTG
ATCGCCAAGG CGACGTCTGC CACCGGCGAC GAGCGCACAA AACTCTGGAA GGAGCTTTTC
GCCAAGGTGA ACACCGAAAT CATCGCCGAC ATCCCGATGT TTCACATGGT CGGTTTCACC
CGCGTCTCGC CGCGTCTCGA CTTCAAGCCG ACGATCGCGA CGAATTCCGA ACTGCAGCTG
TCGCAGATCC GCTTCAAGTA A
 
Protein sequence
MKLHLLAACF STTVLALSGG TAHAQDAKSN VTVVLAETVD VVEPCMAARQ DVGRVISENV 
NEMLVEFDYV NGGLKPRLAT EWSKIDDDTW EFKLRPNVKW HDGKPFTAKD VQFTIERNKN
KKLSCETGGK YFGGTEFSFE TPDANTIRIT TKPAQPILPL LMTVMAVESA EATPADEFTR
KPIGTGPYTF DKWEIGQSIV LKRNPEYWGE KPQVEQATYL FRSDSAVAAA MVDAGEADIV
PAVSVQDATN KETDFAYPNS ETTSLRIDTR AAPLNDRRIR EAMNLAIDRQ AMLGTLFPEQ
AKIATQLVVP TTIGYNADIP AWPYDPEKAK ELVAAAKADG VPVDREIRII GRNGQYPNAT
EAMEAMMAML QEVGLNVKLD MYDVSVWNGY FVAPFVADSG PTLTQSQHDN ATGDPVFTAF
VKYATDGSHS MVRDPAVDAL IAKATSATGD ERTKLWKELF AKVNTEIIAD IPMFHMVGFT
RVSPRLDFKP TIATNSELQL SQIRFK
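
As a spot check that the protein sequence corresponds to the gene sequence, the first and last lines of the gene sequence can be translated and compared with the start and end of the protein. This is a minimal sketch, not the database's own method: the `translate` helper and variable names are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Codon table in TCAG order; amino acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from above.
first_gene_line = "ATGAAACTGC ATTTGTTAGC TGCCTGCTTT TCAACAACGG TGCTCGCGTT GAGCGGCGGT"
last_gene_line = "TCGCAGATCC GCTTCAAGTA A"

print(translate(first_gene_line))  # prints: MKLHLLAACFSTTVLALSGG
print(translate(last_gene_line))   # prints: SQIRFK
```

Both results match the corresponding ends of the protein sequence. The last line can be translated on its own only because the 1500 bases before it are a multiple of three, so it starts in frame.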