Gene Rleg2_5411 details


Gene Information

Locus tag: Rleg2_5411
Symbol:
ID: 6978505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 1055648
End bp: 1057168
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 58%
IMG OID: 643394513
Product: extracellular solute-binding protein family 5
Protein accession: YP_002279331
Protein GI: 209547413
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
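
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the gene record above.
START_BP = 1055648
END_BP = 1057168
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: 1057168 - 1055648 + 1 = 1521 bp, and 1521 / 3 - 1 = 506 aa.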


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.253312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00154474
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAACTGC ATTTGGTAGC TGCCTGCTTT TCAACCGCGA CAATCGGGCT GAGTATCGGC 
TCGGCTCACG CTGAAGATGC CAAGAGCAAC GTCACTGTTG TGCTTGCCGA AACCGTCGAT
GTCGTCGAGC CCTGCATGGC AGCGCGCCAG GATGTCGGCC GGGTCATTTC CGAAAACGTC
AACGAGATGC TGGTGGAATT CGATTACGTC AATGGCGGCC TCAAACCCCG CCTGGCGACG
GAATGGTCGA AGATCGACGA CGACACCTGG GAGTTCAAGC TGCGCCCGAA TGTCAAATGG
CACGATGGCA AACCGTTCAC CGCCAAAGAC GTTCAGTTCA CGATCGAGCG CAACAAGAAC
AAGAAGCTCA GCTGCGAGAC CGGCGGCAAA TATTTCGGCG GCACGGAATT CAGCTTCGAG
ACGCCTGATG CCAACACCAT CCGCATTACG ACAAAACCGG CGCAGCCGAT TCTTCCGTTG
CTGATGACGG TGATGGCCGT CGAATCGGCC GAAGCGACAC CTGCAGACGA ATTTACCCGC
AAGCCGATCG GCACCGGCCC CTATACATTC GACAAATGGG AGATCGGCCA GTCGATCGAG
CTGAAGCGTA ATCCGGACTA TTGGGGGGAC AAGCCGCAGG TGGAGCAGGC GACCTATCTG
TTCCGCTCGG ATAGTGCTGT CGCGGCGGCG ATGGTCGATG CCGGCGAAGC CGATATCGTT
CCGGCCGTGT CGGTGCAGGA TGCCACCAAC AAGGAAACCG ATTTCGCCTA TCCGAATTCG
GAGACGACGT CGCTGCGCAT CGACACCCGC GCAGCACCGC TCAACGATCG GCGCATACGC
GAAGCGATGA ACCTCGCCAT CGATCGTCAG GCGATGCTCG GAACGCTGTT TCCCGAACAG
GCAAAGATCG CCACGCAACT CGTCGTACCC ACCACGATCG GCTACAATGC CGATATCCCC
GCCTGGCCCT ATGATCCCGA AAAAGCAAAG GAACTGGTCA AAGCGGCAAA AGCCGACGGC
GTGCCGGTCG ATCAGCAGAT CCGCATCATC GGCCGTAACG GGCAATATCC CAACGCCACC
GAAGCGATGG AAGCGATGAT GGCCATGCTT CAGGACGTCG GCTTGAACGT CAAACTCGAC
ATGTACGATG TTTCCGTGTG GAACGGCTAT TTCGTTGCAC CCTTCGTTGC CGATTCCGGT
CCGACATTGA CCCAGTCGCA GCACGACAAT GCCACCGGCG ATCCCGTCTT CACCGCATTC
GTGAAGTACG CCACCGACGG CTCCCATTCC ATGGTTCGGG ATCCCGCGGT CGACGCCCTT
ATCGCCAAGG CGACCTCAGC CACCGGCGAC GAGCGCAAGA AACTCTGGAA GGAGCTTTTC
GCCAAGGTGA ACGCCGAGAT CATCGCCGAT ATTCCGATGT TCCATATGGT CGGTTTCACC
CGCGTTTCGC CGCGTCTTGA CTTCAAGCCG ACGATCGCGA CGAATTCCGA GCTGCAGCTG
TCGCAGATCC GCTTCAAGTA A
 
Protein sequence
MKLHLVAACF STATIGLSIG SAHAEDAKSN VTVVLAETVD VVEPCMAARQ DVGRVISENV 
NEMLVEFDYV NGGLKPRLAT EWSKIDDDTW EFKLRPNVKW HDGKPFTAKD VQFTIERNKN
KKLSCETGGK YFGGTEFSFE TPDANTIRIT TKPAQPILPL LMTVMAVESA EATPADEFTR
KPIGTGPYTF DKWEIGQSIE LKRNPDYWGD KPQVEQATYL FRSDSAVAAA MVDAGEADIV
PAVSVQDATN KETDFAYPNS ETTSLRIDTR AAPLNDRRIR EAMNLAIDRQ AMLGTLFPEQ
AKIATQLVVP TTIGYNADIP AWPYDPEKAK ELVKAAKADG VPVDQQIRII GRNGQYPNAT
EAMEAMMAML QDVGLNVKLD MYDVSVWNGY FVAPFVADSG PTLTQSQHDN ATGDPVFTAF
VKYATDGSHS MVRDPAVDAL IAKATSATGD ERKKLWKELF AKVNAEIIAD IPMFHMVGFT
RVSPRLDFKP TIATNSELQL SQIRFK
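
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the database's own pipeline. It uses the codon assignments that translation table 11 shares with the standard code, and it ignores table 11's alternative start codons, which do not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table in the conventional TCAG ordering; translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and the final line of the gene sequence above.
print(translate("ATGAAACTGC ATTTGGTAGC TGCCTGCTTT"))  # MKLHLVAACF
print(translate("TCGCAGATCC GCTTCAAGTA A"))           # SQIRFK
```

The two printed fragments match the start and end of the listed protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein and the reported GC content.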