Gene Rleg2_4502 details


Gene Information

Locus tag: Rleg2_4502
Symbol:
ID: 6977596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 138023
End bp: 139603
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 59%
IMG OID: 643393680
Product: extracellular solute-binding protein family 5
Protein accession: YP_002278498
Protein GI: 209546580
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
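
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(138023, 139603)
print(length)                  # 1581
print(protein_length(length))  # 526
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.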


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.146925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACGTG AAAAGGATCA ATCTTCGCCT CTGGGCATCG GCCGCCGTCA CTTCGTTGGC 
GGTGCCGTCG CGCTGGGTGC TTTGCCAGTC CTCACCTCCG GCCTCCTGAT GCCGCGCGAT
GCCCGCGCGC AGGAAGCGAA GCGCGGTGGT CATCTGAAAC TCGGTCTCAA GGGCGGCGCC
ACCAGTGATG CGCTCGACCC CGCGACCTAC AGCGCCTCCG TGTTGTTCGT GATTGGCCGT
CTCTGGGGCG ACACACTTGT CGAATCCGAC CCTAAGACCG GTACGCCCTT GCCGTCGCTG
GCGACTTCCT GGACGCCATC GGCGGACGCA TCCGTCTGGA CTTTCAAAAT CAGGAACGAC
GTGCAGTTTC ACGACGGCAG CAAGATGACC GTCGCGGACA TCGTCGCGAC ACTGAAGCGA
CACGCGGACA AGAATTCGCA GTCGGGCGCT CTGGGGCTCA TGGCCTCGAT CACCGGTATC
GAAGAAAAAG CGGGCGACCT TGTCCTCACA TTTTCGGAAG GCAATGCGGA CCTGCCTTTG
CTTTTGACCG ACTATCACCT AATCATCCAA CCGAAGGGCG GCCTCGACAA GCCTGCGGCG
GCCATCGGTA CAGGTCCTTA CATTCTGAAA AGCTTCGAAC CGGGCGTTCA CGCAACCTTC
GAGAAAAACC CGAAGGATTG GCGCTCCGAC CGCGGCTTTG TCGACAGTGT CGAGATCCTC
GTCATCAACG ACAACACCGC CCGCGTTGCC GCACTTGCCT CCGGCCAAGT CCATTTCGTC
AACAATGTTG ACCCCAAGAC AGTCCCGATG CTGCAGCGAG CACCGACCGT CGAAATCCTC
CGGAATGCAG GCAAGGGCTT CTATTGTTTC CTGATGCATT GCGACGCGGC TCCCTTCGAC
AACACCGATC TTCGGCTTGC GCTGAAATAT GCCATCGATC GTCAGGCGAT CCTCGACAAG
GTTCTGGGCG GCTACGGAGT CATCGGCAAC GACTATCCGG TCAATTCCAA CTACGCTCTC
GCCCCGACCG ATATCGAGCA GCGCCCTTAT GATCCAGACA AGGCCGCCTT CCACTTCAAG
AAGGCGGGTC TCGACCGCTC CATTCAGCTG CTCACGTCAG ACGCAGCCTT TCCGGGCGCT
GTGGATGCGG CGATCCTGTT CCAGCAAAGC GCGCGCAAGG CCGGCATCAC GATCGACGTC
AAGCGCGAAC CGGAAGACGG CTACTGGACC AATGTCTGGA ACAAGCAGCC CTTCTGCGCC
TCGTTCTGGG GCGGTCGTCC GACCCAGGAT TCACGCTATT CGACCTCTTA CCTGTCGACC
GCAGAATGGA ACGACACGCG TTTCAAGCGC CCCGACTTCG ACAAATTGGT TTTGCAAGCA
AGGTCAGAAC TCGATGAGGC CAAGCGCAAG GTGCTTTATC GGCAATTGGC CCTGATGGTG
CGAGACGATG GCGGTCTGAT CCTGCCCGTC TTCAACGACT ACATCATGGC CTCTTCGAAA
ATGCTGAAGG GATATGTCGA CGATATTGGC AACGATATGT CCAACGGCTA CATCGGCAGC
CGCGTGTGGC TTAATGCCTA A
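
The GC content field of the record is the percentage of G and C bases in a sequence such as the one above. The following is a minimal Python sketch of that calculation, assuming the sequence is supplied as text in the blocked layout shown here, so that spaces and line breaks must be ignored. The function name `gc_percent` is illustrative, not part of any database tooling.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so a sequence
    pasted in 10-base blocks can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_percent("ATGACACGTG"))  # 50.0
```

Passing the full gene sequence to the same function gives the value to compare against the GC content field.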
 
Protein sequence
MTREKDQSSP LGIGRRHFVG GAVALGALPV LTSGLLMPRD ARAQEAKRGG HLKLGLKGGA 
TSDALDPATY SASVLFVIGR LWGDTLVESD PKTGTPLPSL ATSWTPSADA SVWTFKIRND
VQFHDGSKMT VADIVATLKR HADKNSQSGA LGLMASITGI EEKAGDLVLT FSEGNADLPL
LLTDYHLIIQ PKGGLDKPAA AIGTGPYILK SFEPGVHATF EKNPKDWRSD RGFVDSVEIL
VINDNTARVA ALASGQVHFV NNVDPKTVPM LQRAPTVEIL RNAGKGFYCF LMHCDAAPFD
NTDLRLALKY AIDRQAILDK VLGGYGVIGN DYPVNSNYAL APTDIEQRPY DPDKAAFHFK
KAGLDRSIQL LTSDAAFPGA VDAAILFQQS ARKAGITIDV KREPEDGYWT NVWNKQPFCA
SFWGGRPTQD SRYSTSYLST AEWNDTRFKR PDFDKLVLQA RSELDEAKRK VLYRQLALMV
RDDGGLILPV FNDYIMASSK MLKGYVDDIG NDMSNGYIGS RVWLNA
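
The protein sequence follows from the gene sequence by codon-wise translation. Translation table 11 (the bacterial, archaeal, and plant plastid code) assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The following is a minimal Python sketch under that assumption. It does not treat alternative start codons specially, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGACACGTG AAAAGGATCA ATCTTCGCCT"))  # MTREKDQSSP
```

The output matches the first block of the protein sequence listed above.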