Gene Rleg2_0390 details


Gene Information

Locus tag: Rleg2_0390
Symbol:
ID: 6979105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 403328
End bp: 404923
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 60%
IMG OID: 643395103
Product: extracellular solute-binding protein family 5
Protein accession: YP_002279915
Protein GI: 209547998
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
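
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 403328
end_bp = 404923
gene_length_bp = 1596
protein_length_aa = 531

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS consists of whole codons and ends with one stop codon
# that does not contribute an amino acid.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # True: 404923 - 403328 + 1 = 1596
print(gene_length_bp % 3 == 0)          # True: 1596 is a multiple of 3
print(codons - 1 == protein_length_aa)  # True: 532 codons minus the stop = 531
```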


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.18086
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.296586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA TCTCTGTCAT GCTGGCAGCG ACGGCCCTGA TTTCGGTCAT GGCGACGTCG 
GCCTGGTCCA AGACCCTTGT TTATTGCTCC GAGGGTTCGC CTGAGGGTTT CGACCCGAGC
CTCTATACGG CTGGCACGAC CTTCGACGCC TCGTCGCGTA CGGTCTATAG CCGCCTGGTC
GAATTCAAGC ATGGCGGTAC CGAGATCGAA CCGGGCCTCG CCGACAGCTG GAGCGTTTCG
GCCGACGGCA CGGAATACAC CTTCAAGCTT CATCCCGGCG TCAAGTATCA GACCACCGAC
TTCTTCACGC CGACGCGCGA TTTCAACGCC GACGACGTCG TGTTCTCCTT CGAGCGCCAG
CTGAAGGCCG ACAATCCGTG GAACAAGTAT GTCGAGGGCG GTTCTTATGA ATACGCCGCC
GGCATGGGCT TCCCGGATCT GATCAAGTCG ATCGAAAAGG TCGACGACCT CACGGTCAAG
TTCACGCTCA ACCATCCCGA AGCGCCGTTC CTTGCCGATC TGGCGATGGA CTTCGCCTCG
ATCGTTTCCA AGGAATATGC CGACAAGCTC GCCGCCGACG GCAAGATGGC GCAGCTCAAC
CAGCAGCCCC TCGGCACCGG CCCCTTCACC TTCGTCGCCT ACCAGCCGGA TGCCGTCATC
CGCTACAAGG CCAACGAAAC CTATTTCAAG GGCAAGGAAA AGATTGACGA TCTGGTTTTC
GCCATCACCT CTGACGCCGC CGTCCGCGCG CAGAAGCTGA AGGCCGGCGA ATGCCACCTG
ATCCCCTATC CGAATGCGGC TGACGTTCCC GAGTTGAAGA AGGACGGCAA TCTGACGGTG
ATGGAACAGG CCGGCCTGAA TGTCGGCTTC CTCGCCTACA ACACGCAGAT GGCCCCGTTC
GACAAGCCGG AAGTTCGCCG TGCGCTCAAC ATGGCGATCA ACAAGCAGGC GATCATCGAC
GCCGTCTTCC AGGGCGCAGC GGCTGTTGCC AAGAACCCGA TCCCGCCGAC GATGTGGTCC
TATAACGACG CCGTTCAGGA CGACAAGTAC GATCCGGATG CCGCCAAGAA GGCTCTCGCC
GATGCCGGCG TCAAGGATCT CAGCATGAAG GTCTGGGCGA TGCCGGTGTC GCGTCCCTAC
ATGCTGAACG CGCGCCGCGC CGCCGAACTG ATCCAGGCCG ATTTCGCCAA GGCCGGCGTC
AAGGTCGAGA TCGTTACCCA TGAATGGGCC GAATATCTGA AGCTCTCCTC CGACGTGAAG
CGCGACGGCG CCGTCATCCT CGGCTGGACC GGCGACAACG GCGACCCGGA TAACTTCATG
GATACGCTGC TTGGCTGCGA TGCCGTCGGC GGCAACAACC GTGCTCAGTG GTGCAACAAG
GAATATGACG ACCTGATGAC CAAGGCCAAG CTGACCGCCG ATGTCGGCGA GCGCACCAAG
GCCTATGAGC AGGCACAGCT GATCTTCAAG AAGGAAGCCC CCTGGGCGAC CCTCGATCAC
TCGCTCGTCT TCGTTCCGAT GAGCAAGAAG GTCTCCGGCT TCTTCATGGA TCCGCTCGGC
ATTCACCGCT TCGACGGCGT CGACGTATCC GAATAA
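
The GC content field in the Gene Information section could in principle be recomputed from the gene sequence. A minimal Python sketch follows, assuming the sequence is pasted as text with the 10-base groups and line breaks shown here. `gc_content` is a hypothetical helper name, and the rounding used by the source database is not known, so only the approximate value should be compared.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for display, and normalise case.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example call on the first displayed line only; pass the full sequence
# to compare against the reported GC content.
first_line = "ATGAAAAAGA TCTCTGTCAT GCTGGCAGCG ACGGCCCTGA TTTCGGTCAT GGCGACGTCG"
print(f"{gc_content(first_line):.1f}%")
```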
 
Protein sequence
MKKISVMLAA TALISVMATS AWSKTLVYCS EGSPEGFDPS LYTAGTTFDA SSRTVYSRLV 
EFKHGGTEIE PGLADSWSVS ADGTEYTFKL HPGVKYQTTD FFTPTRDFNA DDVVFSFERQ
LKADNPWNKY VEGGSYEYAA GMGFPDLIKS IEKVDDLTVK FTLNHPEAPF LADLAMDFAS
IVSKEYADKL AADGKMAQLN QQPLGTGPFT FVAYQPDAVI RYKANETYFK GKEKIDDLVF
AITSDAAVRA QKLKAGECHL IPYPNAADVP ELKKDGNLTV MEQAGLNVGF LAYNTQMAPF
DKPEVRRALN MAINKQAIID AVFQGAAAVA KNPIPPTMWS YNDAVQDDKY DPDAAKKALA
DAGVKDLSMK VWAMPVSRPY MLNARRAAEL IQADFAKAGV KVEIVTHEWA EYLKLSSDVK
RDGAVILGWT GDNGDPDNFM DTLLGCDAVG GNNRAQWCNK EYDDLMTKAK LTADVGERTK
AYEQAQLIFK KEAPWATLDH SLVFVPMSKK VSGFFMDPLG IHRFDGVDVS E
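
The protein sequence corresponds to the gene sequence read codon by codon. A minimal Python sketch of that translation is given below, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). `translate` and `CODON_TABLE` are hypothetical names, and the sketch raises `KeyError` on any base other than A, C, G, or T.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display, and normalise case.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGAAAAAGA TCTCTGTCAT GCTGGCAGCG"))  # MKKISVMLAA
```

Applied to the full gene sequence, the result can be compared with the protein sequence once its display spaces are removed.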