Gene Rleg_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0422 
Symbol 
ID8011624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp438308 
End bp439903 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content60% 
IMG OID644823017 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002974271 
Protein GI241203175 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00166095 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00221721 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAGA TCTCTGTCCT GCTGGCAGCG ACGGCCTTGA TTTCCGTCAT GGCGACGTCG 
GCTTGGTCCA AGACCCTTGT TTATTGCTCC GAGGGCTCGC CGGAGGGCTT CGACCCGAGC
CTCTATACGG CAGGCACGAC CTTCGACGCG TCGTCGCGCA CGGTCTATAG CCGCCTCGTC
GAATTCAAGC ATGGCGGTAC CGAGATCGAA CCGGGCCTGG CCGACAGCTG GAGCGTTTCG
GCCGACGGCA CGGAATACAC CTTCAAGCTT CATCCTGGCG TCAAGTACCA GACCACCGAC
TTCTTCACGC CGACGCGCGA TTTCAACGCC GACGACGTCG TGTTCTCCTT CGAGCGCCAG
CTGAAATCCG ACAATCCGTG GAACAAGTAT GTCGAGGGCG GCTCTTACGA ATACGCCGCC
GGCATGGGCT TTCCCGAGCT GATCAAGTCT GTCGAGAAGG TCGATGACCT CACCGTCAAG
TTCACGCTCA ACCACCCCGA AGCGCCGTTC CTCGCCGACC TCGCCATGGA CTTCGCCTCG
ATCGTCTCCA AGGAATATGC CGACAAGCTT GCCGCCGACG GCAAGATGGC GCAGCTCAAC
CAGCAGCCGC TCGGCACCGG CCCGTACACC TTCGTCGCCT ACCAGCCGGA TGCCGTCATC
CGCTACAAGG CGAACGAAAC CTATTTCAAG GGCAAGGAAA AGATCGACGA TCTGGTTTTC
GCCATTACCT CTGACGCCGC CGTGCGCGCC CAGAAGCTGA AGGCCGGCGA ATGCCACCTG
ATCCCCTATC CGAATGCAGC CGACGTACCC GAACTGAAGA AGGATGAAAA TCTGACCGTT
CTGGAACAGG CCGGCCTCAA TGTCGGCTTC CTCGCTTACA ACACCCAGAT GGCCCCGTTC
GACAAGCCGG AAGTTCGCCG TGCGCTGAAC ATGGCGATCA ACAAGCAGGC GATCATCGAC
GCCGTCTTCC AGGGTGCCGC GGCCGTTGCC AAGAACCCGA TCCCGCCGAC GATGTGGTCC
TATAACGACG CCGTTCAGGA CGACAAGTAC GATCCGGACG CTGCCAAGAA GGCTCTTGCC
GATGCTGGCG TCAAGGATCT CAGCATGAAG ATCTGGGCAA TGCCGGTGTC GCGTCCCTAC
ATGCTGAACG CGCGCCGCGC CGCCGAACTG ATGCAGGCGG ATTTCGCCAA GATCGGTGTC
AAGGTCGAGA TCGTCACCCA TGAATGGGCC GAATATCTGA AGCTCTCCTC CGACGTGAAG
CGCGACGGCG CCGTCATCCT CGGCTGGACC GGCGACAACG GCGACCCGGA CAACTTCATG
GATACGCTGC TTGGCTGCGA TGCCGTCGGC GGCAACAACC GTGCTCAGTG GTGCAACAAG
GAATATGACG ACCTGATGAC CAAGGCCAAG CTGACGGCCG ATGTCGGCGA GCGCACCAAG
GCCTATGAGC AGGCGCAGCT GATCTTCAAG AAGGAAGCTC CCTGGGCAAC CATCGACCAT
TCGCTCGTCT TCGTTCCGAT GAGCAAGAAG GTCTCGGGCT TCCAGATGGA CCCGCTCGGC
ATTCACCGTT TCGACGGCGT CGACGTATCC GAATAA
 
Protein sequence
MKKISVLLAA TALISVMATS AWSKTLVYCS EGSPEGFDPS LYTAGTTFDA SSRTVYSRLV 
EFKHGGTEIE PGLADSWSVS ADGTEYTFKL HPGVKYQTTD FFTPTRDFNA DDVVFSFERQ
LKSDNPWNKY VEGGSYEYAA GMGFPELIKS VEKVDDLTVK FTLNHPEAPF LADLAMDFAS
IVSKEYADKL AADGKMAQLN QQPLGTGPYT FVAYQPDAVI RYKANETYFK GKEKIDDLVF
AITSDAAVRA QKLKAGECHL IPYPNAADVP ELKKDENLTV LEQAGLNVGF LAYNTQMAPF
DKPEVRRALN MAINKQAIID AVFQGAAAVA KNPIPPTMWS YNDAVQDDKY DPDAAKKALA
DAGVKDLSMK IWAMPVSRPY MLNARRAAEL MQADFAKIGV KVEIVTHEWA EYLKLSSDVK
RDGAVILGWT GDNGDPDNFM DTLLGCDAVG GNNRAQWCNK EYDDLMTKAK LTADVGERTK
AYEQAQLIFK KEAPWATIDH SLVFVPMSKK VSGFQMDPLG IHRFDGVDVS E