Gene Rleg_4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4446 
Symbol 
ID8015212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4579098 
End bp4580930 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content57% 
IMG OID644827021 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002978223 
Protein GI241207127 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.215863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCTC TGTGGTCGAA GATCGGTTTG TTTCTATCGC TTGCGGGTGC TCTGGCGCCA 
ATGTCGGCAA CGGGTCAGGA CCAGCCCTTT CAGATCGGAA GCTCGGTCAT CAGCGAGATG
AAGTACAAGC CGGGCTTTGC GCATTTCGAC TACGTCAATC CCGATGCACC GAAAGGCGGA
GATCTGCGCC TCTCCGCAAG CGGCGCCTTC GACACCTTCA ACCCGCTGCT CGCCAAAGGC
CAGGCGGCAG TGGGCCTGAC GCTCGTTTAC GACACGTTGA TGAAGCCCGC CGACGACGAG
CTGCTCGTCT CCTACGGTCT GCTTGCCGAG GGATTGTCTT TTCCCGCTGA CGTCTCAAGC
GCGACCTTCC GCCTGCGCAA GGAAGCGAAA TGGGCGGATG GTCAGCCGGT CACGCCGGAA
GACGTCATCT TCAGTCTGGA CAAGACCAAG GAATTAAATC CCCTCGCCTC GAACTATTAC
CACCACGTTG TCAAAGCGGA AAAGACCGGC GAGCGTGACG TCACCTTCAC CTTCGATGAG
AAGAACAACC GCGAACTGCC GAATATTCTC GGCCAGTTGA TGGTCGTGCC GAAGCATTGG
TGGGAAGCGC CCGGACCGGA TGGCAAGCCG CGCGACATTT CTAAAACGAC GCTGGAGCCC
GTGATGGGTT CGGGGCCGTA CAAGATCGCT TCCTTCTCGC CTGGCGCAAC GATCCGTTAT
GAATTGCGTG ACGACTATTG GGGCAAGGAC CTCAATGTGA ATGTCGGCCA GAACAATTTC
CGCAACGTCA ACTACACCTA TTTCGGTGAT CGGGATGTCG AGTTCGAGGC CTTTCGCGCT
GGCAACAGCG ACTACTGGCA GGAAACCACG GCTGCCCGCT GGGCGACGGG ATATGATTTT
CCCGCGGTGA AGGAAGGGCG TGTCAAAAAA GAGGAGGTTG CAAACCCCCT GCGCGCCACC
GGCATCATGC AAGCTCTCGT GCCCAACATG CGACGTGACC TCTTCAAGGA CATCAGGGTC
CGCGAGGCGC TGAACTACGG TCTGGATTTT GAGGAACTGA ACCGCACCGT TGCCTTTAAT
AGTTACAAAC GCATCGACAG CTATTTCTGG AATACCGAAC TCGCCTCCTC CGGCCTGCCG
CAGGGTAAAG AACTGGAAAT TCTGCAGGGC ATGAAGGATA AGGTTCCGCC CGAAGTCTTC
ACCACGCCCT ATACCAATCC CGTCGGCGGC GATCCGCAAA AGAGCCGCGA CAACCTCCGC
AAGGCGATTG CGCTTTTCAA AGAAGCCGGC TGGGAGCTCA AGGGCAATCG CATGGTCAAT
ACCAAGACTG GCCAGCCGAT GAGTTTCGAG ATCCTGTTGT CGAGCCCCAT GCTGGAGCGC
TGGGCGGTGC CCTATGCCAA CAATCTCAGG AAAATCGGCA TCGATGCGCG GATCCGGACA
GTCGATGCGT CGCAATCTGT CAATCGTGAA CGCAGCTTCG ACTACGATAT GATCTGGAAT
GTCTGGGCGG AGACCATGAA TCCGGGCAAC GAACAAGCCG ACTATTGGGG ATCCGGTTCG
GTCAATCAGC AGGGTTCCCG CAATTATGCC GGCATTGCCA ACCAAGCCGT TGATGAGCTC
ATTCGCATGA TTATCTTCGC GCCGAACCGC GGCGAGCAGA TCGCAGCAAT CAAGGCCATG
GATCGGGTCT TGCTTGCAAA TCACTACGTC ATCCCGCTGT TCTACCGCGA TACCTATAAC
ATCGCCTATT GGAACACGGT CACGCATCCG GCCGAGTTTC CGGCCTACAG CCTTGGCTTC
CCCGATGCCT GGTGGTCGAC CTCGGCAAAA TGA
 
Protein sequence
MAALWSKIGL FLSLAGALAP MSATGQDQPF QIGSSVISEM KYKPGFAHFD YVNPDAPKGG 
DLRLSASGAF DTFNPLLAKG QAAVGLTLVY DTLMKPADDE LLVSYGLLAE GLSFPADVSS
ATFRLRKEAK WADGQPVTPE DVIFSLDKTK ELNPLASNYY HHVVKAEKTG ERDVTFTFDE
KNNRELPNIL GQLMVVPKHW WEAPGPDGKP RDISKTTLEP VMGSGPYKIA SFSPGATIRY
ELRDDYWGKD LNVNVGQNNF RNVNYTYFGD RDVEFEAFRA GNSDYWQETT AARWATGYDF
PAVKEGRVKK EEVANPLRAT GIMQALVPNM RRDLFKDIRV REALNYGLDF EELNRTVAFN
SYKRIDSYFW NTELASSGLP QGKELEILQG MKDKVPPEVF TTPYTNPVGG DPQKSRDNLR
KAIALFKEAG WELKGNRMVN TKTGQPMSFE ILLSSPMLER WAVPYANNLR KIGIDARIRT
VDASQSVNRE RSFDYDMIWN VWAETMNPGN EQADYWGSGS VNQQGSRNYA GIANQAVDEL
IRMIIFAPNR GEQIAAIKAM DRVLLANHYV IPLFYRDTYN IAYWNTVTHP AEFPAYSLGF
PDAWWSTSAK