Gene Rleg2_4201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4201 
Symbol 
ID6982974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4375382 
End bp4376908 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content62% 
IMG OID643398932 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002283689 
Protein GI209551772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGG TTTCCTTCGT ACCGGCGCGC TTCGCCGGGC AACTTTCCCT CAGGGCGGCG 
CTTGCTGCGG GACTTATGAT GGCGGCGATG GCGCCGGCCG AGGCAGCGAA GACCACTTTC
ACTCTCGGCA TGAGCGTCGA GCCGACCGGC CTCGATCCGA CGATCGCGGC ACCGGTGGCG
ATCGGCCAGG TGACCTGGCA GAACGTGTTC GAGGGACTGG TGACGATCGA CCAGGCCGGC
AAGATCCAGC CGCAGCTGGC AAAAAGCTGG GAGATTTCGC CTGACGGCCT GACCTATACG
TTCAAGCTGC AGACCGGCGT CACATTCCAT GACGGCGAGG TCTTCGATGC CGCCAGCGCC
AAGTTTACGC TCGACCGCGC CCGCGGCGCG GATTCGGTCA ATCCGCAGAA ACGCTTCTTC
GCCTCGATCG CTTCGATCGA TACGCCCGAT GCCGAAACGC TGGTGCTGCA TCTTTCGGCG
CCGACCGGCA GCCTGATCTA CTGGCTCGGC TGGCCGGCCT CGGTGATGGT CGCGCCCAAG
ACCGCGGCCG ACGACAAGGC GACGCCGATC GGCACCGGCC CCTTCAAATT TGTCGGCTGG
GCGAAGGGCG ACAAGGTCGA ACTGGAAAAG AATGCCGATT ACTGGAACAA GGCTGCGGCC
GCCAAGCTCG ACAAGGTGAC CTTCCGCTTT ATCGCCGATC CGCAGGCGCA GGCGGCTGCG
CTGAAATCCG GCGATCTCGA TGCCCTTCCG GAATTTGCCG CACCGGAGCT GATGAGTTCT
TTCGAGGGCG ATGCAAGGCT TGTCACCAGG ATCGGCAATA CCGAGCTCAA GGTCGTCGCC
GGCATGAACA ATGCCAGGAA GCCGTTCGAC GACAAACGCG TGCGCCAGGC GCTGATGATG
GCGATCGACC GCAAGACGGT GATCGACGGC GCATGGTCCG GCCTCGGCAC GCCGATCGGC
AGCCATTACA CGCCGAACGA TCCCGGCTAT CAGGATATGA CGGGCGTGCT GCCTTACGAT
GTCGAGAAGG CGAAGGCGCT GCTGGCAGAA GCCGGCTATC CCAACGGCTT CACCTTCACG
ATCAAATCGC CGCAGATGGC CTATGCGCCG CGCAGCGCTG AGGTAATGCA GGCGATGTTT
GCCGAGATCG GCGTGACGAT GAATATCGAG CCGACCGAGT TTCCGGCAAA ATGGGTCCAG
GACATCATGA AGGACCGCAA CTTCGACATG ACGATCGTCG CCCATGCCGA GCCGCTCGAC
ATCGACATCT ACGGGCGCGA TCCCTATTAT TTCAACTATA AGAACCCGGT GTTTACCGCG
CTGATGAAGA AGGTCCAGGA GACCGCCGAT CCCGCCACGC AGAACGCGAT CTACGGCGAG
ACGCAGAAGA TCCTCGCCGA GGACGTGCCG GCGCTCTACC TCTTCGTCAT GCCGAAACTC
GGCGTCTGGG ACGGAAAGCT GAAGGGGTTG TGGGAGAACG AGCCGATGCC GTCCAATGTG
CTGTCCGGGG TTTCGTGGGA GGAGTGA
 
Protein sequence
MIKVSFVPAR FAGQLSLRAA LAAGLMMAAM APAEAAKTTF TLGMSVEPTG LDPTIAAPVA 
IGQVTWQNVF EGLVTIDQAG KIQPQLAKSW EISPDGLTYT FKLQTGVTFH DGEVFDAASA
KFTLDRARGA DSVNPQKRFF ASIASIDTPD AETLVLHLSA PTGSLIYWLG WPASVMVAPK
TAADDKATPI GTGPFKFVGW AKGDKVELEK NADYWNKAAA AKLDKVTFRF IADPQAQAAA
LKSGDLDALP EFAAPELMSS FEGDARLVTR IGNTELKVVA GMNNARKPFD DKRVRQALMM
AIDRKTVIDG AWSGLGTPIG SHYTPNDPGY QDMTGVLPYD VEKAKALLAE AGYPNGFTFT
IKSPQMAYAP RSAEVMQAMF AEIGVTMNIE PTEFPAKWVQ DIMKDRNFDM TIVAHAEPLD
IDIYGRDPYY FNYKNPVFTA LMKKVQETAD PATQNAIYGE TQKILAEDVP ALYLFVMPKL
GVWDGKLKGL WENEPMPSNV LSGVSWEE