Gene Rleg_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3103 
Symbol 
ID8014011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3102504 
End bp3104084 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content58% 
IMG OID644825670 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002976898 
Protein GI241205802 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.529212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.177978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATT TCGCGAAAAA ATTTCTCGCC TCTGCAATGC TTGGCACATT GCTGGCGTTT 
TCGGCCCACG CGGCCACGCT CAACATTCAC AATGGTGGCG ACCCGCAGTC GCTCGATCCG
CAGAAGCTTT CCGGCGACTG GGAGAACCGC ATCGCCGGCG ACATTTTTGA AGGCCTCGTC
ACTGAGGACG CCAAGGATAA TCCGATCCCC GGCCAGGCTG AAAGCTGGAC AATTTCGCCT
GACGGCAAGG TCTACACCTT CAAGCTTCGC GACGGCATCA AGTGGTCCGA TGGCCAGCCG
GTAACGGCAG GAGACTTCGT CTTCGCCTTC CAGCGCCTCG TCGACCCTAA GAACGCCGCC
GACTACGCTT ATCTCCAGTT CACCATCAAG AACGCGGAAA AGATCAACAA GGGTGAGATC
ACCGATCTCA ACCAGCTCGG CGTCAAGGCG ATCGACGACA AGACGCTCGA AATCACCCTC
GAAAACTCCA CCCCTTATTT CCTCAATGCC CTGATGCACT ACACCGCCTA TCCGCTGCCG
AAGCATGTCG TCGAGGCGAA GGGGCAGGAT TGGGTCAAGA TCGGCAACAT CGTCACCAAC
GGCCCCTACA AGCCCGTCGA ATGGGTTCCG GGCTCGCATG TCACGACAGT CAAGAACGAT
CAGTGGTATG ACACCAAAGA CCTGAAGATC GACGGAGCGA AGTTCTTCGT GCTCGAAGAC
CAGGAAGCCG CGCTGAAACG CTACCGCGCC GGCGAATTCG ACATCCTCAC CGACTTCCCA
ACCGACCAAT ACGAGTGGAT GAAGAAAAAC CTGCCGGGCC AGGCGCATGT CGCCCCCTTC
TCTGGCCTCT ATTATTACGT CGTCAACTCG CAGAAACCGC CCTTCAGCGA CAAGCGCGTC
CGCCAGGCTC TCTCCATGGC GATCAACCGC GAGGTCATCG GCCCGCAGAT CCTCGGCACC
GGCGAACTGC CGGCCTATTC CTGGGTTCCG CCGGGCACGG CGAATTACGG CGAACCGGCC
TATGTTAGCT GGAAGGACCT GCCCTACAGC GAGAAGGTCG CCGAAGCCAA GAAGCTCCTG
ACCGAAGCCG GTTTCGGCCC CGACAAGCCG CTTCACGCCG TGCTGAGCTA CAACACCAAC
GACAACCACA AGCGCATCGC CGTCGCCATC GCATCCATGT GGAAGCCGCT TGGCGTCGAT
GTCGAACTCG TCAATGCCGA AACCAAAGTG CATTACGACC AGATGCAGCG TGGTCAAGTC
GAAATCGGCC GCGCCGGCTG GCTCGCCGAC TACAACGACC CTGATAATTT CCTGAACCTC
CTGGTGACAG GCGTGCAGAT GAACTACGGC CGCTGGTCGA ATCCCGAGTA CGACAAGATG
ATCAAGGAAG GCAACGCCGA GACGGATCTC ACCAAGCGTG CCGCGATCTT CAAGAAGGCC
GAACAGCTGG CGCTGGATGA ATCCGCCGCC CTGCCGATCT ACTACTATGT CTCGAAGAAC
GTCGTTTCGC CGAAGATCGA AGGCTTCGTC GACAACATCC AAGACATCCA CCGCACCCGC
TGGCTGTCGA TGAAAGAGTA A
 
Protein sequence
MNHFAKKFLA SAMLGTLLAF SAHAATLNIH NGGDPQSLDP QKLSGDWENR IAGDIFEGLV 
TEDAKDNPIP GQAESWTISP DGKVYTFKLR DGIKWSDGQP VTAGDFVFAF QRLVDPKNAA
DYAYLQFTIK NAEKINKGEI TDLNQLGVKA IDDKTLEITL ENSTPYFLNA LMHYTAYPLP
KHVVEAKGQD WVKIGNIVTN GPYKPVEWVP GSHVTTVKND QWYDTKDLKI DGAKFFVLED
QEAALKRYRA GEFDILTDFP TDQYEWMKKN LPGQAHVAPF SGLYYYVVNS QKPPFSDKRV
RQALSMAINR EVIGPQILGT GELPAYSWVP PGTANYGEPA YVSWKDLPYS EKVAEAKKLL
TEAGFGPDKP LHAVLSYNTN DNHKRIAVAI ASMWKPLGVD VELVNAETKV HYDQMQRGQV
EIGRAGWLAD YNDPDNFLNL LVTGVQMNYG RWSNPEYDKM IKEGNAETDL TKRAAIFKKA
EQLALDESAA LPIYYYVSKN VVSPKIEGFV DNIQDIHRTR WLSMKE