Gene Rleg2_3852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3852 
Symbol 
ID6982615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3993824 
End bp3995467 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content58% 
IMG OID643398574 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002283340 
Protein GI209551423 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.479782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATAA CCAAGCTTAG CCGCAATTTC CGCATGCTTT CCACGGGGGC TGCTCTTTCG 
CTCCTGATGA TGACCGCACC CTCCGCCTTC GCCGAGACGC CCAAGGATAC GCTGGTCGAG
GGCTTTGCCA TCGACGACAT CATTACGATG GATCCGGGCG AGGCGTTCGA GCTTTCGACC
GCCGAAATCA CCAGCAATAG CTACAGCCTG CTTGTCCGTC TCGACATGGA CGACACGTCC
AAGGTCAAGG GCGATCTGGC CGACAGCTGG AGCGTTTCCG ACGACGGTCT GACCTATACG
TTCAAGCTGA AATCCGGCAT GAAATTCGCC TCCGGCAACC CGATCACCGC CGAAGATGTT
GCTTGGTCGT TCGAGCGCGC CGTCAAGCTC GACAAGAGCC CGGCCTTCAT CCTCACTCAG
TTCGGCCTGA CCGGCGACAA CGTCAGCGAA AAGGCCAAGG CGGCCGATGC CGGCACTTTC
GTCTTCACCG TCGACAAGGC CTATGCGCCG AGCTTCGTTC TCAACTGCCT GACGGCGACG
GTTGCCTCCG TCGTCGACAA GAAGCTGGTG ATGGACCATG TGAAGGCGGT AACACCAGAT
GCCGAGCACA AATACGACAA TGATTTCGGC AATGAATGGC TGAAGACCGG CTATGCCGGC
TCCGGAGCCT TCAAGCTGCG CGAATGGCGC GCCAATGAAG TGGTCGTTCT CGAGCGCAAC
GACAATTATT ACGGCGACAA GGCAAAGCTC AACCGCGTCA TCTACCGCTA CATGAAGGAA
AGCTCGGCCC AGCGGCTGGC GCTCGAAGCC GGCGATATCG ATATCGCCCG CAACCTCGAG
CCTGGCGACA TCGACGCCGT TTCGAAAAAT GCCGATCTCG CGACGACGAG TGCGCCGAAG
GGCACGATCT ATTATGTCAG CCTCAACAAC AAGAACGAGA ACCTGAAGAA GCCCGAGGTG
CAGGAAGCCT TCAAATATCT GGTCGACTAT GATGCGATCG GCGCGACCTT GATCAAGGGT
ATCGGCGAAA TCCACCAGAC CTTCCTGCCG AAGGGCCAGC TCGGCGCGCT CGACGAAAAT
CCCTACAAGC TCGATGTCGC CAAGGCCAAG GAACTGCTGG CCAAGGCCGG CGTGCCCGAC
GGTTTCTCGA TCACCATGGA CGTGCGCAAC AGCCAGCCGG TGACCGGTAT CGCCGAATCC
ATGCAGCAGA CGCTGGCGCA GGCCGGCGTG AAGATGGAAA TCATCCCGGG TGACGGCAAG
CAGACGCTGA CCAAATACCG CGCCCGCACA CATGATATGT ATATCGGCCA GTGGGGTTCG
GACTATTTCG ATCCGAATTC CAACGCCGAT ACCTTTACCG GCAATCCCGA CAATTCCGAT
GCCGGCACGG TCAAGACGCT CGCATGGCGC AACACCTGGG AGGCGCCGGA GCTCGACAAG
GAAGCCAAGG CAGCTCTTCT CGAACGTGAT GCCGCCAAAC GCGCCGCCAT ATATCAGGAC
ATCCAGAAGA AATACCTGGC AAACAGCCCC TTCGTCTTTA TCTTCCAGCA GACCGAAGTG
GCCGGTTACC GCAAGAACCT CAAGGACTTC AAGTTGGGTC CGAGCTTCGA TACCAATTTC
GTCGGTCCGA TCGCCAAGGA ATAG
 
Protein sequence
MMITKLSRNF RMLSTGAALS LLMMTAPSAF AETPKDTLVE GFAIDDIITM DPGEAFELST 
AEITSNSYSL LVRLDMDDTS KVKGDLADSW SVSDDGLTYT FKLKSGMKFA SGNPITAEDV
AWSFERAVKL DKSPAFILTQ FGLTGDNVSE KAKAADAGTF VFTVDKAYAP SFVLNCLTAT
VASVVDKKLV MDHVKAVTPD AEHKYDNDFG NEWLKTGYAG SGAFKLREWR ANEVVVLERN
DNYYGDKAKL NRVIYRYMKE SSAQRLALEA GDIDIARNLE PGDIDAVSKN ADLATTSAPK
GTIYYVSLNN KNENLKKPEV QEAFKYLVDY DAIGATLIKG IGEIHQTFLP KGQLGALDEN
PYKLDVAKAK ELLAKAGVPD GFSITMDVRN SQPVTGIAES MQQTLAQAGV KMEIIPGDGK
QTLTKYRART HDMYIGQWGS DYFDPNSNAD TFTGNPDNSD AGTVKTLAWR NTWEAPELDK
EAKAALLERD AAKRAAIYQD IQKKYLANSP FVFIFQQTEV AGYRKNLKDF KLGPSFDTNF
VGPIAKE