Gene Rleg_2115 details

Gene Information

Locus tag: Rleg_2115
Symbol:
ID: 8013138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2105623
End bp: 2107284
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 58%
IMG OID: 644824701
Product: alpha amylase catalytic region
Protein accession: YP_002975931
Protein GI: 241204835
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
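
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both are assumptions, since the record does not state its conventions, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2105623
end_bp = 2107284

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1662
print(protein_length)  # 553
```

Under these assumptions the computed values agree with the listed gene length (1662 bp) and protein length (553 aa).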


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.232775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0296214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAACG ACCTCTGGTA TAAGAACGCT GTCATTTACT GCCTGTCCGT CGAGACCTTC 
ATGGATGCGA ACGGTGACGG CGTCGGTGAT TTTCAGGGTC TGATGCGGCG CCTCGATTAT
CTCTCCGGCC TCGGCGTGAC GGCGATCTGG CTCATGCCGT TCCAGGCATC GCCCGGTCGC
GACGACGGTT ATGACGTCTC CGATTATTAC AATGTCGATC CGCGTTATGG TTCGCTCGGC
GATTTCGTCG AATTCACCCA CGGCGCCAAG CAGCGCGGTA TCCGGGTACT GATCGATCTG
GTGATCAATC ATACATCCAA AGACCATCCC TGGTTCCAGG ACGCCAGGAG CGATCCGCGT
TCGCGCTATC GCGACTGGTA CGTCTGGTCG GAGAGAAAAC CTGATAATGC CGATCAGGGC
ATGGTCTTCC CCGGCGTCCA GAAGACCACT TGGACCTATG ACGATAGAGC CAAGGCCTAT
TACTTTCACC GCTTCTACGA TCACCAGCCC GATCTCAATA CATCGAACCC CGAGGTGCAG
GCAGAAATCC TGAAGATCAT GGGCTTCTGG ATCCAGCTCG GCGTCTCCGG CTTCCGAATG
GATGCCGCCC CCTTCATCAT CGCAACGAAA GGCGCCGACG TTACCAAGCC TGTCGAACAG
TTCGATATGC TGAGGAAATT TCGCGAATTC CTGCAGTGGC GGCTGGGCGA TTCCATCATC
CTGGCAGAGG CCAACATCCT GCCGAAGGAC AATTTCGAAT ATTTCGGCGA TGACGGCGAC
CGCATGCAGA TGATGTTCAA TTTCCAGGTC AATCAAGCGC TATTTTATGC TTTCGCCAGC
GCCGATACCC GGCCGCTCAA GAAGGCCATG GAGGCCACCA AACCGCGCCC TGCGACCGCG
CAATGGGGCC TCTTTCTCCG CAACCATGAT GAACTGGATC TCGGCCGGCT GACGGAAAAA
CAACGCGCCG CGGTATTTGC CGCCTTCGGG CCTGAAAAGG ACATGCAGCT TTACGACAGG
GGCATTCGCC GTCGCCTCGC GCCCATGCTC GGCGGCGACC ACCGCAGGAT CGAAATGGCC
TACAGCCTGC TGTTTTCACT GCCTGGAACG CCCGTCATCC GATACGGCGA CGAAATCGGC
ATGGGCGACG ATCTAGGCCT GCCCGAGCGC AATTGCGCAC GCACGCCGAT GCAGTGGTCG
ACCGAACCGG AAGGCGGGTT CACCAAAAGT GAAAAGCCGA TCTCGCCCGT TATCAAGGAC
GGTCCCTACG GTTTCCAGCA TGTCAATGTT GCCGAACAGC GGCGCGATCC CAATTCTTTG
CTGAACTGGA CCGAGCGGAT GATCCGGATG CGCAAGGAAG CCCCTGAGAT CGGCTGGGGC
GACTTTTCTG TAATCGACAC CGGGGATGAC GGCGTGCTGG CGCTTCGTTA TGACTGGCGC
GGCAATTCCG TGTTGATCCT GCACAATCTG CATGCGCAGC CGGCGGAAGT GACCTTCGAT
CCCGAGATAG GCGAAGACGG GCGGCAATTA ATCGATATCG CCGATGGCGC GAGCAGCAAA
GCGGACGAAA AAGGATTTCA TACCGTCATG CTCGATGCCT ACGGCTATCG CTGGTATCGC
GTCGGCGGTC TCGATTATCT CCTCAGGCGG ACGGAGATTT AG
 
Protein sequence
MINDLWYKNA VIYCLSVETF MDANGDGVGD FQGLMRRLDY LSGLGVTAIW LMPFQASPGR 
DDGYDVSDYY NVDPRYGSLG DFVEFTHGAK QRGIRVLIDL VINHTSKDHP WFQDARSDPR
SRYRDWYVWS ERKPDNADQG MVFPGVQKTT WTYDDRAKAY YFHRFYDHQP DLNTSNPEVQ
AEILKIMGFW IQLGVSGFRM DAAPFIIATK GADVTKPVEQ FDMLRKFREF LQWRLGDSII
LAEANILPKD NFEYFGDDGD RMQMMFNFQV NQALFYAFAS ADTRPLKKAM EATKPRPATA
QWGLFLRNHD ELDLGRLTEK QRAAVFAAFG PEKDMQLYDR GIRRRLAPML GGDHRRIEMA
YSLLFSLPGT PVIRYGDEIG MGDDLGLPER NCARTPMQWS TEPEGGFTKS EKPISPVIKD
GPYGFQHVNV AEQRRDPNSL LNWTERMIRM RKEAPEIGWG DFSVIDTGDD GVLALRYDWR
GNSVLILHNL HAQPAEVTFD PEIGEDGRQL IDIADGASSK ADEKGFHTVM LDAYGYRWYR
VGGLDYLLRR TEI
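
As a spot check, the start of the protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal example under stated assumptions: it uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons), and the helper name `translate` is hypothetical. Only the first 30 bases of the gene sequence are used.

```python
# Compact standard codon table: codons are enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGATCAACG ACCTCTGGTA TAAGAACGCT"
print(translate(first_block))  # MINDLWYKNA
```

The output matches the first ten residues of the protein sequence listed above.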