Gene Rleg_5097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5097 
Symbol 
ID8007689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp488783 
End bp492007 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content61% 
IMG OID644822011 
Productalpha amylase catalytic region 
Protein accessionYP_002973271 
Protein GI241113436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.511822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0191149 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTCT CTCCAAACCA GGGTGTGTTG AGCACCCCAC CCCGGATCTA TTATGTCAAT 
CCACTCCTTC TTCAGGGTAT CGATGCATGG CGCGAGGTTT TCGACCATGC GGCGGATACC
GGTTTCGACC GCGTGCTGAC GGCACCTCTC TTCGACCGGG GCGGGGAGCG CAGCATCTTT
GCCTCCCAGG ATCTGAAGCG CCTTGATCCG CAACTGTCGC TCGGAAGCGC GGTCGAAAAT
GGGGTGGGAC GCCTGGCGGA GGCCGCCCGT AAGAGCGGCG TCGCCTTGAT GATGGACCTG
ATGCTCGACG GCAAGGCCCG GGATCCCAAG GTGGGCTTTC ACCCTGTCGA TCCGAGACGC
TCGCCATTGG ATCCGGCAGA GCCGATGACT GCGATGGAGG CGGAGTCTCA ATCCCGGCTG
TTGGAAGAAT GGACAGAACG GCTGCGGGGG CTGGCCGGTC TTGGCCTCGC CGGCTATCGG
GCCCTGGGGA TCGATCGCAT TGGGCCGGCG GTTTTCAAAT CGCTGATTTC GGCGGTGCGC
GAGAAGACGG ACGCGCAATT CCTTGCCTGG ACGCCCGGCA CTGATTTCGG CGTGAGGGAC
GCGGTCAAGA ATACAGGCTT CGATGGATGT TTTTCATCGA TGGCCTGGTG GGATTTTGAC
GAGAGATGGT TCATCGAAGA ACACCGGGTC CAGAAGCCGC TCGGCTGGCA GATCGCTTTT
CCGGAACCGC CCTTCGGCAG GCGTATCGCC CACGGAACCG AGAGCCGCGA AATCCTCGAA
CGGCGCGCCG TTAGGGCGCT ACGCCTTGCC GCCTCGCTCG GCGGCGGCCT GATGGTTCCG
ATGGGCTTCG AATATGGCGC AGCCACGCCG CTCGATCCGA CCCATGGGAA TGGAACGGGT
TTGCGGGGAC TGCGCCATGA TCTGGCTTTC GATATCTCCT CCGAAATTCG CCTTGCGAAC
ACCGAGATCG GCAAGGAACC CCATGCACTT GCCTCATCGC TTCGGCTCAT CCGCAACGCC
AACGGACCGG TCTCGGCGCT GCTGCAATCG GCGGCGGAAG ACCCTCGCAG CGCCGAAAAT
GTGCGGTTCA TCCTGCTCAA CAGGGACCTT CGCAAAAGTG CGCCCGCTCC GGTAACAGCA
CTGCGCGAAG CGGGCTCCGG TTTCCTTCCC GTCGCAGCCG ACGGCACGGT TCTGCGGCTG
CGGGCAGGGG AGACCCTTGT CGTCGAAGGC AAGGCGCCGG CGCCGATCAC ATCGCGGCCA
ATCCTCGACG TCGCGCAGGC GATAGCTTCA CCCCGGCTCG CCATCGAAAA TATCCTGCCG
CGGGTCGACG ACGGGCGCTT TCCTGTCAAA CGGGTGGTCG GCGACATCCT CACCGTCGAG
GCCGATATAT TCGCCGACGG CCACGACCCG ATCGCCGTGG TCCTGCTCTG GCGGCCGCTC
GATGCGGCGG ACTGGAACGA GACGGAGATG CAGCTGGTCG AAAACGACCG CTGGCGCGCC
GAATTCCTGC TGGAGCGCAT CGGACGCTAT GAGTTTGCCG TCGAGGCATG GAAGAACCCG
TTTGCGATCT TTCGCTACGA GTTGACAAAG AAGAACGATG CCAGGCTGGA CCTGAAGCTC
GAACTCCAGG AGGGATTGAA CCTCATCCGC TCGGCTGAGG TGCATGCCGG CGCAACGCTC
AATGCCGAAT TGAAGGCCCT TGGTGATAGT CTCGAAGGGG CGTCCGATAC CGAGCGCACG
GCGATACTGC TCGATGCCGG AACATCCGAA TTGATGAACA AGGCGGACAA CAGGCCGTTT
CGGCTTCGCT CGACGGCAAG CGCCGTCGAT GCCGAACGCA AGGAGGCCGC CTTTGCCAGC
TGGTACCAGA TCTTCCCGCG CTCGCAGAGC GGCGATCCGG ACCGGCACGG CACCTTCGAC
GACGTCATCC CGCGACTGCC CGCCATCCGC GACATGGGCT TCGACGTGCT TTATTTCCCA
CCGATTCATC CGATCGGCTC GACCAACCGG AAAGGTCGCA ACAACACCCT AAAAGCTGCG
CCGGGCGACC CCGGAAGTCC CTATGCCATC GGCTCCGAGG ACGGCGGCCA TGACGCCATC
CATCCCGAGC TCGGCGAGTT CGAGGATTTC CGCCGGCTGG TCGATGCGGC AGGCCGGCAT
GGCCTGGAGA TTGCTCTCGA CCTCGCGATC CAGGCGTCGC CGGATCACCC CTGGCTGAAG
GAGCATCCCG GCTGGTTCGA CTGGCGCCCC GACGGCACGA TCAAATATGC TGAAAACCCG
CCCAAGAAAT ACGAGGACAT CGTCAACGTC GATTTCTACA CGAAAGACGC GCTGCCTTCA
TTATGGGTGG AGCTCAGGGA TGTCGTCCAG CTTTGGGTGG ACCAGGGCGT CAAGCTGTTT
CGCGTCGACA ATCCGCACAC AAAGCCATTT CCGTTCTGGG AATGGCTGAT CGGCGATATC
AGGGGCCGTC ATCCCGATGT CGTCTTCCTG TCGGAAGCCT TCACCAAGCC GAAGGTGATG
TACCGGCTGG CAAAGATCGG CTTCTCCCAA TCCTATACCT ACTTCACCTG GCGCAATGCC
AAGTGGGAGC TCGAGCAATA TATGCGGGAG CTGACCGAGA CGGCGCCGAA GGAATTCTTC
CGACCGCATT TCTTCGTGAA CACGCATGAT ATCAATCCGG ATTTCCTGCA GAACGCGCCG
CGCCCGGCCT TTCTGATCCG CGCAGCACTT GCCGCTACCC TGTCGGGATT GTGGGGCGTT
TATAACGGTT TCGAACTTTG CGAGGGGCGT CCCGATGCCA AGCGCAAGGA GTATGCCGAC
AGCGAGAAAT ACGAAATCCG CGCCTGGGAC TACGATCGGC CGGGCAATAT CATCGCCGAA
ATCAGGACGC TCAATCGCAT CCGCAACGAA AACACCGCGC TGCATTCGCA TCTCGGGCTG
ACGCTGCTGA ATGCGCGAAA TGACAATATC CTGTTTTTCG AGAAGGCGAG CCGTGCCCGC
GACAATGTCC TGCTGATCGC CATCAGCCTC GATCCCCACA ATTTCCAGCA GAGCGACGTC
GAGCTGCCGC TCTGGCAGTG GTCGCTGGGC GACGGCGGCA CGCTGGATGT CGAAGATCTG
ATCGGCGGGC ATCGTTTCAA GTGGACCGGC AAATGGCAGA GCATCAGTCT CAATCCTGAG
GTCCTGCCCT ATGCGATCTG GCGTATTCGC TCAACGGAGG CATGA
 
Protein sequence
MSFSPNQGVL STPPRIYYVN PLLLQGIDAW REVFDHAADT GFDRVLTAPL FDRGGERSIF 
ASQDLKRLDP QLSLGSAVEN GVGRLAEAAR KSGVALMMDL MLDGKARDPK VGFHPVDPRR
SPLDPAEPMT AMEAESQSRL LEEWTERLRG LAGLGLAGYR ALGIDRIGPA VFKSLISAVR
EKTDAQFLAW TPGTDFGVRD AVKNTGFDGC FSSMAWWDFD ERWFIEEHRV QKPLGWQIAF
PEPPFGRRIA HGTESREILE RRAVRALRLA ASLGGGLMVP MGFEYGAATP LDPTHGNGTG
LRGLRHDLAF DISSEIRLAN TEIGKEPHAL ASSLRLIRNA NGPVSALLQS AAEDPRSAEN
VRFILLNRDL RKSAPAPVTA LREAGSGFLP VAADGTVLRL RAGETLVVEG KAPAPITSRP
ILDVAQAIAS PRLAIENILP RVDDGRFPVK RVVGDILTVE ADIFADGHDP IAVVLLWRPL
DAADWNETEM QLVENDRWRA EFLLERIGRY EFAVEAWKNP FAIFRYELTK KNDARLDLKL
ELQEGLNLIR SAEVHAGATL NAELKALGDS LEGASDTERT AILLDAGTSE LMNKADNRPF
RLRSTASAVD AERKEAAFAS WYQIFPRSQS GDPDRHGTFD DVIPRLPAIR DMGFDVLYFP
PIHPIGSTNR KGRNNTLKAA PGDPGSPYAI GSEDGGHDAI HPELGEFEDF RRLVDAAGRH
GLEIALDLAI QASPDHPWLK EHPGWFDWRP DGTIKYAENP PKKYEDIVNV DFYTKDALPS
LWVELRDVVQ LWVDQGVKLF RVDNPHTKPF PFWEWLIGDI RGRHPDVVFL SEAFTKPKVM
YRLAKIGFSQ SYTYFTWRNA KWELEQYMRE LTETAPKEFF RPHFFVNTHD INPDFLQNAP
RPAFLIRAAL AATLSGLWGV YNGFELCEGR PDAKRKEYAD SEKYEIRAWD YDRPGNIIAE
IRTLNRIRNE NTALHSHLGL TLLNARNDNI LFFEKASRAR DNVLLIAISL DPHNFQQSDV
ELPLWQWSLG DGGTLDVEDL IGGHRFKWTG KWQSISLNPE VLPYAIWRIR STEA