Gene Information |
Locus tag | Rru_A0507 |
Symbol | |
ID | 3834205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 599246 |
End bp | 602065 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637824591 |
Product | Alpha amylase, catalytic region |
Protein accession | YP_425598 |
Protein GI | 83591846 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
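The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 599246
end_bp = 602065
reported_gene_length = 2820     # bp
reported_protein_length = 939   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.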
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.213544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCG ATCTCTATGC GCCGACCGCC ACCTATCGGC TGCAGTTCCA TGCCGGGTTC ACCTTCACCG ATGCCGCCGC CCAGGTTGGC TATCTGCGCG ATCTGGGGGT CAGCCATCTT TACGCCTCGC CCATTCTCAA GGCGCGGCCG GGATCGACCC ATGGCTATGA CATCATCGAC CATGGCGCGC TCAATCCCGA ACTGGGCGGC GAGCGCGGCT TCGCCCAGTT ATCCGAGGCG CTGGCCGGCG CCGGCCTGGG GTTGATCATC GACATCGTGC CCAATCACAT GGGCATTGGT GCGGCTGATA ACGGCTGGTG GCTGGACGTG CTGGAATGGG GGCGCGGCGG ACGCTACGCC GGCTATTTCG ATATCGACTG GTTTCCCGCC ACCCCGGGCT TGCGCGAAAA GGTGGTGCTG CCGGTGCTGG GCGATCTCTA TGGCCGGGTG CTCGATGCCG GCGATCTGGT GGCGCGCTTC GATGACGCCG ACGGCTCGTT CAGCATCTGG TACCACGAGC ATCGCTTTCC GGTCTGCCCG GCGACCTATG CCACTATTCT GGATCTGTGT CTGAAAGAGG TCGCCCTGCC CGAGGCGGCG GCCCTGATGA CCGAGGCCCG GCGGCTGCGC GGCACCCCGC GCTCCGATAT TCGCCGCAAA GCCCAGCGCA CCCGGGGCGA GACCTTCAAG CGCAGCCTGC GCGAGGCGGC CGCCACCCCG GCCCTGGCCG CAGCCCTGGC CTCGGTGACC ACGCTGTTCG GACCGGCCGC CACTGACGGC AAGGGGCTGA CCCGTCTGCA CGCCCTGCTC GAAAACCAGC ATTACCGGCC CTCGTTCTGG CGGATCGCCG GCCATGAGAT CAATTACCGC CGCTTCTTTC AGATCAATGA TCTGGCCGGC TTGCGGGTCG AGGAGAAGGA GGTCTTCGAC GCCTCGCACG CCCTGATCGG CGACCTTGTC GGCAGCGGCC GGGTGCATGG GGTGCGGGTT GATCATATCG ACGGCCTGCT TGACCCCCAT CAGTATCTTG ATCGCCTTCA AGGCCTTGTC GCGCCCTTTG CCGAGACCCT GGGGTTCCGG CCGGGGGCCT TTCCGGTCTA TGTGGAAAAG ATCCTGGAGC ACGGCGAAGC CCTGCGCCGC GACTGGCCGA CCGCCGGCAC CACCGGCTAT GACGCCCTGA ACGAAATCTC GACGCTGTTC GTTGCCGCCC CCGGGCTGGA AACCCTGCGC GCCCTGTGGC GGCGCGAGGT CGGCGACGAG GCGGCCGATC CGGTGCGGGT GGCGGTGAGA GCCAAGCGTC AGGTGATGGA CGAGGAACTG GCCTCCGAGC TCGAGGTGCT GACCGATCAG TGCACCCGTC TGCTCAAGCG CGATCCGCAG ACCCGCGATT TCAGCCGCGC CGGCATCAAT CGGGCGCTGC GCGAGATCGT CGCCCAGTTC CCGGTCTATC GCAGCTATAT CGGGCCCAAG GGGGCGACGC CCGAGGACCG CGCGGTGATC GCCACCGCCA TCCGCCGGGC GCGGCGGGCC CGCGCCGTCA GCCATGGGGC GCTCTATGAC GTGCTGGACG AGGTGCTGAC CGGCCAATGG GGCAAGGGAG TGGGCGGACG GCCCCGGGTG GCGGTCTTGC ATCTGGCGCG CAAGGTTCAG CAATATACCG GCCCGGTGAT GGCCAAGGGC ATGGAGGACA CCACCTTTTA CCGGGTGATG CCGCTGGTTT CCTTGAACGA GGTCGGCGGT GGTCCCGGTC TCACCCCCCT GGATGGGGCG GCCTTCCATC AGGGGATGGC CGAGCGCCAG 
CGCTTTCTGC CCCGCGCCCT GGTGGCGACG GCGACCCACG ATACCAAGCG CGGCGAAGAC GTGCGGGCGC GTCTGCATGG GCTGAGCGAA TGTCCCGAGC GATGGGCGGA GCGGCTAAGC GCCTGGCGCG AGATCCTGGC CCCGCTCTGC CAAACGGTCG AAGGCGAGGT TTGGCCAAGC CCGGCCGATC AGATCTTGTT CTTGCAAACC CTGGTTGGCA TCTGGCCGGC GGGGTTGGAC GCCACGGCGC CGGTGCCGCC GACCCTGCTC GACCGTTTGC GGGCCTATAT GCGAAAAGCC GCGCGCGAGG CCAAGACCCA TACCTCGTGG ACCGATCCCG ACGAGGACTA TGAAGCGGCG CTGGAGGCTT ATGGGGTGGG GGCGCTGACC GGCGAGCCGG CGCCCAAGAT CCGCCGCGAG GTGGCCGAGC TGGTCACCCA CCTGGAGGGA CCGGGGCGGA CCACCGCGCT TGCCCAGCTT ACCCTGCGCC TGACCATTCC CGGGGTGCCC GATACCTATC AGGGGACCGA GCTTTGGGAT GATTCGCTGG TCGATCCCGA TAACCGGCGG CCGGTGGATT TCGCCCTGCG CCGGGAAAAG GCCGCCGATC TGGCCGGGGT TGGCGGGGCC GCCGTTGAAA AACTGCTGGC CGATCCGGCG GGCGCGGCGA AGATGCTGGT TCTGACCCGC CTGCTCGCCT TGCGCCGACG GCTGCCCGAT CTGTTCCTTG AAGGCGGCTA CGAGCCTTTG ACCGTCACCG GCAAGGCCGC CGGGCATGTG GTGGCCTTTC TGCGCCGTCA CGGCGAGGCC ACGCTGCTTG TCGCCGTGCC CCGGCTGACG ATGACGCTCT CCGGCGAGGG GGCTTCCGTG GCCAAGGCCT GGGGCGATAC CACCCTGGTC CTGCCCGACC GCCTGCCGCT GGAAGGCTGG ACGGATTGCC TGTCGGGAGA TCGGCTGGCC GATCTCCCCT CCTGCGCCAC GCTGTTCGCC CGGCTGCCGG TGGCGGTGCT GCTGGGCTAG
|
Protein sequence | MTADLYAPTA TYRLQFHAGF TFTDAAAQVG YLRDLGVSHL YASPILKARP GSTHGYDIID HGALNPELGG ERGFAQLSEA LAGAGLGLII DIVPNHMGIG AADNGWWLDV LEWGRGGRYA GYFDIDWFPA TPGLREKVVL PVLGDLYGRV LDAGDLVARF DDADGSFSIW YHEHRFPVCP ATYATILDLC LKEVALPEAA ALMTEARRLR GTPRSDIRRK AQRTRGETFK RSLREAAATP ALAAALASVT TLFGPAATDG KGLTRLHALL ENQHYRPSFW RIAGHEINYR RFFQINDLAG LRVEEKEVFD ASHALIGDLV GSGRVHGVRV DHIDGLLDPH QYLDRLQGLV APFAETLGFR PGAFPVYVEK ILEHGEALRR DWPTAGTTGY DALNEISTLF VAAPGLETLR ALWRREVGDE AADPVRVAVR AKRQVMDEEL ASELEVLTDQ CTRLLKRDPQ TRDFSRAGIN RALREIVAQF PVYRSYIGPK GATPEDRAVI ATAIRRARRA RAVSHGALYD VLDEVLTGQW GKGVGGRPRV AVLHLARKVQ QYTGPVMAKG MEDTTFYRVM PLVSLNEVGG GPGLTPLDGA AFHQGMAERQ RFLPRALVAT ATHDTKRGED VRARLHGLSE CPERWAERLS AWREILAPLC QTVEGEVWPS PADQILFLQT LVGIWPAGLD ATAPVPPTLL DRLRAYMRKA AREAKTHTSW TDPDEDYEAA LEAYGVGALT GEPAPKIRRE VAELVTHLEG PGRTTALAQL TLRLTIPGVP DTYQGTELWD DSLVDPDNRR PVDFALRREK AADLAGVGGA AVEKLLADPA GAAKMLVLTR LLALRRRLPD LFLEGGYEPL TVTGKAAGHV VAFLRRHGEA TLLVAVPRLT MTLSGEGASV AKAWGDTTLV LPDRLPLEGW TDCLSGDRLA DLPSCATLFA RLPVAVLLG
|
| |
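As a rough check that the gene and protein sequences correspond, the first and last codons of the gene sequence can be translated by hand. The sketch below is a minimal illustration, not the pipeline that produced this record: the `translate` helper is hypothetical, and the amino-acid string is the assignment listed for NCBI translation table 11 (bases ordered T, C, A, G), which is assumed here to be the table the record refers to. Only short excerpts of the gene sequence are used, with the spacing removed.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (each position cycles through T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 9 nt of the gene sequence above.
first_30 = "ATGACCGCCG ATCTCTATGC GCCGACCGCC"
last_9 = "CTGGGCTAG"

print(translate(first_30))
print(translate(last_9))
```

The first excerpt should reproduce the opening block of the protein sequence (MTADLYAPTA), and the last excerpt should end in a stop codon after the final two residues.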