Gene Information |
Locus tag | AFE_2081 |
Symbol | |
ID | 7135678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 23270 |
Kingdom | Bacteria |
Replicon accession | NC_011761 |
Strand | - |
Start bp | 1831069 |
End bp | 1833978 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643530452 |
Product | alpha-amylase family protein |
Protein accession | YP_002426484 |
Protein GI | 218666940 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3280] Maltooligosyl trehalose synthase |
TIGRFAM ID | [TIGR02401] malto-oligosyltrehalose synthase |
|
|
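The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record for AFE_2081.
gene_length = span_length(1831069, 1833978)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 2910 969
```

Both results agree with the Gene Length (2910 bp) and Protein Length (969 aa) fields in the record.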
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCCTG ATCCCATTCC CCGCGCAAGC TACCGGCTCC AGTTCAATCG CCAATTCACC TTCGACGATG CGGTGGCCCT CGTGCCCTAT CTGCAGGAAC TGGGCGTGTC CCATTGCTAC GCATCCCCCT ACCTCAAGGC CCGCAGTGGC AGCCCGCACG GCTACGACAT CGTCGACCAC AACGCCCTCA ACCCCGAGAT CGGCGACGAC CGGAGCTTCG GTCGCTTTGT CACCGCCCTC GCTCGCCATG GCTTGGGCCA TATCCTGGAC TTCGTACCCA ACCACATGGG CGTGGGCGGT GACGACAACG CCTGGTGGCT GGATCTGCTG GAAAACGGCC AGGCATCCCC CTACGCGGAT TTCTTCGATA TTGACTGGCA TCCCCACGAG AAGGCGCTGC GCGGCAAGGT CCTGGTTCCC TTCCTGGGCG GCTACTACGG CGACCTGCTC GAAAACGGCG AGTTGCATCT GGCCTTCGAC GCGCAGCGAG GGGAGTTCAG CGTCTGGTAC CTGCAGCACC GCTTCCCGCT GGATCCGCGT ACCTATCCCG GCATCCTTGA ATATGGGCTG GAACGGTTGA AGGAGCGGTT GGGCGCCGAC CAGCCCAGGC TGGCCGAATA CCAAAGCCTC AGCACCGCCT TCACCCATCT CCCCGCGCGC CGAGAGACGG CGGCCGCCAA ACTGGAGGAG CGGCGCCGCG ACAAGGAGGT CCACAAGCGC CACCTGGCGG AGCTCTGCAC CGCCGAACCC CGGATCGGGG CCTTCATCGA AGAGAACGTG GCATCCCTGA ACGGCGAGCC GGGCGAGGTG GCCACCTTCG ACCGGCTCCA CGAGGTCCTG GAAGGGCAGG CCTACCGCCT CGCGTATTGG CGGGTCGCCG CGGACGAGAT CAATTATCGC CGCTTTTTCG ACATCAACGA CCTCGCCGGC CTGCGGATGG AGCGGCCCGA GGTGTTCGCC GCCACCCATC GCCTGGTGCT CCAACTGGTG GCCGAGGGCA AGCTGGACGG CCTGCGCATC GACCATGCCG ACGGTCTGTA CGACCCAGCC GGCTATTTCA CGCGGCTCCG GATGGAGATC CATGCGGCGC TGGCGGAGCG GGCCGAACCA GCGGGCGGCG GCACGGCCCC CGATTTCTAC CTCGTGGCGG AAAAGATCCT GATCGGTCCG GAACGGCTGG TGGACGGCTG GCCGGTGCAG GGGACCACCG GCTATGACTT CGCCAACGCG GTCAACGACC TCTTCGTCAA CCCGGCGGCG CAGCGGGACT TGGATCGCAT TTATGCCCGA TTCATCGGAC AACGCGGCGA TTTCGCGGAA ATGCTCTTCC AATGCAAGCA ACTGGTCATG GAGGCGCAAC TCTCCTCGGA ACTCACCATG CTCGCCGACA TGCTCGACGG CATCGCCCAA AGCGACCGGC ACACCCGCGA CTTCACCCGT AACGGCTGCC GCGGCGCCGT GGCCATGCTC GTGGCCTGTT TCCCGGTGTA CCGTAGCTAT ATCGCCGCCA ATCGGGTCTC CGAAGACGAC CGCCGCTACG TGGAAGCGGC GGTGGCACAG GCAAAAAAAC GCAGCCCGGG GGACGTGAGC ATCTTCGACT TCATCCGCCG CATCCTGTTG GAAGACACCG GCGCGCCGGA GGCCTCGCGG CGCCGGCGGG CGGCCCGCTT CGCCTTGAAA CTCCAGCAGT ACACGGCCCC GGTCATGGCC AAGGCCCAGG AGGACACGGC CTTCTACCGC TATCACCGGC TGGTGTCCCT GAACGAGGTC GGCGGTGACC CGCGACGTTT CGGGACCACT 
CCGGCCGCCT TTCATCGCGC CAACCAGGAA CGGGCCCAGA AATGGCCATA CGCCATGCTG ACGACCTCCA CCCACGACAC CAAACGCAGC GAGGACGTGC GGGCCCGCAT CGACGTCCTC TCCGAACTCA CCGACGAATG GCGGAAAAGG GTGGGACGTT GGGCGCGCTT CGCCCGCCGG CACAAGCGGA TGGTGGATGG CATTCCGGCA CCGAGCCGCA ATGACGAGTA CCTGTTCTAC CAGACCCTGC TCGGCGTCTG GCCCCTCGAA TCCCCTGGCG AGCGCGCCTT CGACGATCTC CGTGAGCGGG TCCTGCGTTA TATGCTCAAG GCAGTGAAGG AGGCCAAGAC CCACACCTCC TGGCTGAACC CCAACCTTGC TTACGAAGAG GCCCTGGATT ACTTCGTCAT GGCCATGCTC GACCGCACCG GCCGCAACCC CTTCCTCGCC GATTTTCTCC CGTTCCAGGC CCGAGTCGCC CGCTTCGGGC TGTGGAACGG CCTGTCCCAG CAGCTATTGA AGCTTACGGC GCCGGGCGTA CCCGACATCT ACCAGGGGAC CGAAGTGTGG GACTTCAGCC TCGTGGACCC GGACAACCGC CGCCCCGTGG ACTACCGGCG CCGGCGGGAC CTGCTGCACC GGCTGAGGGC TGTCGCCCGC GGGCGGGACG GAGCGCGCCT CGCGCAGGAT CTGATGGAGC ATCCGGAGGA CGGTCGCGCC AAGCTCTTTG TCACTTGGCG GGCACTGGAG GCGCGGCACC GATTCCCCGA GGTCTTTGCC GGCGGCGCCT ACCTACCCCT TGCCGTCACG GGTGCCCAGG CCGACCACGC AGTGGCCTTT GCACGGCAGG CGAACGGGCG CACCGTGATC ACCATCGCCA CCCGCTGGTT CGCCACCCTC CTCGGCGACG AAAAACGCCT GCCGGTAGGT GAGGCGGTAT GGACCGATAC CGGGATCGAG CTCCCCAACC CGGCCGCGGC ATGGGAAAAC CTGTTCACGG GCGAGACGGT GCGCCCCGAT GCCGCCGGAG ACAAGCCTCG CCTTTCTCTT GCCCGGACGT TGGCTTGCTT TCCCGTCGCC CTGCTGGTTC CGGCGGAGGT CGGCTCGTGA
|
Protein sequence | MAPDPIPRAS YRLQFNRQFT FDDAVALVPY LQELGVSHCY ASPYLKARSG SPHGYDIVDH NALNPEIGDD RSFGRFVTAL ARHGLGHILD FVPNHMGVGG DDNAWWLDLL ENGQASPYAD FFDIDWHPHE KALRGKVLVP FLGGYYGDLL ENGELHLAFD AQRGEFSVWY LQHRFPLDPR TYPGILEYGL ERLKERLGAD QPRLAEYQSL STAFTHLPAR RETAAAKLEE RRRDKEVHKR HLAELCTAEP RIGAFIEENV ASLNGEPGEV ATFDRLHEVL EGQAYRLAYW RVAADEINYR RFFDINDLAG LRMERPEVFA ATHRLVLQLV AEGKLDGLRI DHADGLYDPA GYFTRLRMEI HAALAERAEP AGGGTAPDFY LVAEKILIGP ERLVDGWPVQ GTTGYDFANA VNDLFVNPAA QRDLDRIYAR FIGQRGDFAE MLFQCKQLVM EAQLSSELTM LADMLDGIAQ SDRHTRDFTR NGCRGAVAML VACFPVYRSY IAANRVSEDD RRYVEAAVAQ AKKRSPGDVS IFDFIRRILL EDTGAPEASR RRRAARFALK LQQYTAPVMA KAQEDTAFYR YHRLVSLNEV GGDPRRFGTT PAAFHRANQE RAQKWPYAML TTSTHDTKRS EDVRARIDVL SELTDEWRKR VGRWARFARR HKRMVDGIPA PSRNDEYLFY QTLLGVWPLE SPGERAFDDL RERVLRYMLK AVKEAKTHTS WLNPNLAYEE ALDYFVMAML DRTGRNPFLA DFLPFQARVA RFGLWNGLSQ QLLKLTAPGV PDIYQGTEVW DFSLVDPDNR RPVDYRRRRD LLHRLRAVAR GRDGARLAQD LMEHPEDGRA KLFVTWRALE ARHRFPEVFA GGAYLPLAVT GAQADHAVAF ARQANGRTVI TIATRWFATL LGDEKRLPVG EAVWTDTGIE LPNPAAAWEN LFTGETVRPD AAGDKPRLSL ARTLACFPVA LLVPAEVGS
|
| |
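The relation between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (the record's sequence starts with ATG although the gene lies on the minus strand) and uses the amino-acid assignments of the standard genetic code, which translation table 11 shares for sense codons. The helper name translate_cds is hypothetical. Alternative start codons and ambiguous bases are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace such as the 10-base grouping used in the record is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in the record.
print(translate_cds("ATGGCCCCTG ATCCCATTCC CCGCGCAAGC"))  # MAPDPIPRAS
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence, with the line break and spaces left in, should work the same way.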