Gene Information |
Locus tag | Acry_2934 |
Symbol | |
ID | 5160292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 3198985 |
End bp | 3202053 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640554864 |
Product | alpha amylase, catalytic region |
Protein accession | YP_001236043 |
Protein GI | 148261916 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
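The coordinate, gene-length and protein-length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions are consistent with the listed values, but neither is stated by the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3198985
end_bp = 3202053
protein_length_aa = 1022

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the final codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 3069
print(remainder)                # 0
print(expected_protein_length)  # 1022
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" fields in the table.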
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.124614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACGAGC CGGCCGCGAC GACTGCGAAC CGGGCGCCCG CCCCCGTCTC GATCCGCCGC ATCGATCCCC TGGCGAGCGC GACGTTCGAC GATGTCGCGA CGCAAATCGC CGATGCGGCG GATCTGGGGT TCGATACCAT CCTGATCGGC GGGCCGTTCG CCACCGATCC GTCGCGCGAC GCCGCCTCCG AGCTCGACCC CCTGCTGATT GCGCCCGATG GCGGGCCCTC CATGCCGGCA CTCGCCCGGC TTGCGGGCGA GGCCAGGCGG CACGGGCTCG CGATGTGCCT CGAACTCCGG CTCGATCAGG TCGCGCCGGG CGCGGCGCTG GTGCGGCGCC ATCCCGGCTG GTTCACCTAC ACCGCCGACG GGCTGGCGTT CCGCTTCGGC GCGGATGGCG TTCCGGACGC CTGGTGGAAC GACCTGGTCC TCCGGCTCCA GGAACACGGT CTGGACGGCC TCTGCCTGCT CGGCGCGCAC CGCATGCCGG TGGCGGCGGC CGCGCGGCTG AACCGCGACG CGCGCGCGCG CAATCCGGGC TGGACCGCCA TCGCCGCCCC CTTCGGCGCG GCGCCGCCGC AGATCGCCGG CCTCGTGGGC GGGGAATTCG ATCACATCGC CGCATCGTCG TGCTGGTGGG ATGGCGAGGC CGCCTGGCTC GCCGACGATC TCGAACGGAT CGCCTGCGCG GGCAACGTGG CCAGCTTCGC CGTGCCGCCG CTCGCGGCGT CGCAAACCGT CGCGGACTGG CGCCTCGCCC TCGCGGGCTT TCTCGGCACG GCCTGGGTGA TGAACGCCGC CATTCCGCCC CGCCAGCGCG CCGCCGTGAC GGCGCTCAAC AGCTTGCGTC GGACCGAGCG CGAGGTGTTC GCGGGGCGGC CCTTCAGCCT CGCGCCGCCG CACGCGGCAA CGTCCCTGTT CGCCCGGTTC GGCGCCGATG GCCGCGGACT GGCGCTCGCC TGCTCCCCAA CGGGTGCCGA GCCCGATCGC CACGAACTGC ATGCGTTTCT GGCCGGCCGG GCCGCGGCGC CGGTTCCGGT CCCGCTCGAG GGATGGAACG CCGCCGGCCT TGCCTGGTTC ACGACCTCGC CGCTGCCCGC CCACCATTCA CCGGAATCCC CGCGCCATTC GGTCGATGCG CCGCGCATCG CGATCGAGGC GATTGCGCCG GCCGTCGATG GCGGGCGCTT TCCGGTGAAG CGGATGGCCG GTGAAGACGT GACGGTCGAG GCCGATCTCC TTTGCGACGG CCACGGCCGC CTCGCGGCCG ACCTGCTCTG GCGGATGTCC GGCGAGCCAG CCTGGCGGCG CGCGCCGATG GCGGCCATCG GCAACGATCG CTGGCGCGCC CAATTCGGGC TGAGCGCGAT CGGCCGCGCC GAATTCACCA TCGAGGCATG GGTGGATCGC TACGGCACCT TCGCCGACGA GCTCGCGAAG AAACACGAGG CCGGCCTCGA TGTCCGGCTT GAGCTTGTTG AAGGCCGGCA GTTGCTGGAC GCCGCGATCG CGCGGGCCGG CGAAGAGGCC AGAACCATTC TGGCCGAATG CCGAGCGGGA CTCGAACGCT GCGCTCCCGG AGAGCAGGTC GGAAAACTGA CCGCGCCCGC CATCATCGCC GCGATGCGCG ATGCCGAGGA GCGTCGCTTC CTGGCACGGC ACGAACCCGC CTTGCCGATC GATTCCGAGC GTCGCGCCGC GGGTTTCGCG AGCTGGTACG AAATCTTCCC CCGCAGCCAG AGCGGCGACG CCGATCGCCA TGGCACCTTC GACGACGTGA TCGCCGCATT GCCGCGAATC 
CGCGCCATGG GATTCGATGT CCTGTATTTC CCGCCGATCC ATCCCATCGG CCGGAAGAAC CGCAAGGGGC CGAACAACAC GCTCGTCGCC AATCCCGGCG ATCCGGGAAG CCCCTACGCC ATCGGCGGCG CCGAGGGTGG GCATGACGCG ATCCATCCCG AGCTCGGCTC GTTCGACGAT TTTCGCCGCC TGATCTCTGC GGCGGCCGGG CAGGGCCTCG AAATCGCGCT GGACTTCGCC ATCCAGTGCG CTCCGGACCA TCCATGGCTG GCCGAGCACC CGGAATGGTT CGACTGGCGG CCGGACGGCT CGATCAAGTA TGCCGAAAAT CCGCCGAAGA AATACCAGGA TATCGTGAAT GTCGATTTCT ACGCCGATGG CGCGCAACCC GCGCTCTGGC ATGCCTTGCG CGATATCGTC CTCTTCTGGA TCCGCGAGGG GATACGGATC TTTCGCGTCG ACAATCCGCA TACCAAGCCG TTCCCGTTCT GGGAGTGGCT GATCGCCGAT ATCCGGCGGA ACCATCCCGA CGTGATCTTT CTCTCCGAGG CCTTTACCCG GCCGAAGATC ATGCACCGCC TGGCGAAGAT CGGATTCTCG CAATCCTACA CCTATTTCAC CTGGAGAAAT TCCGCTGACG AAATGAAAAA TTACCTCGTC GAACTGACGC AGTCATCGAC ACGGGACTAT TTCCGTCCCC ATTTCTTCGT CAACACGCCG GACATAAATC CCGCGTTCCT GCAGACATGC AGCCGCCCCG GCTTCCTGAT CCGCGCGGCC CTCGCCGCGA CGCTGTCCGG CCTCTGGGGC GTCTATAACG GCTTCGAGCT CTGCGAAGCC GCCGCGCCGC CAGGGACGGA GGAATATCTC GATTCGGAAA AATACCAGAT CAGAAGCTGG GATTATGGGC GGTCCGGAAA TATCGTGGCG GAAATTTCCC TGCTCAACCG GATCCGCCGG CAGAACCCGG CCCTGCAAAG CCACCTCGGC GTGCACTTCA TCGATTGCGA TAATGACAAC GTCCTGTGCT TCGAAAAGGC CAGTCCCGAT CGCTCGAACA TCGTGCTGGT CGCAATCAGT TTCGATCCCC GCCAGTGGCA GACAACCAGG ATCGACCTCC CTCTCGAACG CTACGGGTCT TCCGGGCGGG ATGCCCTGCA TCTCGAAGAT CTCATGCGCG GCGTCGCATT CACCTGGCAC GGGCGGCAGC AGGTCGTCGA TTTCGACCCG GGCGAACTAC CCTTCAGCAT CTGGAGGATT CGCAAATGA
Protein sequence | MNEPAATTAN RAPAPVSIRR IDPLASATFD DVATQIADAA DLGFDTILIG GPFATDPSRD AASELDPLLI APDGGPSMPA LARLAGEARR HGLAMCLELR LDQVAPGAAL VRRHPGWFTY TADGLAFRFG ADGVPDAWWN DLVLRLQEHG LDGLCLLGAH RMPVAAAARL NRDARARNPG WTAIAAPFGA APPQIAGLVG GEFDHIAASS CWWDGEAAWL ADDLERIACA GNVASFAVPP LAASQTVADW RLALAGFLGT AWVMNAAIPP RQRAAVTALN SLRRTEREVF AGRPFSLAPP HAATSLFARF GADGRGLALA CSPTGAEPDR HELHAFLAGR AAAPVPVPLE GWNAAGLAWF TTSPLPAHHS PESPRHSVDA PRIAIEAIAP AVDGGRFPVK RMAGEDVTVE ADLLCDGHGR LAADLLWRMS GEPAWRRAPM AAIGNDRWRA QFGLSAIGRA EFTIEAWVDR YGTFADELAK KHEAGLDVRL ELVEGRQLLD AAIARAGEEA RTILAECRAG LERCAPGEQV GKLTAPAIIA AMRDAEERRF LARHEPALPI DSERRAAGFA SWYEIFPRSQ SGDADRHGTF DDVIAALPRI RAMGFDVLYF PPIHPIGRKN RKGPNNTLVA NPGDPGSPYA IGGAEGGHDA IHPELGSFDD FRRLISAAAG QGLEIALDFA IQCAPDHPWL AEHPEWFDWR PDGSIKYAEN PPKKYQDIVN VDFYADGAQP ALWHALRDIV LFWIREGIRI FRVDNPHTKP FPFWEWLIAD IRRNHPDVIF LSEAFTRPKI MHRLAKIGFS QSYTYFTWRN SADEMKNYLV ELTQSSTRDY FRPHFFVNTP DINPAFLQTC SRPGFLIRAA LAATLSGLWG VYNGFELCEA AAPPGTEEYL DSEKYQIRSW DYGRSGNIVA EISLLNRIRR QNPALQSHLG VHFIDCDNDN VLCFEKASPD RSNIVLVAIS FDPRQWQTTR IDLPLERYGS SGRDALHLED LMRGVAFTWH GRQQVVDFDP GELPFSIWRI RK
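The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below is not part of the original record. It shows one way to strip that grouping, compute a GC percentage, and translate the nucleotide sequence so the result can be compared with the listed protein. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for all 64 codons (the two tables differ only in permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in T, C, A, G order (NCBI layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final nine bases of the gene sequence in the record.
first_blocks = "ATGAACGAGC CGGCCGCGAC GACTGCGAAC"
last_codons = "CGCAAATGA"

print(translate(first_blocks))  # MNEPAATTAN
print(translate(last_codons))   # RK
print(gc_percent("ATGC"))       # 50.0
```

The two translated fragments match the start (`MNEPAATTAN`) and end (`RK`) of the listed protein sequence. Passing the full gene sequence to `gc_percent` would allow a comparison with the "GC content" field, although how the record rounds that value is not stated.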