Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2149 |
Symbol | |
ID | 5056034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1925371 |
End bp | 1926969 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640469701 |
Product | alpha amylase, catalytic region |
Protein accession | YP_001154347 |
Protein GI | 145592345 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTTGCT CTGTGGTGAA GTGGCGGAGC GACCCCTTCT ACGGCCGCGT GGCGGTCGTC AAGGCGGGTA ACGGATACGT CGTGGGAGAC TTCACCGGCT GGATACACGG GGCCTTTAAA GAGGTGGTGG AGCTACCGCC CGGCAGATAC GCCGTTGCCT CCAGCGACGG CGCCGAGGAG TGCCTAGTAG AGCCGCCGGA GTATCCATGG CACTTCTCCG TGCCCTATAT GGGGGTGGAC TGGGGGGACG TGGCCGAGAT TAGGATATAC GCCCCGGAGC CCCCCGAGGT AAGCGGGGGC CGTGTCGTGA AGTTATTGGA GGGGGAGCCC TTTTCAATAT ACGCGGCGGT TATAAAAAGT AGAAGGTATG AAGTGAGGTG TTGCGGCAAG GTGAAGAGGT ATAGACGGCC TCCTCTAGTT GAGGGGCATG GCATCTACGC CATGTACGAG GTCCTACCCG ATAGAGCCGC CAATAGAACG GGGTGTAGGG ATCTTAGGCG CCAGTTCTGC GGCGGGACTT TAAAGGACGT TGCCGAGATT GCCTTGTCCG CTTCTGAATT TGCAGATGCG CTGTATCTGC ATCCAATATA CCCCGCAATG AGCTACCACA GATACGACGT GGTGAACCAC TTGGATGTGG ACGAGAGGCT TGGGGGGTGG GCTGCGTTCG CCGCACTTAA GGACGCACTC AACGGGCGGG GGATGAAGCT TGTTCTTGAC TTGGTACTAT ACCACGTTGG CCTCCGCAAC CCGCTCTTCC CCAACGGTCC CTTCATCATA AGAGACCAAT CCTTCACAAC GCTTGTCAAG TCTCTGGCTG ACATTATGCC TAGGAACGCC TTGACGGGAC TCCTGCTTGG AAAACCCCCG TATGATACGT TTCTAAAAGT TTGGCTTATG CCTCGGCTAG ACTACTCAGA CCGCCGTGCG GTCCAATACG CGAGGAGCGT TGTGGAGTTC TGGACGCCTA AAGTCGACGG GTTTAGGCTT GACGTTGCCC ACGGCATGCC CCCATCTGCG TGGGACGAGA TACTAGAACC GGCGCGGCAT CGGTACATTC TAGGAGAACA TGTGGGCAAC CCCGCTCCAT TTTACAAGTC AATTAAGGGC TTCACTGCCT ATATCTTATA TGGAGAATTG GTGAAGTCCG GTTCTTTTTC CACAATTTCG GAGGCGATTA ATAGGTACCT TGCACTGACG CCGCCGGGCG CCTTGCCTTA TATGAACACG TTTATTGAAA ACCACGACAC TGATAGGGCA GTCACTACTA TGGGGGGCTT GGTGACTGTG GGATATGCGG TGATATTCAC GCTACCTGGG GTCCCCTCCG TGTACGCAGG TGGCGAGTGC GGCGTAGGTG GTAGGGCCAG TGACCACACT AACCGGGCCC CTTATAAGCC ATGTCCCGGG TCTCCCATTG CCGACACGCT CCGTGCGCTG TACTCGGCGA GAAGAGAGTT TGGCCTATGG CGTGGGCCTG CGTGGGCAGA GCAGAAAAGA GGGCGTATTA TCATAAACAG GCCGGGCACT AGAGCGGAGA TAGACTTAAA TAAAATTGCC ATACTCGGCG CGGGGCGACA GCAAGAGATT CCTTTATAA
|
Protein sequence | MACSVVKWRS DPFYGRVAVV KAGNGYVVGD FTGWIHGAFK EVVELPPGRY AVASSDGAEE CLVEPPEYPW HFSVPYMGVD WGDVAEIRIY APEPPEVSGG RVVKLLEGEP FSIYAAVIKS RRYEVRCCGK VKRYRRPPLV EGHGIYAMYE VLPDRAANRT GCRDLRRQFC GGTLKDVAEI ALSASEFADA LYLHPIYPAM SYHRYDVVNH LDVDERLGGW AAFAALKDAL NGRGMKLVLD LVLYHVGLRN PLFPNGPFII RDQSFTTLVK SLADIMPRNA LTGLLLGKPP YDTFLKVWLM PRLDYSDRRA VQYARSVVEF WTPKVDGFRL DVAHGMPPSA WDEILEPARH RYILGEHVGN PAPFYKSIKG FTAYILYGEL VKSGSFSTIS EAINRYLALT PPGALPYMNT FIENHDTDRA VTTMGGLVTV GYAVIFTLPG VPSVYAGGEC GVGGRASDHT NRAPYKPCPG SPIADTLRAL YSARREFGLW RGPAWAEQKR GRIIINRPGT RAEIDLNKIA ILGAGRQQEI PL
|
| |