Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0734 |
Symbol | |
ID | 5055160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 650948 |
End bp | 654016 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640468292 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001152972 |
Protein GI | 145590970 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.983175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.257164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGGG CATTGTTGGT ACTAGTTCTG TTAGTTGTAG TTGTTTCGGC TCAAAATGTT GTGTTTGTGT GGCATATGCA CCAGCCCCCG TATTATATAC CGAAAGAGTC TGTCCCCACG GATACGGGCA AGGCGGTTGT TGAGGCCCCC TGGGTGAGGC TTTGGACCGC CAAGGCCTAC TACCCCATGC TCCTGCTCGT GGAGGAGACC GGGGTTAAAG TCACGTTTGA CATAACGCCT ACGTTGCTCG AGCAGATAGA GATGTATGCC CAGGGCAGGC TGACGGACAA GTATCTGCAA ATATCTTTGA AAAAAGCAGA GGACCTAACG GAGGAGGAGA AATCCTTTAT ATTAGAGCGG TTCTTCGACA TTAGCTGGGA GGTGCAAATT CCCAAATTTC CGAGATATCA ATGCCTTCTT GAAAAGAGGA ACAACGCGCC GGGAGCTTTC GACACCGCCG ACTACCGGGA TCTCCAAGTT CTGTTCAACC TGGCTTGGAT AAACGAGAAG CTCCTCAACG AGGACCCAGA CCTACGCCCA ATATATCAAA AAGCTAAGGG GAGCGACTGC GACACGCACT TCACAGACGG GGACAAGATG GCCGTGCTTG CAAAACACCT GCAGTACCCC AAGGCGTTTC TAGACAAGTT GGCGCAGCTG TACAAGAAGG GGCAAATAGA GGTGATAATG ACCCCCTACT ATTACCCCAT AGCCCCCCTC GTGGAGAACA CCAACAACGC CCTGTCCACA GACCCGGGAA TTATAACGCT ACCTAAGCCC TTCGCACATC CAGAAGACGT CTCCGCGCAG GTACAGCTGT CTCGGCACAA GTTCAAGACC TTCTTCAAGG CCGAGTTGTT GGGAATATGG CCGCCGGAAC TCGCCGTAGA TGACCAATTC ATACAAATCC TCGCCCAGAG CGGCATAAAA TACACCGTAG CCGACCAAGT AGCGCTGGAG CGCTACCTGG GGAGACAGCC GACGCAGGAG GAGCTCTACA CGCCGTGGGA GAGATACGGA GTGCTAATCT TCTTCAGAGA TAAAGAGCTG TCGGACTGGA TAGGCTTCAC AGGCTCTTCA ACTTCTAGGC AGTACGGCGA AAAATATGCC GCTGAGCAAT TCCTCTCGCT CCTCGCTGGC AAGGCTCAGG GCGGCCTCGT GGTGATAGCC CTAGACGGCG AGAACCCCTG GGAGTGGTAC CCCAACGACG GCTACGTCTT CCTCACCGAG ATATACAAAG CAGTGAAAAA TAGTACAAAA ACACTTAGAG AAGCAACAAG TTCGGCGACG CCGCGGAGAT CAGAGTCGCC GCTACCGACG TCAAGCTGGG CCGGCGGAAG CCTCTCTGTG TGGGTAGGCG AGTGGGAGGA GAACTTGGCG TGGAGGATAC TAGAAGGGGC GCGGCAAGTT GCGAAAAACA AGGCGTGGCA ACAGCTACTC TACCCAGCAG AGGCGGGCGA CTGGTTCTGG TGGTACGGCC GCGACAGGGA AAGCCCCCGA GAAGAGATCT TCGACTCACT GTTCCGCGAA ATTGTGAAAA AGTTCTACGA GAAAATAGGG CTTACATACA ACTATACTTG GCCCCTAGAC GAGCCCATAC ACTACAGAAC CGCCTACACG GTAGACTGGG CAGGTGCCCC ATTCAGAAGG CTTATCATCA GCGAAGTGGA GAAGATAAAC ATAACAGTGG AGGTTTACTC CCAGTCTGCA AACACCGCGC AACATGGGAG AGCCCCCGGA ATTAGGGTAA TCGCCCACTG GGGCCCAGTG GCCTACTGGG GCGGGCCGTG GGGAGATCTC TACTTCGCCC CAATGACATA CGCAGGAGAT GTGGGTAACA ACGACCTCTA CGTCCTTACG CCCAAGCTCG CCCCGGGGAG GTATGAGTTT ACATTCATAG CACAAGGCTC CAACGAAGTC TTCGCCACGG CGCTTGGCCA AAACTACCAA GTCGAGGTAC TGCCTAGAAC AGACGGCCGC GTCTGCGGCG TAGAGCTCAA GCGAGTGGAG GTATACGACG GCGGTCACCT CCTTGCCGTG TTCACGGGAA ACGCCTTGGC TTACGTAGGA AACGTCATTA AAGCTTACTA CGAGGTCTGC GGCGAGGGGC CAATCTACGT AGCGGCGTCG ATGGCCCTTA TCCGCCAGGA CTCAAGCGCG CCCTGGGACG AGGTCTTCGT CCTAGCCAGC CCCACGGCGG AGGGGCTCTA CGTGGCTGAG TTTAGGGTAA ACTACAGCGG CGTGTTTGAG CTGAGGGCCA AGGCCATGGG AGGCAATGTG TATTACTCGA CGCCTGTGTA CATAAGAGTA GAGGGAGGGC CCGGCCCAAG AGTTGTGGAC GGCAGAGAAG ACGACTGGGT AGGCCAGCCT CCCTTGCAGA CGCCTGGCGC CGCGGTGAGC ATGTACGAGC TAATAGTGAC AGATCCACAA GACGACCAAT ACAGGTTCTA CCGCCCCGAT TGGAGCTGGC CGCCGACAGA AGACCTAGAC GCCGTAGAGC TGAGGCTCTA CTCAGACGGC CAAAACCTCT ACGGTCTGGT AAAACTCAAG CAGCTGAGCA ACATATACGC GCCGTATGTA ATAATAGCCA TAGGCGTACC GGGCGGCGGC TTCAGCGAGT GGCTACCAGA CTGGAGCGAC ACAAGGCTCG CGTTTAAGTG GGATTACATT ATAGGCATAA ACTACGGCAA AGGCTCCCCA CTATTCCTCT TCGACCACGA TTGGTCGCCC AGGCCCACCG GTCAAATCGC CAGGTCTGGC AACGTAATAG AATTCGCAAT ACCCCTAGAC CAGCTACCCC TCCTAAAAAC AGCCGCCGAG ACTTACATAA CGGCGGTAGT CTTTGCTAAT AACTACGGAG GCGTATGGGA CCCCGGCAAG AATAACGCAT ACGACCCAGT GAGCGGCAGG TACATAACCG AGGACAACTA CGCGTCTAAC GTATATGATG TATTCGGCCA GGCACCTACC TCAGCCGAGG TCTACGGAGG CTGGGACGGC GGCGACCAAA CAGTAGATTT CTACGTGAGA GCTGGCCTAG ACAACGGGAG AATAGTAGCG GTTACCTAA
|
Protein sequence | MSRALLVLVL LVVVVSAQNV VFVWHMHQPP YYIPKESVPT DTGKAVVEAP WVRLWTAKAY YPMLLLVEET GVKVTFDITP TLLEQIEMYA QGRLTDKYLQ ISLKKAEDLT EEEKSFILER FFDISWEVQI PKFPRYQCLL EKRNNAPGAF DTADYRDLQV LFNLAWINEK LLNEDPDLRP IYQKAKGSDC DTHFTDGDKM AVLAKHLQYP KAFLDKLAQL YKKGQIEVIM TPYYYPIAPL VENTNNALST DPGIITLPKP FAHPEDVSAQ VQLSRHKFKT FFKAELLGIW PPELAVDDQF IQILAQSGIK YTVADQVALE RYLGRQPTQE ELYTPWERYG VLIFFRDKEL SDWIGFTGSS TSRQYGEKYA AEQFLSLLAG KAQGGLVVIA LDGENPWEWY PNDGYVFLTE IYKAVKNSTK TLREATSSAT PRRSESPLPT SSWAGGSLSV WVGEWEENLA WRILEGARQV AKNKAWQQLL YPAEAGDWFW WYGRDRESPR EEIFDSLFRE IVKKFYEKIG LTYNYTWPLD EPIHYRTAYT VDWAGAPFRR LIISEVEKIN ITVEVYSQSA NTAQHGRAPG IRVIAHWGPV AYWGGPWGDL YFAPMTYAGD VGNNDLYVLT PKLAPGRYEF TFIAQGSNEV FATALGQNYQ VEVLPRTDGR VCGVELKRVE VYDGGHLLAV FTGNALAYVG NVIKAYYEVC GEGPIYVAAS MALIRQDSSA PWDEVFVLAS PTAEGLYVAE FRVNYSGVFE LRAKAMGGNV YYSTPVYIRV EGGPGPRVVD GREDDWVGQP PLQTPGAAVS MYELIVTDPQ DDQYRFYRPD WSWPPTEDLD AVELRLYSDG QNLYGLVKLK QLSNIYAPYV IIAIGVPGGG FSEWLPDWSD TRLAFKWDYI IGINYGKGSP LFLFDHDWSP RPTGQIARSG NVIEFAIPLD QLPLLKTAAE TYITAVVFAN NYGGVWDPGK NNAYDPVSGR YITEDNYASN VYDVFGQAPT SAEVYGGWDG GDQTVDFYVR AGLDNGRIVA VT
|
| |