Gene Information |
Locus tag | Pars_1866 |
Symbol | |
ID | 5055880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1668938 |
End bp | 1671940 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469412 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001154069 |
Protein GI | 145592067 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1449] Alpha-amylase/alpha-mannosidase |
TIGRFAM ID | |
|
|
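The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 1668938
end_bp = 1671940


def gene_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3003 1000
```

Both values agree with the Gene Length and Protein Length fields of the record.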
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCGG TCCTAGCGCT TCTACTAGCC GCAGTGGCTA TCCTGGCAGG GCCTGTTAAT GTCGTGTTTA TCTTGCACAA CCACCAGCCT TGGTACGTCG ATTTTGAGAA AAATGAACTC ATACTCCCAT GGGTCAGGAT GCACGCGGTT GGCAACTATT TGAAAGTCCC CCTCTTGATT AACGAAAGCG GAGTCCCTGT GGCCTTCACA CTCTCAGGAA GCTTAATCGA ACAGCTCAAC TGGTACGCCA ACGGGACCTA CATAGACGTG AGGTATCGCA TATCGGAGAA AATCGCCCGG GGCGAGCCGC TGACGGCTGA GGAAAAGTAC GCAATGCTGG CAATACCAGG CGGTTTCTTC GATATTAACT GGCAGAACAT ACTGTACAAA CACCCGCGGT ATGCCGTCCT CCTGGGTCTA AGGAACGATG CATTTAGCAA ATGCCCTCCT GGCAACGTTA CGTGTGTAAT ATCTAGGTTC TCCGACCAAG ACTTTGTCGA CCTAGCCACC TTGTTTAATC TACTCTGGAT TGACCCCTAC ATCGCGAAGA AGTATTCCGA CATCTGGACA CTGGTAAACA AGACGTCGTA TACCCGCGAC GATTTGAAAA AGGTTTTGGA CTTACACAGA GAGCTTGTAG GCAAGGTCTT GTCTCTCTAC AATGAGTTGG CACGACAAGG CAAAATAGAG CTCGTCCCAG TCCCCTACTC TCACCCCCTC ATGCCGTTGC TGGCGGACAT GGGCGCTCTG GAAGACCTAA GACTCCACAT ATCCCTCTCT GAGGGGCTGT TCAAGAGATA TTTGGGAACC ACGCCGACAG GTGTGTGGCC ACCCGAGCAA GCCGTAAATG ACGAAGTGCT AAGGCTCTTC GCCGAGGCTG GCTACTTGTG GACGGTCACA GACGAAGATG TCCTAAAAAC CACGGCCCCT GGCTTCAGTC ATTACAGCCT TTACTATGCT GACTACGGCG GGCGCCGGCT CTACGTCTTT TTCAGAGACA AGACGTTGTC AGACGCGCTG GGCTTCAGAT ACTCGTCTAT GTCCCCCCAG GTCGCCTTAG CGGATTTTGT AAACTACCTG AAGAAAGTCC CCCAGGGCGA GTGCAACGTG GTAGTGGTGG CGCTTGACGG AGAAAACCCA TGGGAGAACT ACCCCAACTT CGGCGACGAC TTTCTGAAGA CGTTCTTCCA AGGCCTGGCG GAGCTAGAGA AAAACGGTAC TGTGAAAATA TGGAAGCCTA CAGATTTTGT GAACCACTGC AAGGACAAGG CCACGCCTCT CCCCCAGCGC CAGTTTAGAT ACTTCGACTT GAAGGTAGAC ATATCGATTT ACAAATCTAT AAGAGATCTC CCCACCCGCC CAGTGGCTGG CAAAATAGCG GAAGGCTCTT GGTCAGGCGG AGGAAGCCTT GCAGTGTGGA TAGGAGACCC AGACGAAAAC GTCTGGTGGA TGTGGCTGAA AAAGGCAAGA GAAGACGTCG GCATAAACCG GACGTGGGAT GTGCTCTTCC CACTACTCGT TGCAGAAGCG TCGGATTGGC CGTTTTGGTA CGGCAACGAC ATGGGCTCGC CGCAGACCTT CGACCCAGTG GCTAAGTCCG CGCTTAGGTC ATACTACACG CGCGCTGGTC TGCAACCCCC ACAGTATCTC CAGACATCTG CATACCCCGC CGGTACTCCT CGCGAGGATA AGATAGTGGG CAGGGGAGAG GGCAAAATCG CCATGTACGG CGGAGCTGTA ATATACGCAA ATACAACCCA TATATGGATC GAGGGAGGGC CCTGCGGAGT TGTCTACATA 
TCTAACCCAG ACGTGGCTAG GTCGCCCTAC ATGTTCAGAG GCGCGGCCAA GGGGCTCGGC GGGGAGGCTC TTGACGTATA CGCCGACATG GCCATAGACA CCTGCAAAGG ACTTGTGTAC CTCTCCGACA GCGGCAGGTT TTACCCAGTA GGCAACACGG CGGCGCTGGG CTTTATAGGA GCAAGGCCGG GGGGACTGCT TTACGTAGAG TTTAGAGGCC TTGTATACGC GCTGAGAGTG CCGGAGAGCT ACGCAGCCCA GAACCTCCTA TTAAGGGCAG TAGATCCTCT AGGCGACGAT TTCGGGCCTG GGAAGTACCA ATATCCCAAA AACCCAGCCT TCAAGCCGGG AGTGTTCGAC CTAACCCTGT TCGAGCTATA CGACTTGGGG GACAAGTTCA GGTTCCAGTT CAGGGTTAAA GAGCTAGGCG GCAATCCGTG GGGCGGGCCA GCCGGCTTCT CCCTGCAGTT CTTCCACGTG TACATCAATA GAGGAGGCGG CGTCCGGAAC GACACGCTTG GCTTGAGGGT TTCCCTCTGC AAAGAAGCGT CATGGGACGT CGCATTGTTA ATTGGGCCGG GCTGGACCGG CGGTAATAGG ATAGTCTACG CTGACGGCGT CTACGTCGAC GACGCCATGT CTATAAAGCT GGGCACCAAC AATACCATAA TTGCCGATGT GCCCAAGAAA TACCTCGGAA ACTACGACAA TAAGTGGAAA ATAACCGTGT TCCTAACTTC GTGGGACGGC TACGGCCCAG ACAACATTAG AAACTTTGGA GTAATGGCCG ACGAGTGGAC TGTAGGAGGC GCCGATCCCG TCGCTGTGTT GGCAGGCGTT GCGCCGCGGG TCTTTGACGT CTTGGCGGAA ACTGCAGAGA TGCAAGTAAA AGCCCTCACT TCCTACAAAG TCGTCAGACT TCCCAACGGC ACATATATTG GAGCACCCGC CGTTGTGTGC GCGTATCTCA CAGGTTCGGC GGAGATGTGC ACCGCCACCG TCACCCAGTT CGTCACAATC ACGGCGACAG CCACAGTAAC TCGTACATTC ACCGAGACGT ACACCACTGT GTCCACTCAA GTAGCCACCA CGGTGACCTC TACTGAGAGG GTCAAGGAAA TCGATTGGCC CACCACAGCG GCGCTGACAG TAGCCGCCCT CGTAGCCGGA CTCCTCGCCG GCATTTTAAC AAGACGAAAA TAA
|
Protein sequence | MRAVLALLLA AVAILAGPVN VVFILHNHQP WYVDFEKNEL ILPWVRMHAV GNYLKVPLLI NESGVPVAFT LSGSLIEQLN WYANGTYIDV RYRISEKIAR GEPLTAEEKY AMLAIPGGFF DINWQNILYK HPRYAVLLGL RNDAFSKCPP GNVTCVISRF SDQDFVDLAT LFNLLWIDPY IAKKYSDIWT LVNKTSYTRD DLKKVLDLHR ELVGKVLSLY NELARQGKIE LVPVPYSHPL MPLLADMGAL EDLRLHISLS EGLFKRYLGT TPTGVWPPEQ AVNDEVLRLF AEAGYLWTVT DEDVLKTTAP GFSHYSLYYA DYGGRRLYVF FRDKTLSDAL GFRYSSMSPQ VALADFVNYL KKVPQGECNV VVVALDGENP WENYPNFGDD FLKTFFQGLA ELEKNGTVKI WKPTDFVNHC KDKATPLPQR QFRYFDLKVD ISIYKSIRDL PTRPVAGKIA EGSWSGGGSL AVWIGDPDEN VWWMWLKKAR EDVGINRTWD VLFPLLVAEA SDWPFWYGND MGSPQTFDPV AKSALRSYYT RAGLQPPQYL QTSAYPAGTP REDKIVGRGE GKIAMYGGAV IYANTTHIWI EGGPCGVVYI SNPDVARSPY MFRGAAKGLG GEALDVYADM AIDTCKGLVY LSDSGRFYPV GNTAALGFIG ARPGGLLYVE FRGLVYALRV PESYAAQNLL LRAVDPLGDD FGPGKYQYPK NPAFKPGVFD LTLFELYDLG DKFRFQFRVK ELGGNPWGGP AGFSLQFFHV YINRGGGVRN DTLGLRVSLC KEASWDVALL IGPGWTGGNR IVYADGVYVD DAMSIKLGTN NTIIADVPKK YLGNYDNKWK ITVFLTSWDG YGPDNIRNFG VMADEWTVGG ADPVAVLAGV APRVFDVLAE TAEMQVKALT SYKVVRLPNG TYIGAPAVVC AYLTGSAEMC TATVTQFVTI TATATVTRTF TETYTTVSTQ VATTVTSTER VKEIDWPTTA ALTVAALVAG LLAGILTRRK
|
| |
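As a spot check that the gene sequence and protein sequence above correspond, the following minimal sketch translates the first three 10-base blocks and the last four codons of the gene sequence. It assumes the gene sequence is shown in coding orientation (it starts with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper name `translate` is illustrative.

```python
# Standard codon-to-amino-acid assignments (shared by translation table 11),
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the spaces between 10-base blocks
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and final four codons, copied from the gene sequence above.
print(translate("ATGAGAGCGG TCCTAGCGCT TCTACTAGCC"))  # MRAVLALLLA
print(translate("AGACGAAAAT AA"))                      # RRK
```

The two outputs match the first ten and last three residues of the protein sequence.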