Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1402 |
Symbol | |
ID | 5054224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1265188 |
End bp | 1266663 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640468945 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001153614 |
Protein GI | 145591612 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.556587 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTACTGG TTGTTGTTTA TTCTATTATA TTCCTCGCTT TGTCTATTGT TGCATACAAC ACCTGGCTTC AATACAGAGG AGTGAATTTA CCGTATTCTT CGCCTATTGA GACTGGCGGA GAAACTCCGG ACTTTATAAT AGTCGTCTTG CTTGATGGTG CTTCTTCACC AGTAGTTCAA AGCTATATAC GGAGTGCAGA GACTTCTGTG CTTTTAGACA TGGGGCTGTT TTTGCCAAAT GGGCGTTCGG TTTATCCATC TTATTCAGGC CCCAGCAGGG CGTCAATATT AACAGGAGTC CCGCCTGCTG TTCATGGAGT TGTTTCTAAT GAGGGGGCTT TTAAAGTAAA AATCACTGGG CTAATCGACT TAGCTAGGGA GAAAGGGTTT AAGATAATTA ACATCGGCGA CGGTTTGATA GAGACGATAT TCGGCGTAAA GGCGGTGGCG ATAGACGAAG GCGCCGGCCA GGGGAGTTTG GCGCTGAAAA AAGCCGTAGA GGTGCTTAGG GCTAATCTTT CCAACGGCTC CAAGGTTTTT ATCTGGGTTA CGGTAAATGA TGTAGATGTA ATTGGGCATA AGGCAGGTGG TTTTTCAAAG GAGTACAACG CGACAGTTAA AAATTACCTC ATATTAATTG CCGGCTTTAT TTCAGAGATC TCAGATGTTT TAAATAGAGG CGTTGTGGTA GTGCTTAGTG ACCACGGTTT TAAAAAAGGT GGTCACCACG GCGGGGGAGA GGACACAGTT ATGAACACTT TTATGTTCAT AGCAGGTAGA GGCATTGCCC CAGGGGTGTG TTATGAAGAA TTTTTGCTAA TAGACATCGC TCCGAGTCTG GGAATACTCA CCAGCATTGG CGTACCCCCC TACTCGATGG GCAAAGCGCT TGCCACATGT CTAGGCATAA ATCCAACACC TGCAGAGATG AAGAGAAAGG AGGTGTATAA CCTATTAGGA ACGAGGGAAA CCGTGTCCAT GTTTACTGAT CAATTGTGGA TTAGGTTGTT TATAATCACG GCGTTGTTTA TACCTCTTGT GCTGGAGGTT AGGAGGATTG GGTTAAAACC TCTTACGCTG GGCGTCGCGT TTCTTGTTAT CTACATTGTA TATTACGTTT ATAGTGTTAG AGTTTACACA TTTTCAGATA TTTACTCATT TACAGAGGTA ATGACTAAAA TTATTGTGGC AGTCGTAGTT GTTTCGTTTT TAACTGGTTT ATTTGCCGCT AGATTTTACC CAACTCGTGG GGAGGTTGCT AGAGGTTTAA TAGGGGCTTA CCTATTTATC ATCACTGTAG TTTTTATCGG CGTATCTACG TTCTTAGTAC CATATGGCCC AGTTGTTGTT TTTCCAAATC CTGATTGGGA TTTCGCGGTG AGGTACTTCG CAATGTTGAT CACAGGAAGC TTTTCAGGAC TTGTCGGCAT GCCCATTGCG TTAGTTACAG CTATTTTGAT ACAGAAAAAT CGCTAA
|
Protein sequence | MLLVVVYSII FLALSIVAYN TWLQYRGVNL PYSSPIETGG ETPDFIIVVL LDGASSPVVQ SYIRSAETSV LLDMGLFLPN GRSVYPSYSG PSRASILTGV PPAVHGVVSN EGAFKVKITG LIDLAREKGF KIINIGDGLI ETIFGVKAVA IDEGAGQGSL ALKKAVEVLR ANLSNGSKVF IWVTVNDVDV IGHKAGGFSK EYNATVKNYL ILIAGFISEI SDVLNRGVVV VLSDHGFKKG GHHGGGEDTV MNTFMFIAGR GIAPGVCYEE FLLIDIAPSL GILTSIGVPP YSMGKALATC LGINPTPAEM KRKEVYNLLG TRETVSMFTD QLWIRLFIIT ALFIPLVLEV RRIGLKPLTL GVAFLVIYIV YYVYSVRVYT FSDIYSFTEV MTKIIVAVVV VSFLTGLFAA RFYPTRGEVA RGLIGAYLFI ITVVFIGVST FLVPYGPVVV FPNPDWDFAV RYFAMLITGS FSGLVGMPIA LVTAILIQKN R
|
| |