Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2292 |
Symbol | |
ID | 5054129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2052504 |
End bp | 2053967 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640469844 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001154488 |
Protein GI | 145592486 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0322637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTTA TAAATTGGTA TATTGTCCAC CGCGTGCAGT TTAGGAAAGC GATTAGAGAT CCGGTGCATG GGTTCATAAA GCTCACGGAG GAGGAGGTGA GGTTTATTGA CGGCGAGCCC ATCATCCAGA GGCTTAGGTA CGTTAAGCAA CTTGGCTTTG TCTACCTCGT ATACCCAACC GCCACACACA CCAGGTTTGA CCACTCCCTG GGCGTCATGC ACATAGCCAC TCAGCTCGGG CACCGGATAA TGGAGCAGAG GGGGGAGTTC GACGAGGTTT TGTTGAAGCA TCTCCGCATG GCAGCTCTGT TACACGACGT GGGCCACCTG CCTTTTTCCC ACTCTTTTGA GATCCTCACT AGAGAGCTCC TCCACATGGC AACTGTGAGG GGCTGTTTGG AGGTGGATCT GGCCTTGTTT GACAGGTCTC AGAAACCGCA CGAAGTAACC ACAAAGTTGC TTGTAGAGAA ACTGAGCGAT AGGCTCTCGG CCCTTGGGTA CAATCCCTCT CTTGTCTTAG GGCTCTTGTT TGAGCCGCCG AGTAAGTATC GTCTCTACAG TAGTATTCTA TCTGGCGTAT TCGACGCAGA TAGGCTTGAC TATATTATGC GCGACATGTA CTTTACAGGC GCGGCGGTGG GGACCAGCTT TACCCACATA GATCTTGAGC GGATAGTGGA AAATCTCGAA GTGGTGGGCG ATAGCTTCCA GTTTAACGAA AAGGCGAGGG TAAATTTGGA GGGCTATATA ATTACGCGGT ACAACCTCTA CCGCCACGTT TACCTGCACC ACAAAACTGT GCTTTTCACA GAGCTGGCCC GCGACATCTT AGCCGACAAT ATAGAGAAGT GTTCCGAGGG TGCGGGGGAT CCCGAAATTT GCCGCTACCT CTGCGAGCTG GCCCAGTTCG TGACTGGAAG TGCAGACGAG GTGAGTATCT GGAAGGCCAC TGATGACTAC TTTGTCTCAG TCTTCATGCG TGACCCTCGT TTTAGGGATC TTCTTTCGAG GAAGCCGCCT GGCTACATAC CGCTGTGGAA GAGGGAAAAG GACTTTATGG AAATTTTCCA AAACCCGGTG AGGGTAAACG AGTCTGTGGA TAGGATAGGC CCTCTGCACT GGGCTTTGAT AAATAGGCTC AAGAGGAAGT TCGTCGAGCG GCTGAACATC GAGGTAGGAG AGGTAGGCGG CTGCCGCCTA GCTCCTAACG ACGTGATTCT GTCGTATGTG AGCTTTGACC CGTACGCGGA GGATATATAC ATCACCACGG CCATGGGCCC CATCCCTATA TCGAAAATAT CGCCGCTGGT GGAGGCGGTG AACGAGGCGT GGAAAAGGGC TCCACACGTA TTTCTCTATG TAAAGAGGGA TGTTATCGAA AAATGCGGGG GTGATGTCTT GCAGAAGATC CTGACGGTAT TGGAGCCCCT GATGGAGCTC GCCGTGCGTA GGCTAGGAGC GTAG
|
Protein sequence | MLFINWYIVH RVQFRKAIRD PVHGFIKLTE EEVRFIDGEP IIQRLRYVKQ LGFVYLVYPT ATHTRFDHSL GVMHIATQLG HRIMEQRGEF DEVLLKHLRM AALLHDVGHL PFSHSFEILT RELLHMATVR GCLEVDLALF DRSQKPHEVT TKLLVEKLSD RLSALGYNPS LVLGLLFEPP SKYRLYSSIL SGVFDADRLD YIMRDMYFTG AAVGTSFTHI DLERIVENLE VVGDSFQFNE KARVNLEGYI ITRYNLYRHV YLHHKTVLFT ELARDILADN IEKCSEGAGD PEICRYLCEL AQFVTGSADE VSIWKATDDY FVSVFMRDPR FRDLLSRKPP GYIPLWKREK DFMEIFQNPV RVNESVDRIG PLHWALINRL KRKFVERLNI EVGEVGGCRL APNDVILSYV SFDPYAEDIY ITTAMGPIPI SKISPLVEAV NEAWKRAPHV FLYVKRDVIE KCGGDVLQKI LTVLEPLMEL AVRRLGA
|
| |