Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0984 |
Symbol | |
ID | 5055465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 875815 |
End bp | 876969 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468540 |
Product | metallophosphoesterase |
Protein accession | YP_001153216 |
Protein GI | 145591214 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000094413 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATTC TACATATGTC AGATGCCCAC TTGGGCAGAG CGCAGTACGG CCTCCCCGAG AGGGAGGAGG ATTACTACAA GGCGTTCCGC GAGGGCCTTA TGAGGGGGAA AACGGCAGAC GCCGTTTTAA TAACCGGCGA CGTTTTCGAC TCCAGGAGGC CCTCCACCAG GGCTATGCTA CGCTTTGTCG AGGCTGTGGA GGGGGTAATG CTACCCATCT ACGTCATTGG AGGGAATCAC GACTTCAGCT ACGTGAGGTA CAGGGCAGAG GCGGAGCAGT GCGGGGGGAG TTGCGCCGGC GATACGGTGT TGAGACTTCT AGACCGCGTA AAACTTGCAA AGCTCCTCTG CTGGAGCCCC GCCGATCTCG GCGGCTTGGT CCTCTTCGGC GCCTGCGCTA CGCCGAGGGA TTACGCGGCG GAGTACCGAA AGCTACTGCA GAAGGCCCCT CCAGGCGCTT TGTTGGCAAT TCACCAAGCT GTTGAAGGCG TGCAGGCGAG GTACCCTGCA GAGGAAGATG ACTACACTAT GCCTCAGTCA GTCTTCCAAG GGCTTTCCTA CATCCATATA GCGGCTGGTC ACATACATGA CCACTTCGCT AAGCACCCCA TTGGCGCCGT GTGGGCCGGG TCAATAGAGG TGTGGGACTC AGGCGAGTTC GAGACCTGGG AATACGCGGG GAGGTTTAAT AAGGTCCAAG AAAGGGCAAC TAAGGGCGTA GTCTTGATAG ACGCGGCGGG GAGGGGGGTC TCGGTCAAGT CTATACCTCT TCCGGATTCG AGGCCGTTGT ATAGGCTGAG AATCTACGTC AGAGAGGGCA AGGAGCTGGC CGGCGCGGTA GAAGAGGCGG CAAGGCTCTT CGACAAGCCC GGGGCAGTTG TTAGGCTCGA GATCATGGGC ACCGCCGAGG AGGGGGTAAA AACAAGGCAA TTGGCAGCAG CGTTTACAAA GGCGCTTTAT GTCGACGTAG TGGACCGCAC ATCAGCCCCC CAGAGGGCCG TGGCACTGCG CGGATCTGCC TTTGAGGAGC TGTGGCGGTT ACTTAGGGAG AAGCTGGGGG AACACGCAGA GGTGGTGATA AAGGCGATGG AGCTGGTGAG GGAGGGCGAG AGGGAAGCGG CGTATAGACT AGTGCTTAAG GCGCTGTATG ATTAG
|
Protein sequence | MKILHMSDAH LGRAQYGLPE REEDYYKAFR EGLMRGKTAD AVLITGDVFD SRRPSTRAML RFVEAVEGVM LPIYVIGGNH DFSYVRYRAE AEQCGGSCAG DTVLRLLDRV KLAKLLCWSP ADLGGLVLFG ACATPRDYAA EYRKLLQKAP PGALLAIHQA VEGVQARYPA EEDDYTMPQS VFQGLSYIHI AAGHIHDHFA KHPIGAVWAG SIEVWDSGEF ETWEYAGRFN KVQERATKGV VLIDAAGRGV SVKSIPLPDS RPLYRLRIYV REGKELAGAV EEAARLFDKP GAVVRLEIMG TAEEGVKTRQ LAAAFTKALY VDVVDRTSAP QRAVALRGSA FEELWRLLRE KLGEHAEVVI KAMELVREGE REAAYRLVLK ALYD
|
| |