Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1166 |
Symbol | |
ID | 5054883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1052138 |
End bp | 1054063 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640468716 |
Product | metallophosphoesterase |
Protein accession | YP_001153389 |
Protein GI | 145591387 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0143203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TTTTTTCACT CTCCATCCTC CTACTTGTCG CTCTCCTCTC CGCCGCCTCA ATCGTCGACC CGAGGTGGGG CACGCCGGCG TACGTCACGC CGGGAGGCGC CTTTAACATA ACCCTTGATG CAGCCGTAGC AATTAAGAGC GTCGCCCTCG CCGCGCCGGG GCTGGAGAAG CCCATCGAGC TGAACTACGC TACAAGCGGC AACGTAATCA CGGCGAAGGT GCCTCCCGGT ACGCCGCTGG GGGTGTACGA CTTGGTCATA AACGAGGGCG AGCTCTACGA GCCTAAGGCT GTGTGGGTTG GCAACGTCAC GGGGCCGCTT AGGATAATCC AGCTCACCGA CATACACGTG GGGGTTGAGC TGGACATGGC TTCTCTCTAC CGCCTCATAC ACGGCGCGCT TGTAGCCAGC GGCGGGCCGT ACGACGTGGT GTTCTTCACC GGAGACCTCG CAGACGTGGG GGGCCAGGTG TGGCACTACA CGCTGTTCAC TAAATACGCC GCTACTATAC CAAAACCCGT CTTCGCAATA CCGGGAAACC ACGACCACGC CGGCGACGAC CCCTTGACGA ACTACCAGAA GTTCGTCGGG AGGCCCGTGT GGTACAGGGT GATAGGCCCA TACCTAATAA TAGGGCTCGA CAGCGGCTTC GACGGCTACC TTAGAGACGA CCAGGTTAAG ATGTACGAGG AGGTGCTGAA GAGGTACCCC GACAAGGTTA AGATTGTCCT CATCCACCAC CCGCCCTTCT ACTTGCAAGA CAGTTACGTC GTTGAGACGT ACAGGGGGCC CCAAGACGTG GATCGGCTGA ACAAGGACCC CACTGGGAGG AGGAGCTACT ACATCGTATA CACCAGCTAC CTCCAGAACA GGCCCGCTTT CGAGAAGTTC CTTGACCTCA CCATTAGGTA CAAGGTGGCT CTGGTAATGG CCGGCCACGT CCACAGCGGC AACTCCACCA TTGTGATAAA CGGGACCTAC TTCGTCACCA CGAGGACGCT GGGCGGCTCC ATTGACACCT CCCACGGCTT TAGGACATAC GTGGTGTACC CAGACGGCCG CGTCGAGGTT GACAAGGAGT TGTGGACTTT CAAAAACTTC ACAGTCCTGA CCTGGGGCAC TAAGGCGGCT CAGATATACG CCGAGAGCTC TCTACTGCCG CCGACTATCA CCATCGACCT CCCGGGGGAG TTCGCCGGAC TGAAGGTGTT CAACGGCACT GCCGAGCTGG TAAAGGCGGA GAGGCACCCC CTGGGCAAAT ACACCAGGTA CACCCTGAAG ATTGCCGGGA AGAGGCTGTG GCTCGCGTTG GGCGACTACC AGCCCTCCCC CACAGTGGAG GTGGTTAGGA TACTCCCCAG GTCTCCCACT CCCGGCGACG TGGTCACAGT GACCATTAGG GCAACCGACC CCAACGTAGG CGTGCCCTTC ATCACCGTAA ACGGCCAGAG GATACTCGCG TCGTACCTAG AAGAGCAACC GGTCTACATC TACAAGTTTA AGTACACCGC CCCCACCACG TTGCAAGCAG CGGCGCCTAG TGGCAAGCCC CTAAGCCTCC AGATAGGCCA GCCCACAACT ACCACGCCGC CCACACCAAC CCCACCGCCT ACAACCACCA CAACACCCCA AACAACGACT GTGACGACGA CGCCTTCAGC CACCCCGACT GCCACCCCCA CCACGCAGAC CGCCACATCC CCCACGGCGA CGCCCAGCGC CACGGCACCC CCCGCGACTA CTCAGCCAAC TCCGGCGCCG ACAGCTGCCG CTACCGCGAC ACCTGCGCCT TCCTACCAGC CCTCGGCCTT CCCAGTAGAG GCAGTTATTT TGTTGGTAAT CGCCGTCGCC GGCGCCGCGG TTATCGCAGT GACAGCGAGG AAAAAGCCGA CGTCTGAAGA GGCCAAGACT AGATAG
|
Protein sequence | MKKLFSLSIL LLVALLSAAS IVDPRWGTPA YVTPGGAFNI TLDAAVAIKS VALAAPGLEK PIELNYATSG NVITAKVPPG TPLGVYDLVI NEGELYEPKA VWVGNVTGPL RIIQLTDIHV GVELDMASLY RLIHGALVAS GGPYDVVFFT GDLADVGGQV WHYTLFTKYA ATIPKPVFAI PGNHDHAGDD PLTNYQKFVG RPVWYRVIGP YLIIGLDSGF DGYLRDDQVK MYEEVLKRYP DKVKIVLIHH PPFYLQDSYV VETYRGPQDV DRLNKDPTGR RSYYIVYTSY LQNRPAFEKF LDLTIRYKVA LVMAGHVHSG NSTIVINGTY FVTTRTLGGS IDTSHGFRTY VVYPDGRVEV DKELWTFKNF TVLTWGTKAA QIYAESSLLP PTITIDLPGE FAGLKVFNGT AELVKAERHP LGKYTRYTLK IAGKRLWLAL GDYQPSPTVE VVRILPRSPT PGDVVTVTIR ATDPNVGVPF ITVNGQRILA SYLEEQPVYI YKFKYTAPTT LQAAAPSGKP LSLQIGQPTT TTPPTPTPPP TTTTTPQTTT VTTTPSATPT ATPTTQTATS PTATPSATAP PATTQPTPAP TAAATATPAP SYQPSAFPVE AVILLVIAVA GAAVIAVTAR KKPTSEEAKT R
|
| |