Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1858 |
Symbol | |
ID | 5056024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1660457 |
End bp | 1661704 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469404 |
Product | molybdenum cofactor synthesis domain-containing protein |
Protein accession | YP_001154061 |
Protein GI | 145592059 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0303] Molybdopterin biosynthesis enzyme |
TIGRFAM ID | [TIGR00177] molybdenum cofactor synthesis domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.269883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.565686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGGCT TTAAGACACT GATGCCGATA GCGGAGGCAC AGAGGGCGGT CATCAGCGCC ATTGCCCACA AGCCCTCTGT AGTCACAGTG CCGACGCCCC AGTCGGTGGG GCTGTACGTA GCGCAGGACA TATTTGCGCC TGTAGACGTG CCGCCATTCG ATAGGGCTGC CTTCGACGGT TTTGCGGTGA GGTCTGTTGA CACTATCGGC GCATCAAGGA CAAATCCCAT AATGCTAAAG GTGGTCGGCA AGTCGCTACC GGGCCTCGGC TACCGCGGCG CCATTGGGCC TGGGGAGGCG GTGGAAATAG CAACAGGCGC GCCTCTGCCC GATGGCGCAG ATGCGGTCGT GCCTTATGAA GAGGCGGCGC ACAGGGGGGA GTACATTGAG GTGTATAAGC CAGTACCCCA GTACTACTAC GTCTCGCGCA GGGGAGAGGA CGTATCGGCA GGAGAAGTTG TTTTAAAGCG GGGAAGGCGG ATTAAGCCGT GGGACGTCGG CGTATTGGCC TCCCTAGGCA TTAAAGAGGT GGCTGTTTAC AAAGTGACGG CAGGCCTAGT ATCCACAGGA AATGAGCTCG TTGAGCTAGA AGATGCGCCT CCGCCCCCCG GCAAGATTAT AAACAGCACA CGACATATAA TAACGGCGCT TCTACTTGAA CTTGGAGTAA AGACGACCTA CCTAGGGATA GTCCCCGACG ACGTTGATGC AATACACGGC GTTTTGAAAG AGGCACTAGC CAAGTTCGAT ATCGTGATAA CAACTGGCGG CGTCTCTGTC GGCGAGCCCG ACCACGTAGT GGAGGCGGTA AGGCGCCTTA AGCCGGAGGT GCTGGTCCAC GGCATCGCCG CTAGGCCTGG GAGACCTAAT AGCGCAGCGG TGGTGGGGGG AAAGCCGGTG ATTATGCTCT CGGGCTTCCC AGTCGCCTCT ATTGTCGGCT TTGAGGTATT CGTCAAGCCG GTCATTCTCC ACATGGTCGG CGCCAGAGAG GAGCCTCTGC CCGTGGCCGT GGCCACTTTG ACGAGGAGAG TCACCACACC AATTAACGTG AGGAGTTTAG TGAGGGTCAG GGTCTTCCGC CAAGGCAGAG AGCTATACGC AGAGCCGCTT GCCGTCACGG GGAGCGGCGT TTTGTCAACG CTGACAAGGG GCAACGGCCT TTTGATCATA CCGGAAAACA GAGAAGGCTA CGACGAGGGT GACAAGGTTG AGATCGTACT GCTCGGCCCC ATAGAAGAGG AAAAATAA
|
Protein sequence | MKGFKTLMPI AEAQRAVISA IAHKPSVVTV PTPQSVGLYV AQDIFAPVDV PPFDRAAFDG FAVRSVDTIG ASRTNPIMLK VVGKSLPGLG YRGAIGPGEA VEIATGAPLP DGADAVVPYE EAAHRGEYIE VYKPVPQYYY VSRRGEDVSA GEVVLKRGRR IKPWDVGVLA SLGIKEVAVY KVTAGLVSTG NELVELEDAP PPPGKIINST RHIITALLLE LGVKTTYLGI VPDDVDAIHG VLKEALAKFD IVITTGGVSV GEPDHVVEAV RRLKPEVLVH GIAARPGRPN SAAVVGGKPV IMLSGFPVAS IVGFEVFVKP VILHMVGARE EPLPVAVATL TRRVTTPINV RSLVRVRVFR QGRELYAEPL AVTGSGVLST LTRGNGLLII PENREGYDEG DKVEIVLLGP IEEEK
|
| |