Gene Information |
Locus tag | Pars_1795 |
Symbol | |
ID | 5055723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1611632 |
End bp | 1612852 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469340 |
Product | MoeA domain-containing protein |
Protein accession | YP_001153998 |
Protein GI | 145591996 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0303] Molybdopterin biosynthesis enzyme |
TIGRFAM ID | [TIGR00177] molybdenum cofactor synthesis domain |
|
|
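The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1611632, 1612852)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields of this record.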
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.213804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0671685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTATC TCAGCGGCCT TAATCCTGTG GCGAAGGTTG CAGAGGTGTT GCCTCTGGTG AAAAGAGTAG ACCAAGTAGA AAAGGTGGCC ACGTGGGACG CCGTCGGCAG AGTTGTTGCA AAGGACGTAA CAGCTCCTCA CGACTATCCC CCGCTTCCCA GGGCTTCCTA CGACGGATAC GCTGTTAACT CAGAGGCGAC GCCTGGCAGA TTCAGGGTAG TGGGCACAGT CCTGGTAGGT CAGTATCGGA GAGATATCGA AGTTAAGCCT GGGGAGGCTG TGTATGTAAC AGTCGGCGCA TTTCTCCCAG AGGGGGCCGA CGCCGTGGTG CCCGAGGAGG CTGTGGAACG TGAAGGAGAC TTCGTGGTAG TAAAGTCAAG ATTTGAGAAG TACGCCAATG TAGACCCACC GGGGTCCTAC GTGCGGAAGG GAACAGTCAT GGCGAGTCAA GGTACCGTGC TGACCCCCTT TGACGTGGTC GGCCTTCTAG ATGTTGGGAT AACCGCAGTT TACGCGTACA GGAGGTTGAG AGTGGGCATA ATCGCCACGG GGGACGAGCT TATAGTTCCG CCAATTGACC CAGAAGTGGC AACTGAGCTG GTTTTGAAAG GGAAGGTAAT TGAGTCGACA GCATCTCTAG TGGCGTGGTA CATTGATACA TACATGCCAT ATGTGAAGGT AGAGGAAAGG GTGGTGACAG GCGATAAGCA CGAAGAGGTG CGCTTCTATG TAGACAAATT CCTAGAAAAT TACGATGCTG TGATAATCAC CGGCGGCGCA GGGCCAAGTG AAATTGACCA CTTCTACAAG CTGGGGTTCA GCGGTCTGAG AGGGTTTAGA ATGAAGCCGG GTAGGCCGAC CAGCGTTGCC GTGATCAACG GGAAGCCTGT CTTCGGCCTC TCTGGCTATC CCATTAGCGC TCTACACGGC GTCATAAGAA TAGTAGAGCC CGTGTTGCGC CACATGGCTA ACGTGACGAG ACCGCCTGGT AGCGGATGGG TATACGCCAC GATGGCTCAA GACGTCCAGG GAGAGATGGC CCAGATAGTC AGAGTGAAGC TGGAGATAAG TGAGGGGGAG TTATTAGCCA GGCCGATTAA GGCGAAACAC CACTCATTCA CAGAGCCTGA GACGTGTGGT GTGGCGCTAA TACCGCCTGG AGGAACGAAA AAGGGCGACG TGGTGCCGGT GTTGGTATTT CGCGACGTCA GGAAGCTCTA A
|
Protein sequence | MRYLSGLNPV AKVAEVLPLV KRVDQVEKVA TWDAVGRVVA KDVTAPHDYP PLPRASYDGY AVNSEATPGR FRVVGTVLVG QYRRDIEVKP GEAVYVTVGA FLPEGADAVV PEEAVEREGD FVVVKSRFEK YANVDPPGSY VRKGTVMASQ GTVLTPFDVV GLLDVGITAV YAYRRLRVGI IATGDELIVP PIDPEVATEL VLKGKVIEST ASLVAWYIDT YMPYVKVEER VVTGDKHEEV RFYVDKFLEN YDAVIITGGA GPSEIDHFYK LGFSGLRGFR MKPGRPTSVA VINGKPVFGL SGYPISALHG VIRIVEPVLR HMANVTRPPG SGWVYATMAQ DVQGEMAQIV RVKLEISEGE LLARPIKAKH HSFTEPETCG VALIPPGGTK KGDVVPVLVF RDVRKL
|
| |
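The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration, not the database's own pipeline: it builds the codon table from the NCBI-ordered amino acid string, relying on the fact that translation table 11 uses the same amino acid assignments as the standard code (the two differ only in which codons may act as start codons). The `translate` helper is hypothetical; it ignores whitespace, stops at the first stop codon, and raises `KeyError` on codons containing characters other than A, C, G, or T.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1 until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGGTATC TCAGCGGCCT TAATCCTGTG"))  # MRYLSGLNPV
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should yield the full protein, ending before the terminal TAA.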