Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0220 |
Symbol | |
ID | 5056416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 198698 |
End bp | 199750 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640467799 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001152487 |
Protein GI | 145590485 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGA GAGGCTTAAA CCGCTCAGAT GCCCTTTACC TCATGCGCGA AGCGGACGTC TTCACCTTGG CCAAAGCCGC TGAGGAGCTG ACGCGGAAGT ACTACGGCGG CGTTGTGACC TTCGTCAACA ACGTGGTTAT CAACTACTCG AACGTGTGCG TTGCCAAATG CCCCATCTGC GCCTTCTATA GGCTCCCCGG CCACGGGGAG GGCTACGTGA GGAAGCCCGG GGAGGTGGCG GCGATGGTGG AGCGCTTTGC GAAAGAGCTC GGCGTCACCG AGCTCCACAT AAACGGCGGC TTCAACCCTT TCCTGACGCC GGAGTACTTC GATGAGCTGT TCCGAGAGGT GAAGAGGAGG GTTCCCCGCG TGGCGATAAA GGGTCCCACC ATGGCTGAGG TGGACTACTA CGCCAAGCTG TGGCGCGTCT CGCGGCAGGA GGTCCTATCG CGCTGGAAAG AGGCGGGGCT AGACGCCATT TCGGGCGGCG GCGCCGAGAT ATTCGCAGAG GAGGTCAGGA AGGTGGTTGC CCCCCACAAG ATATCTGGCG AAGAGTGGCT CGAAATTGCG GAGCTGGCCC ACAAGATGGG CATACCCAGC AACGCCACCA TGCTCTACGG ACACGTGGAG AGAGAGGAGC ACGTGGTAGA CCACATATTC CGCGTCAAGG ACCTCCAGGA GAGGACTGGG GGCCTCCTCC TCTTCATCCC CGTTAAGTTC AACCCAGATA ACACGGAGCT CAAGGCGAGG GGGGTCGTCG CGAGGCCGGC CCCCTCCACC TACGACGTGA AGGTGGTGGC CATAGCGAGG CTGATCCTAG GGGACAGGCT AAAGGTGGCT GCCTACTGGC TCTCCGTGGG CAAGAAGCTG GCCTCCACCC TCCTACTTGC CGGCGCCAAC GACCTAGTGG GGACGATGTA CAACGAGGCT GTGCTCACCT CGGCCGGGGC GAGGCACAGC GCGTCGGTGG AGGAGTTGGC AGAAATTGCA AGAGAGGTGG GCAAAACACC TGCACTGAGG GACACATTCC ACAGAGTACT GGCCTACTTG TAG
|
Protein sequence | MAERGLNRSD ALYLMREADV FTLAKAAEEL TRKYYGGVVT FVNNVVINYS NVCVAKCPIC AFYRLPGHGE GYVRKPGEVA AMVERFAKEL GVTELHINGG FNPFLTPEYF DELFREVKRR VPRVAIKGPT MAEVDYYAKL WRVSRQEVLS RWKEAGLDAI SGGGAEIFAE EVRKVVAPHK ISGEEWLEIA ELAHKMGIPS NATMLYGHVE REEHVVDHIF RVKDLQERTG GLLLFIPVKF NPDNTELKAR GVVARPAPST YDVKVVAIAR LILGDRLKVA AYWLSVGKKL ASTLLLAGAN DLVGTMYNEA VLTSAGARHS ASVEELAEIA REVGKTPALR DTFHRVLAYL
|
| |