Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0217 |
Symbol | |
ID | 5054198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 195643 |
End bp | 196704 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640467796 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001152484 |
Protein GI | 145590482 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.896474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAAAG CAGAGACCAT GGACGTTCCC AGCAGGAGTT ATATTGAAGA TCTACTGAAG GCGGACTTGT GGGAGCTTGG GAGGAGGGCT TACGAGATCA GGACGAGGCT ATATGGGGAC TTAGCCACCT TCATACCAAA CATGGTCCTG AACTACACAA ACGTCTGCGT TGTCGGGTGC TCCTTCTGCG CCTTCTACCG CCCGCCCCGC CACCCGGAGG CGTATGCGTA CTCTGTGGAG GAGGCCGTGA AGAGGGTCCT CGCCGTAGAC GCGAAGTACG GCATTAGGCA GGTTTTAATA CAGGGCGGGG TCAACCCCGA AATCGGCATT GAGTACTTCG AGGAGCTGTT CCGCGCCATC AAGGCCAAGG CGCCGCACAT CGCGATCCAC GCCCTGTCTC CTCTGGAGGT GGAGTACCTC GCGAGGCGGG AGAGGGCCAC TTACAGGGAG GTGCTGGAGA GGCTTAAGGC GGCGGGGATG GAGTCTATGC CAGGGGGCGG CGGGGAGATC CTCGTCGACG ATGTCAGGAA GATCATAGCG CCGAAGAAGA TCGACAGCGA TACTTGGCTC CGGATTATGG AGGAGGCGCA TAAGGCCGGC ATCCTGTCCT CGGCGACGAT GATGTATGGC CACGTGGAGA CGCCCAGCGA CATAGCAGAG CACATGTACA AAATCGCCCA GCTCCAGGCC AAGACCGGCG GCTTCGTGGC GTTCATCGCG TGGAACTTCG AGCCCGGGAC GAGCGAGCTT GGGAAGAAGA TCCCATACCC CAAGACCTCG GCCACCCTGC TGAGGATGGT CGCCGTAGCC CGCATAGTGT TCAACGGCCT CATCCCCCAC ATACAGGCCG GGTGGCTCAC CACAGGCCCC GAGACCGCCC AGCTGGCGCT GTATTTCGGC GCCGACGACT TCGGTGGGAC CCTCTACGAA GAGAAGGTCT TGGAGTGGGT AAGGGCAGAG GCCCCCATCG ACAGGAAGAC AGACGTCGTC AACATTATCC GAGACGCCGG GTTTAGGCCG GCGGAGCGGG ACAACCTCTA CAGGGCTGTG GCCTACTACT AA
|
Protein sequence | MLKAETMDVP SRSYIEDLLK ADLWELGRRA YEIRTRLYGD LATFIPNMVL NYTNVCVVGC SFCAFYRPPR HPEAYAYSVE EAVKRVLAVD AKYGIRQVLI QGGVNPEIGI EYFEELFRAI KAKAPHIAIH ALSPLEVEYL ARRERATYRE VLERLKAAGM ESMPGGGGEI LVDDVRKIIA PKKIDSDTWL RIMEEAHKAG ILSSATMMYG HVETPSDIAE HMYKIAQLQA KTGGFVAFIA WNFEPGTSEL GKKIPYPKTS ATLLRMVAVA RIVFNGLIPH IQAGWLTTGP ETAQLALYFG ADDFGGTLYE EKVLEWVRAE APIDRKTDVV NIIRDAGFRP AERDNLYRAV AYY
|
| |