Gene Information |
Locus tag | Pcal_0738 |
Symbol | |
ID | 4908708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | - |
Start bp | 696663 |
End bp | 697742 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640124487 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001055630 |
Protein GI | 126459352 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
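The length fields in the table above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it); the function names are illustrative and not part of the source database:

```python
# Values copied from the Gene Information table.
START_BP = 696663
END_BP = 697742


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # 1080 359
```

Both results agree with the Gene Length (1080 bp) and Protein Length (359 aa) rows.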
Plasmid Coverage Information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000000230285 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTCAAGC CAGAAGACGT GGTGGAGGCC GCCCTTAGGG GGCTCTCTAG GCACGACGCA GAGGCGCTGA TGCGCGAGGC GGACATGTTC ACGCTGGCCC AGGCCGCCCA CGCCGTCTCG CAGAAGTTCT ACGGCGACGT GGTGACCTTC GTCAACAACA TCGTGGTGAA CTACAGCAAC GTCTGTGTGG CCAAGTGCCC CATATGCGCC TTCTACCGCC TGCCGGGGAG CCCCGACGCG TATGTGCGTA AGCCCGAGGA AGTGGCCGCC GCAGTGAGAC AAGCCGTGGA GCAACGCGGC GTCACGGAGT TGCACATAAA CGGCGGCTTT AACCCCTTCC TAACCCCCGA GTACTTCGAC CAGCTGTTCT CCGCCGTCAA GAAGGCCGCC CCCCGCGTGG TAATAAAGGG GCCCACCATG GCCGAGGTGG ACTACTACGC CAAGCTGTGG CACATGTCTT GGCGAGAAGT CCTCTCCAGG TGGAAAGACG CGGGGCTAGA CGCCATATCG GGCGGCGGCG CCGAGATATT CGCAGAAGAG GTGAGGAGGG TGGTGGCGCC ACACAAGATA TCCGGCGAAG AGTGGATCAG GATAGCCGAG GTGGCCCACG AGCTGGGGAT CCCCAGCAAC GCCACAATGC TCTACGGCCA CGTAGAAGAG GACCGCCACG TGGTAGACCA CATATTCCGC GTAAAAGAGC TACAGGAAAA GACCGGGAGG GTACTCCTCT TCATACCCGT GAAGTACAAC CCGCAAAACA CCGAGTTGTA CAAGAGGGGG GTCGTGAAGG GCCCCGCCCC AGCCACATAC GACGTAAAAG TGGTGGCCAT ATCCCGCCTA ATCCTCTTAG ACAGGCTAAA GGTAGCCGCC TACTGGCTCT CCGTGGGGAA GAAGCTCGCC TCCACGCTAC TCCTCGCGGG GGCCAACGAC CTTGTCGGCA CCATGTACAA CGAGGCCGTG CTGAGGTCGG CCGGGGCGCC CCACTCCGCC ACGCCAGAGG AGCTGGCCGC CATAGCGAGA GAAGTGGGCA AAAGACCCGC CGAGCGGGAC ACCTTCCACA GAATAATTAG ATATATATAG
|
Protein sequence | MLKPEDVVEA ALRGLSRHDA EALMREADMF TLAQAAHAVS QKFYGDVVTF VNNIVVNYSN VCVAKCPICA FYRLPGSPDA YVRKPEEVAA AVRQAVEQRG VTELHINGGF NPFLTPEYFD QLFSAVKKAA PRVVIKGPTM AEVDYYAKLW HMSWREVLSR WKDAGLDAIS GGGAEIFAEE VRRVVAPHKI SGEEWIRIAE VAHELGIPSN ATMLYGHVEE DRHVVDHIFR VKELQEKTGR VLLFIPVKYN PQNTELYKRG VVKGPAPATY DVKVVAISRL ILLDRLKVAA YWLSVGKKLA STLLLAGAND LVGTMYNEAV LRSAGAPHSA TPEELAAIAR EVGKRPAERD TFHRIIRYI
|
| |
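The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-base spacing, translates with the amino acid assignments of NCBI translation table 11 (the table named in the record), and reads an alternative start codon such as GTG as methionine. It assumes the gene sequence is printed in coding orientation, which its first codons suggest even though the strand is listed as "-". All names in the code are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same amino acid assignments as the standard code
# and differs in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for idx in range(0, len(seq) - 2, 3):
        codon = seq[idx:idx + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An initiating codon is read as methionine whatever it normally encodes.
        if idx == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
GENE_PREFIX = "GTGCTCAAGC CAGAAGACGT GGTGGAGGCC"
print(translate(GENE_PREFIX))  # MLKPEDVVEA
```

The output matches the first block of the Protein sequence row. Passing the full Gene sequence string to `translate` and `gc_fraction` allows the same comparison against the complete protein and the GC content row.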