Gene Information |
Locus tag | Cmaq_1417 |
Symbol | |
ID | 5709303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1495939 |
End bp | 1497033 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641275927 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001541232 |
Protein GI | 159041980 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
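
The length fields above follow from the coordinates by simple arithmetic. A minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the function names `gene_length` and `protein_length` are illustrative, not part of any database API):

```python
# Values copied from the Gene Information table above.
START_BP = 1495939
END_BP = 1497033


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


cds_len = gene_length(START_BP, END_BP)
print(cds_len)                  # 1095
print(protein_length(cds_len))  # 364
```

Both results agree with the Gene Length (1095 bp) and Protein Length (364 aa) rows.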
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.111618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCACCA TTAATGATGT TAGGAACGCG GTTGCCAATG GGGTTAGTAA AACAGCTGCC TTAAGCCTAT TCACTCAATG CGACTTCAGG TTGCTGGCTA AGGTGGCTGA TGAATTAACA ATGAAGATCA CCGGGGACAC GGTTACCTTC GTTAATAACG TGGTTATTAA CTACACTAAC GTATGCGTGG CTCAATGCCC AATATGCGCC TTCTATAGGC TTAAGGGTCA TCCTGAAGCC TACCTGAGGA GTCCCCAGGA GGTTGTTAAG GGTGTTCTTG AGGCTAGGGC TAATTACGGG GTTAGTGAAC TCCACATTAA CGGTGGCTTT CACCCAGACT TAACCATAGA GTACTTTGAG GAGTTATTCA GCAGTGTTAA GAAGGCTGCA CCGGAGATTA CTATTAAGGG GCCTACTGCC GCTGAGGTGG ATTTCTACGC TAAGATTTGG CGCATGAGCA CTAGGGAGGT TCTTGAGAGG CTTAAGGAGG CTGGGTTAGA CGCATTATCA GGGGGTGGGG CTGAAATCCT CAGTGAGGAT GTTAGGAAGA TTATTACCCC ATATAAGTTT AATGCCGAAG CCTGGTTGAG GATTTCCCAT GAGGCCCATG AATTGGGTTT AAGGAGTAAT GCAACAATGA TGTATGGGCA CGTGGAGGAG CCCCACCACA TAATAGACCA CTTATTCTCA ATAAGGAGGC TTCAGGAGGA GACCAATGGC ATACTACTCT TCATACCCAT TAAGTTCGTT CCATGGAATA CAAGGCTCTA TAAGGATGGA TTAGTTAAGG GTCCTGCACC CACTGAACTG GATGTTAAGG TTACTGCAAT ATCAAGGATT GTGCTTGGGG GCGTTATTAA GAGGATTGGG GCCTACTGGA CCTCAGTGGG TAAGAGGATT GCCTCAACAC TCCTACTGGC TGGGGCTAAT GACTTAGTTG GAACCATGAT TAATGAACAA GTGCTTAGGC AAGCCGGCTC AGGCGACGCC TTGAAGCCAA CCACCGTGGC TGAGTTGGCT CACATTGCTA GGGAGGTTGG TAGGAGGCCT ATGGTTAGGG ACACATTCCA CACAAGGCTC ATTGAGGTGA ATTAA
|
Protein sequence | MCTINDVRNA VANGVSKTAA LSLFTQCDFR LLAKVADELT MKITGDTVTF VNNVVINYTN VCVAQCPICA FYRLKGHPEA YLRSPQEVVK GVLEARANYG VSELHINGGF HPDLTIEYFE ELFSSVKKAA PEITIKGPTA AEVDFYAKIW RMSTREVLER LKEAGLDALS GGGAEILSED VRKIITPYKF NAEAWLRISH EAHELGLRSN ATMMYGHVEE PHHIIDHLFS IRRLQEETNG ILLFIPIKFV PWNTRLYKDG LVKGPAPTEL DVKVTAISRI VLGGVIKRIG AYWTSVGKRI ASTLLLAGAN DLVGTMINEQ VLRQAGSGDA LKPTTVAELA HIAREVGRRP MVRDTFHTRL IEVN
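
The protein sequence can be derived from the gene sequence by codon translation. The following is a rough sketch rather than the database's own pipeline: it assumes the sequence is given in the 10-character grouped layout shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; amino-acid assignments are
# those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGTGCACCA TTAATGATGT TAGGAACGCG"))  # MCTINDVRNA
print(translate("ATTGAGGTGA ATTAA"))                  # IEVN
```

The two fragments reproduce the first ten and last four residues of the listed protein. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content row.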