Gene Information |
Locus tag | Cmaq_0021 |
Symbol | |
ID | 5710443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 34596 |
End bp | 35645 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641274524 |
Product | 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein |
Protein accession | YP_001539865 |
Protein GI | 159040613 |
COG category | [R] General function prediction only |
COG ID | [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.779693 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAACTA TAGCCGTACT GGCAAGCCAC TCAGCATTAG ACGTACTGGA TGGTGCAAAG GATGAGGGCT TCAACACAAT AGCGGTAGCC AAGAGGGGTA GGGATTTACC TTACCGTGAA TTCCCAGTGG TGGATAAGTT GATTCTCGTC GATGATTTCA AGGAGTTGAT CAGTGACCGT GTAATTAATG AGTTGAGGAA CAGTGAAGCC GTCTTCATAC CCAATAGATC CTTTGCAGTG TACGTAGGTT ACGATAATAT TGAGGATAAA TTCCCAATAC CAATATTCGG TAATAGGAGG CTACTCAGGT GGGAGGAGAG GACTGGTCCA TGTAATTACT ATAAGTTACT TGACCATGCT GGGATTAGGA GACCAAGAAC CTTCAATAGT ATTGATGAGG TTGATAGACC CGTTATAGTT AAGCTCCCTG AAGCTGCTAG GAGGGTTGAG AGGGGGTTCT TTATAGCTAG GGATAGGGAT GACGCCTTAA GGAAGGTTAA GGAATTAGCT GATAAAGGTA TCATAAGGTT AAGTGACTTG GATAATGCAT CAATAGAGGA GTTAGTTATT GGCGCCCATT TCAACGCCAA TTACTTCCAC TCTAAGGTAA GGGGAAGGCT TGAGTTACAT AGTTTCGATA GAAGGATTCA ATCAAACCTA GATGGCGTCT ACAGGATACC GGCTCAGGAT CAAATTGGGC TTAACATTGA TGTTAGGTAC ATTGAGGTTG GGCATGAACC AGCAACCATA AGGGAGAGCC TACTTGAGAA GGTTTTTGAA ATTGGGCATA GGTTCGTTAA GGCAACTGAG GAAATGGTAC CACCCGGGGT TATAGGGCCG TTTACCCTTC AATTCATGGT AACCCCTGAA TTAGACCTGG TGGTTTATGA TGTCGCACCC AGAATAGGCG GTGGCACTAA TGCTTACCTG GGTATTGGCG GGCAATACAG TAAACTCTAC TTCGGTAAAC CCATATCAAT AGGTAGGAGA ATAGCCATTG AGGTTAAGGA GGCTGTAGCC AATAGCATGC TTGAGAAGGT GACCACATGA
|
Protein sequence | MVTIAVLASH SALDVLDGAK DEGFNTIAVA KRGRDLPYRE FPVVDKLILV DDFKELISDR VINELRNSEA VFIPNRSFAV YVGYDNIEDK FPIPIFGNRR LLRWEERTGP CNYYKLLDHA GIRRPRTFNS IDEVDRPVIV KLPEAARRVE RGFFIARDRD DALRKVKELA DKGIIRLSDL DNASIEELVI GAHFNANYFH SKVRGRLELH SFDRRIQSNL DGVYRIPAQD QIGLNIDVRY IEVGHEPATI RESLLEKVFE IGHRFVKATE EMVPPGVIGP FTLQFMVTPE LDLVVYDVAP RIGGGTNAYL GIGGQYSKLY FGKPISIGRR IAIEVKEAVA NSMLEKVTT
|
| |
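The length fields in this record can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, not part of the source database: the helper names are hypothetical, and it assumes the start/end coordinates are 1-based and inclusive and that the gene span includes the terminal stop codon. Both assumptions are consistent with the values shown (35645 - 34596 + 1 = 1050 bp, and 1050 / 3 - 1 = 349 aa).

```python
def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a listed sequence into 10-character groups."""
    return "".join(seq.split())


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = strip_groups(seq).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(34596, 35645)   # Start bp and End bp from the record
    print(length)                        # 1050, matching "Gene Length"
    print(protein_length(length))        # 349, matching "Protein Length"
```

Passing the full "Gene sequence" value to gc_percent gives a figure to compare with the listed GC content; the record reports a rounded percentage, so a small difference would be expected.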