Gene Cmaq_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0021 
Symbol 
ID5710443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp34596 
End bp35645 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content45% 
IMG OID641274524 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein 
Protein accessionYP_001539865 
Protein GI159040613 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.779693 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAACTA TAGCCGTACT GGCAAGCCAC TCAGCATTAG ACGTACTGGA TGGTGCAAAG 
GATGAGGGCT TCAACACAAT AGCGGTAGCC AAGAGGGGTA GGGATTTACC TTACCGTGAA
TTCCCAGTGG TGGATAAGTT GATTCTCGTC GATGATTTCA AGGAGTTGAT CAGTGACCGT
GTAATTAATG AGTTGAGGAA CAGTGAAGCC GTCTTCATAC CCAATAGATC CTTTGCAGTG
TACGTAGGTT ACGATAATAT TGAGGATAAA TTCCCAATAC CAATATTCGG TAATAGGAGG
CTACTCAGGT GGGAGGAGAG GACTGGTCCA TGTAATTACT ATAAGTTACT TGACCATGCT
GGGATTAGGA GACCAAGAAC CTTCAATAGT ATTGATGAGG TTGATAGACC CGTTATAGTT
AAGCTCCCTG AAGCTGCTAG GAGGGTTGAG AGGGGGTTCT TTATAGCTAG GGATAGGGAT
GACGCCTTAA GGAAGGTTAA GGAATTAGCT GATAAAGGTA TCATAAGGTT AAGTGACTTG
GATAATGCAT CAATAGAGGA GTTAGTTATT GGCGCCCATT TCAACGCCAA TTACTTCCAC
TCTAAGGTAA GGGGAAGGCT TGAGTTACAT AGTTTCGATA GAAGGATTCA ATCAAACCTA
GATGGCGTCT ACAGGATACC GGCTCAGGAT CAAATTGGGC TTAACATTGA TGTTAGGTAC
ATTGAGGTTG GGCATGAACC AGCAACCATA AGGGAGAGCC TACTTGAGAA GGTTTTTGAA
ATTGGGCATA GGTTCGTTAA GGCAACTGAG GAAATGGTAC CACCCGGGGT TATAGGGCCG
TTTACCCTTC AATTCATGGT AACCCCTGAA TTAGACCTGG TGGTTTATGA TGTCGCACCC
AGAATAGGCG GTGGCACTAA TGCTTACCTG GGTATTGGCG GGCAATACAG TAAACTCTAC
TTCGGTAAAC CCATATCAAT AGGTAGGAGA ATAGCCATTG AGGTTAAGGA GGCTGTAGCC
AATAGCATGC TTGAGAAGGT GACCACATGA
 
Protein sequence
MVTIAVLASH SALDVLDGAK DEGFNTIAVA KRGRDLPYRE FPVVDKLILV DDFKELISDR 
VINELRNSEA VFIPNRSFAV YVGYDNIEDK FPIPIFGNRR LLRWEERTGP CNYYKLLDHA
GIRRPRTFNS IDEVDRPVIV KLPEAARRVE RGFFIARDRD DALRKVKELA DKGIIRLSDL
DNASIEELVI GAHFNANYFH SKVRGRLELH SFDRRIQSNL DGVYRIPAQD QIGLNIDVRY
IEVGHEPATI RESLLEKVFE IGHRFVKATE EMVPPGVIGP FTLQFMVTPE LDLVVYDVAP
RIGGGTNAYL GIGGQYSKLY FGKPISIGRR IAIEVKEAVA NSMLEKVTT