Gene Cmaq_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0014 
Symbol 
ID5709080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp26412 
End bp27662 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content43% 
IMG OID641274517 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_001539858 
Protein GI159040606 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.735237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTA ACCCAATTGT ACTCGTTAGG TATGGGGAAA TAGGAGTGAA GAGTAATAGG 
GTTAGGGTTA GGTTGGAGAA CCTGCTCACT AAGAATATTC AGGAAGCCTT AAGGAGAGGA
GGTGTATTAA ACTATAGTAT TAGTAAGACT AGGGGTAGGA TTCTGATTAA CGTGCCTAAG
GAGAATTTAA TTAATACAGC CCTAATGGCC GCGAGGGTTT TTGGCGTTGT ATCAACATCA
CCAGCCTACT CGCTTAACTT CAGTAGCATT AATGATATAG TTATTGCGGC CTATGAACTA
TGGCGTGGTA AGGTGAATGG GAGGAAGTTC GCGGTCAGGG TTAGTAGGAC TGGTAATCAC
CCATTCACAT CAATTGACGT GGCTAAGCGT GTTGGTGCAG TACTCTACCC ATTCTCCAAT
GGTGTTGACT TAGATAACCC GGAGGTGGAG TTGTATGTGG AGATTAGGGA TAATTACGCC
TTCCTATTCG ATGAGGTAAT AGATGGGCCA GGTGGCTTAC CTTTAGGTTC ACAGGGAGGT
AAGGTACTTG CCCTAGTCTC CGGTGGCTTT GACTCACCTG TGGCTTATTG GTTAATGAGT
AGGAGGGGGG CCTTGGTTGA TGCATTGTTC TGCAGTCTAG CCCCACCAGT GGATGTTATT
GGGTTAATTA GGGTTATTAG GTATCTTTAC GAGAATTGGG TATTTGGGTA CGACCCATTA
ATCATGATCG CTGACTGCAC TCAATTAGTT AACGCCATGA GGAGTTCAGT TAACAGCCAC
TTAATGAACA CTGTCTTTAA GAAATTCCTA TACAGGTTAG CTGAATCAAT AGCTGTGGAG
GGTGGTTACA TGGGTATAGT TACCGGTGAA TCCCTAGGTC AAGTTAGTAG TCAAACCTTA
AGTAACCTGT ACTCAGCATC AGCGGGTATT AATGTGCCTA TATTCAGGCC CTTAATAGGT
ATGGATAAGG ATGATATAAT TAAGTTAGCT AAGAGGATAG GTACGTATGA GGAATCAGTG
AAAATGATTG AACCCTGCTC AGTGTTCTCT AGGAAGCCGA GGACTAGATC AACACCAAGT
AGGCTTGACG AGGAATTAAG CAGGGTAATT AACCTATTAG GCGTAATTAA GGCAAGTATT
ATTAAGGTAA AGGCCAGTGA ATTAAGCAGT ATTAATGAAA GGTCACTGTT AAGTAACTGG
AGAATCAACA TTATGGGCAC TGATGCAGGT GGTTTAGGTT CATGTAAGTG A
 
Protein sequence
MNLNPIVLVR YGEIGVKSNR VRVRLENLLT KNIQEALRRG GVLNYSISKT RGRILINVPK 
ENLINTALMA ARVFGVVSTS PAYSLNFSSI NDIVIAAYEL WRGKVNGRKF AVRVSRTGNH
PFTSIDVAKR VGAVLYPFSN GVDLDNPEVE LYVEIRDNYA FLFDEVIDGP GGLPLGSQGG
KVLALVSGGF DSPVAYWLMS RRGALVDALF CSLAPPVDVI GLIRVIRYLY ENWVFGYDPL
IMIADCTQLV NAMRSSVNSH LMNTVFKKFL YRLAESIAVE GGYMGIVTGE SLGQVSSQTL
SNLYSASAGI NVPIFRPLIG MDKDDIIKLA KRIGTYEESV KMIEPCSVFS RKPRTRSTPS
RLDEELSRVI NLLGVIKASI IKVKASELSS INERSLLSNW RINIMGTDAG GLGSCK