Gene Cmaq_1417 details

Gene Information

Locus tag: Cmaq_1417
Symbol:
ID: 5709303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1495939
End bp: 1497033
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 48%
IMG OID: 641275927
Product: radical SAM domain-containing protein
Protein accession: YP_001541232
Protein GI: 159041980
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
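
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon, neither of which is stated in the record. The function names are hypothetical.

```python
# Values copied from the record; function names are illustrative, not part of any database API.
START_BP = 1495939
END_BP = 1497033


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1095 364"
```

Under these assumptions the result agrees with the listed gene length (1095 bp) and protein length (364 aa).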


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.111618
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCACCA TTAATGATGT TAGGAACGCG GTTGCCAATG GGGTTAGTAA AACAGCTGCC 
TTAAGCCTAT TCACTCAATG CGACTTCAGG TTGCTGGCTA AGGTGGCTGA TGAATTAACA
ATGAAGATCA CCGGGGACAC GGTTACCTTC GTTAATAACG TGGTTATTAA CTACACTAAC
GTATGCGTGG CTCAATGCCC AATATGCGCC TTCTATAGGC TTAAGGGTCA TCCTGAAGCC
TACCTGAGGA GTCCCCAGGA GGTTGTTAAG GGTGTTCTTG AGGCTAGGGC TAATTACGGG
GTTAGTGAAC TCCACATTAA CGGTGGCTTT CACCCAGACT TAACCATAGA GTACTTTGAG
GAGTTATTCA GCAGTGTTAA GAAGGCTGCA CCGGAGATTA CTATTAAGGG GCCTACTGCC
GCTGAGGTGG ATTTCTACGC TAAGATTTGG CGCATGAGCA CTAGGGAGGT TCTTGAGAGG
CTTAAGGAGG CTGGGTTAGA CGCATTATCA GGGGGTGGGG CTGAAATCCT CAGTGAGGAT
GTTAGGAAGA TTATTACCCC ATATAAGTTT AATGCCGAAG CCTGGTTGAG GATTTCCCAT
GAGGCCCATG AATTGGGTTT AAGGAGTAAT GCAACAATGA TGTATGGGCA CGTGGAGGAG
CCCCACCACA TAATAGACCA CTTATTCTCA ATAAGGAGGC TTCAGGAGGA GACCAATGGC
ATACTACTCT TCATACCCAT TAAGTTCGTT CCATGGAATA CAAGGCTCTA TAAGGATGGA
TTAGTTAAGG GTCCTGCACC CACTGAACTG GATGTTAAGG TTACTGCAAT ATCAAGGATT
GTGCTTGGGG GCGTTATTAA GAGGATTGGG GCCTACTGGA CCTCAGTGGG TAAGAGGATT
GCCTCAACAC TCCTACTGGC TGGGGCTAAT GACTTAGTTG GAACCATGAT TAATGAACAA
GTGCTTAGGC AAGCCGGCTC AGGCGACGCC TTGAAGCCAA CCACCGTGGC TGAGTTGGCT
CACATTGCTA GGGAGGTTGG TAGGAGGCCT ATGGTTAGGG ACACATTCCA CACAAGGCTC
ATTGAGGTGA ATTAA
 
Protein sequence
MCTINDVRNA VANGVSKTAA LSLFTQCDFR LLAKVADELT MKITGDTVTF VNNVVINYTN 
VCVAQCPICA FYRLKGHPEA YLRSPQEVVK GVLEARANYG VSELHINGGF HPDLTIEYFE
ELFSSVKKAA PEITIKGPTA AEVDFYAKIW RMSTREVLER LKEAGLDALS GGGAEILSED
VRKIITPYKF NAEAWLRISH EAHELGLRSN ATMMYGHVEE PHHIIDHLFS IRRLQEETNG
ILLFIPIKFV PWNTRLYKDG LVKGPAPTEL DVKVTAISRI VLGGVIKRIG AYWTSVGKRI
ASTLLLAGAN DLVGTMINEQ VLRQAGSGDA LKPTTVAELA HIAREVGRRP MVRDTFHTRL
IEVN
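
The relationship between the gene sequence and the protein sequence can be sketched with a small codon-table translation. This is a minimal illustration, not the pipeline that produced the record. It uses the standard amino-acid assignments, which translation table 11 shares with table 1. It does not handle the alternative start codons that table 11 permits, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with the standard
# amino-acid assignments used by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and dropping a final stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # remove the terminal stop codon
    return "".join(residues)


# First 30 nt and last 15 nt of the gene sequence listed above.
print(translate("ATGTGCACCA TTAATGATGT TAGGAACGCG"))  # prints "MCTINDVRNA"
print(translate("ATTGAGGTGA ATTAA"))                  # prints "IEVN"
```

Both fragments match the corresponding ends of the protein sequence in the record.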