Gene Cmaq_0579 details

Gene Information

Locus tag: Cmaq_0579
Symbol:
ID: 5709070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 609356
End bp: 610441
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 46%
IMG OID: 641275080
Product: GHMP kinase
Protein accession: YP_001540410
Protein GI: 159041158
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID:
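
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The function name `feature_length` is hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 609356
end_bp = 610441


def feature_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = feature_length(start_bp, end_bp)

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon and therefore contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the script reproduces the reported 1086 bp gene length and 361 aa protein length.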


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 4.95739e-11
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.000433765
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATAGTTT CATCAGCCCC AGGCAGGGTG GATCTATTCA ACACTCACCA GGACTATAAG 
GGGTTACCTG TGGTTCCAGC TGCAGTGGAT CTAAGGACTA CTGTTGAGGG GGAGTTGAGG
ATGGGTGATG TTATTAAGGT TAAGTCAATT AACATGAATG AGGAGGTTGA GTTAAGGATG
AGTGATGAAT TGAAGGTTGG GCAATGGAGC AGTTACGTAT TAGCCGCATT GAAGGCTTTA
TCAATGCACG GTTACTTCAT AGGTGGTGCC GAGTTAACAA TAAGAAGCAG TGTACCCGTG
GGTTCAGGGT TAGCCAGCAG TGCTGCATTA CTGGTTTCAG TGGTGAACTG GTTCAATAAG
GCTTACGGCT TAGGCTTATC TAAAAGGGAT ATTGCTGAAT TGGCCTATGA GGCTGAGCAC
AATGTAATGG GCATACCCTG TGGTAGACTT GACCAATACT CCTCATCATA TGGAGGCCTA
ATAATACTAG AGACTAGGCC ACCCTACAGG GTTGAGGAAC TGGGTGTTAA GGGCCTTGAC
TTCATTGTTG TTGACTCAGG GGTTAGGCAT AGTACAATGA ATGTGCACAC GGTTAGGCAG
AGGGAGCTTA GGGAGGCCCT CAGTATGCTT AAGGAGTCCA TACCGAGGAG TCATTGGGAT
AAATTAAATA AGCCACTTGA TGAGATTGAT TGGGATTGGT TAGCCGCAGC GGCTAAGGAT
TACTTAAACA CACTGAGTGA CGTGCATAGG AGGAGACTTG AATTCACAAT ACTCATGAAT
GAGTCAACAA AGAAGGCTAT AGTTGAGTTG AGGAAGAGTA AACCGGATAG GAGGGTTTTA
GGTGAAATCA TGAATGAGCA GCATAGGTTA CTTAGGGACC TATATGAAGT CAGTATACCG
GAGCTTGAGG AGATTAAGAG GGTCCTCGAC CTCAATGGCG CATTAGGGTC TAAGATAAGT
GGCGCAGGCA TGGGGGGTTC AATAGTGGCC TTAGCTGAGG ATAGGAAGGA GGCTGAGAGG
ATCCTTGACT CCATTAAAAG TAAGTGGAGG GGTTGGGTTG TATCAATTGA TCAAGGCGTG
TCATAA
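
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as GC content can be computed. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
sample = clean_sequence("ATGATAGTTT CATCAGCCCC")
print(len(sample), gc_fraction(sample))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` would give the whole-gene value to compare against the GC content reported in the record.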
 
Protein sequence
MIVSSAPGRV DLFNTHQDYK GLPVVPAAVD LRTTVEGELR MGDVIKVKSI NMNEEVELRM 
SDELKVGQWS SYVLAALKAL SMHGYFIGGA ELTIRSSVPV GSGLASSAAL LVSVVNWFNK
AYGLGLSKRD IAELAYEAEH NVMGIPCGRL DQYSSSYGGL IILETRPPYR VEELGVKGLD
FIVVDSGVRH STMNVHTVRQ RELREALSML KESIPRSHWD KLNKPLDEID WDWLAAAAKD
YLNTLSDVHR RRLEFTILMN ESTKKAIVEL RKSKPDRRVL GEIMNEQHRL LRDLYEVSIP
ELEEIKRVLD LNGALGSKIS GAGMGGSIVA LAEDRKEAER ILDSIKSKWR GWVVSIDQGV
S
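
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library; it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three ten-base blocks of the gene sequence, as printed in the record.
print(translate("ATGATAGTTT CATCAGCCCC AGGCAGGGTG"))
# Last six bases of the gene sequence.
print(translate("TCATAA"))
```

The first call yields the opening ten residues of the protein sequence (MIVSSAPGRV), and the second shows the final serine followed by the stop codon.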