Gene Caci_7279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7279 
Symbol 
ID8338647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8456558 
End bp8457814 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content73% 
IMG OID644960360 
Productgalactokinase 
Protein accessionYP_003117949 
Protein GI256396385 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGTGA GCCTCGACAC GGGATCGGTG GCGTCGACGG TGGACGGCGC GCGGCGTGGA 
TTCCACGAGC GGTTCGGCAG CGAGCCCGCC GGAGTGTGGG CCGCGCCGGG GCGGGTGAAC
GTCATCGGCG AACACACCGA CTACAACGAC GGCTTCGTCC TGCCGATGGC GATCGACCGC
GCCTGCTACG CGGCCGCCGC GCCCCGACAG GACCGGACAC TGCGGGTGTA CGCCGCACAG
CTCGACGAGA CGGTCGAGAT CTCCCTCGAC GCCCTCGTCG GCCCGGGGTC GGCCGGCGAC
AGCCGCGACG CCGCGGACCC CGCCATGGCG CCGCTGGCCG AACCGGCGGT CCGCGGCTGG
GCCGGCTACC CGGTCGGCGT CGCGTGGATC CTGCAGCAGG CCGGGTTCCC GGTCGGCGGC
GCGGACGTCT ACCTGACCAG CACTGTGCCG GTCGGTTCCG GGCTGTCCTC CTCGGCTGCC
CTGGAGTGCG CGACAGCGCT CGCGCTCGCG GGTGTCTCAG GGTTCGAGCT CACCACGGCC
GAACTCGCGC GGCACACGCA GCGCGCGGAG AACATCTACG CCGGCGTGCC GTGCGGCCCG
CTGGACCAGA TGTCCTCGGC GTTCGGCCAG GACGGCTCGG TGTTGTACAT CGACACCCGC
AGCGGCGAGG TGCGGCCGCA GCCCTTCGAT CTGGCCGCCG AGAACGCGCT GCTGCTCATC
ATCGACACCC GGGTGTCGCA CGCGCACGGC GAGAACGGCT ACGCCGACCG CCGCTCGGCG
TGCGAGCGTG CCGCCGAGTT CCTGGGCGTC GCCGCCCTGC GCGACGTCTC GGTGGCCGAG
CTTCCTGAAT CTTTCGCGAA GGTGTCGGTC GGGCTCGGCG AGACCTTCGC GCGGCGGCTG
CGGCACGTCG TCACCGAGGA CGCGCGCGTC GAGCAGTTCG TCGAGATCCT GAACCAGGAG
CCGCTCTCCC TGCCCCGGCT GGGGCACCGC ATGATGCAGT CCCACGCCTC CCTGCGCGAC
GACTACGAAG TCAGCGCCCC CGAACTCGAC CTCGCCGTGG CCACCGCGGT CGCGGCCGGC
GCGCACGGCG CGCGGATGAC CGGCGGCGGC TTCGGCGGCA GCGCCATCGC GCTGGTCGAC
CGCGATCTGC TGGACGACGT GCGCGCGGCC GTCGTCGCCG CTTTCGCAGA ACACGGATAC
ACCGAGCCCC AGTTCTTCCC CGCCGTCGCT TCGGCGGGCG CGCACCGGAT CCCCTGA
 
Protein sequence
MAVSLDTGSV ASTVDGARRG FHERFGSEPA GVWAAPGRVN VIGEHTDYND GFVLPMAIDR 
ACYAAAAPRQ DRTLRVYAAQ LDETVEISLD ALVGPGSAGD SRDAADPAMA PLAEPAVRGW
AGYPVGVAWI LQQAGFPVGG ADVYLTSTVP VGSGLSSSAA LECATALALA GVSGFELTTA
ELARHTQRAE NIYAGVPCGP LDQMSSAFGQ DGSVLYIDTR SGEVRPQPFD LAAENALLLI
IDTRVSHAHG ENGYADRRSA CERAAEFLGV AALRDVSVAE LPESFAKVSV GLGETFARRL
RHVVTEDARV EQFVEILNQE PLSLPRLGHR MMQSHASLRD DYEVSAPELD LAVATAVAAG
AHGARMTGGG FGGSAIALVD RDLLDDVRAA VVAAFAEHGY TEPQFFPAVA SAGAHRIP