Gene CPR_1345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1345 
SymbolgalK 
ID4206052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1517637 
End bp1518800 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content30% 
IMG OID642565899 
Productgalactokinase 
Protein accessionYP_698665 
Protein GI110803879 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.659508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAA ATACACTTAA ATCAACTTTT ATTAATAATT TTGGTAAAGA ACCTAATTCA 
TTATTCTTCT CACCAGGAAG AATAAATCTT ATAGGTGAAC ATATTGACTA CAATGGGGGA
TTCGTATTTC CTTGCCCAAT AACTCTTGGA ACTTTTGCAG CGGCAACCTT AAGAGATGAT
AGAATTTGTA GAGCTTACTC TCTTAACTTT GAATCTCTTG GAGTAATAGA GTTTTCTTTA
GATGATTTAT CTTATAAAAA AGAGGATAAT TGGACAAACT ATCTTAAAGG AGTACTAAAA
GTACTTATAG AAAAAGGATA TAAAATAGAT AAAGGTATTG ACTTAGTTAT CAATGGAAAT
CTTCCAAACG GAGCTGGTCT TTCCTCTTCA GCATCTTTAG AAATGTTAAT AGTAAAAATT
TTAGATACTT TCTTTTCTCT TAACATTTCA AAGGTAGATG CTGCACTAAT AGGTAAAGAG
GTAGAAAATA CTTATATAGG TGTTAATAGT GGTATAATGG ATCAATTCGC TATTTCTCTA
GGAGAAAAGG ATAAGGCAAT TTTACTTGAT TGTAATAGCT TATATTATGA ATATGTTCCT
TTAAACTTAG GAGATAATTC AATAATTATA ATGAACACTA ATAAAAGACG TGAACTTGCA
GATTCTAAAT ATAATGAAAG AAGAAAAGAA TGTGATGATT CTTTAGACAC TTTAAAGAAA
TATACTAATA TTTCTTCTCT TTGCGAACTA ACTTCCTTAG AATTTGAAAC ATATAAAGAT
AAAATAGAAG ATTCTAATAA ATTACGTAGA TGTGTACATG CTATTTCTGA GAATGAAAGA
GTAAAAGATG CTGTAAAAGC TTTAAAAGAA AATAATCTTG AATTATTTGG ACAACTTATG
AATCAATCTC ATATTTCTCT TAGAGATGAT TATGAAGTTA CTGGTAAAGA ATTAGATACC
CTAGCTGAAA ATGCTTGGAA ACAACCTGGA GTTTTAGGTG CTCGTATGAC TGGGGCTGGC
TTTGGTGGAT GTGCCATAGC AATAGTTAAT AATAATCATG TTGATGAATT TATTAAAAAC
GTTGGACAGG CTTATAAGGA TGCTATAGGA TATGAAGCAT CATTCTATGT TGCTTCTATA
GGTAATGGTC CTACTGAACT TTAA
 
Protein sequence
MELNTLKSTF INNFGKEPNS LFFSPGRINL IGEHIDYNGG FVFPCPITLG TFAAATLRDD 
RICRAYSLNF ESLGVIEFSL DDLSYKKEDN WTNYLKGVLK VLIEKGYKID KGIDLVINGN
LPNGAGLSSS ASLEMLIVKI LDTFFSLNIS KVDAALIGKE VENTYIGVNS GIMDQFAISL
GEKDKAILLD CNSLYYEYVP LNLGDNSIII MNTNKRRELA DSKYNERRKE CDDSLDTLKK
YTNISSLCEL TSLEFETYKD KIEDSNKLRR CVHAISENER VKDAVKALKE NNLELFGQLM
NQSHISLRDD YEVTGKELDT LAENAWKQPG VLGARMTGAG FGGCAIAIVN NNHVDEFIKN
VGQAYKDAIG YEASFYVASI GNGPTEL