Gene CPF_1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1552 
SymbolgalK 
ID4201933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1771369 
End bp1772532 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content30% 
IMG OID638082430 
Productgalactokinase 
Protein accessionYP_695995 
Protein GI110801332 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAA ATACACTTAA ATCAACTTTT ATTAATAATT TTGGTAAAGA ACCTACTTCA 
TTATTTTTCT CACCAGGAAG AATAAATCTT ATAGGTGAAC ATATTGACTA CAATGGGGGC
TTCGTATTTC CTTGCCCAAT AACTCTTGGA ACTTTTGCAG CAGCAAGCTT AAGAGAAGAT
AGAATTTGTA GAGCTTACTC TCTTAACTTT GAATCTCTTG GAGTAATAGA GTTTTCTTTA
GATGATTTAT CTTATAAAAA AGAGGATAAT TGGACAAACT ATCTTAAAGG AGTACTAAAA
GTACTTATAG AAAAAGGATA TAAAATAGAT AAGGGTATTG ACTTAGTTAT CAATGGAAAT
CTTCCAAACG GAGCTGGTCT TTCCTCTTCA GCATCTTTAG AAATGTTAAT AGTAAAAATT
TTAGATACTT TCTTTTCTCT TAATATTTCA AAGGTAGATG CTGCACTAAT AGGTAAAGAG
GTAGAAAATA CTTATATAGG TGTTAATAGT GGTATAATGG ATCAATTTGC TATTTCTCTA
GGAGAAAAGG ATAAGGCAAT TTTACTTGAT TGTAATAGCT TATATTATGA ATATGTTCCT
TTAAACTTAG GAGATAATTC AATAATTATA ATGAACACTA ATAAAAGACG TGAACTTGCA
GATTCTAAAT ATAATGAAAG AAGAAAAGAA TGTGATGATT CTTTAGACAC TTTAAAGAAA
TATACTAATA TTTCTTCTCT TTGTGAACTA ACTTCATTAG AATTTGAAAC ATATAAAGAT
AAAATAGAAG ATTCTAATAA ATTACGTAGA TGTGTACATG CTATTTCTGA AAATGAAAGA
GTAAAAGATG CTGTAAAAGC TTTAAAAGAA AATAATCTTG AATTATTTGG ACAACTTATG
AATCAGTCTC ATATTTCCCT AAGAGATGAT TATGAAGTTA CTGGTAAAGA ATTAGATACC
CTAGCTGAAA ACGCTTGGAA ACAACCTGGA GTTTTAGGTG CTCGTATGAC TGGGGCTGGC
TTTGGTGGAT GTGCCATAGC AATAGTTAAT AATGCTCATG TTGATGAATT TATTAAAAAT
GTTGGACAGG CTTACAAGGA TGCTATAGGA TATGAAGCAT CATTCTATGT TGCTTCTATA
GGTAATGGTC CTACTGAACT TTAA
 
Protein sequence
MELNTLKSTF INNFGKEPTS LFFSPGRINL IGEHIDYNGG FVFPCPITLG TFAAASLRED 
RICRAYSLNF ESLGVIEFSL DDLSYKKEDN WTNYLKGVLK VLIEKGYKID KGIDLVINGN
LPNGAGLSSS ASLEMLIVKI LDTFFSLNIS KVDAALIGKE VENTYIGVNS GIMDQFAISL
GEKDKAILLD CNSLYYEYVP LNLGDNSIII MNTNKRRELA DSKYNERRKE CDDSLDTLKK
YTNISSLCEL TSLEFETYKD KIEDSNKLRR CVHAISENER VKDAVKALKE NNLELFGQLM
NQSHISLRDD YEVTGKELDT LAENAWKQPG VLGARMTGAG FGGCAIAIVN NAHVDEFIKN
VGQAYKDAIG YEASFYVASI GNGPTEL