Gene CPF_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0076 
Symbol 
ID4202124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp90897 
End bp91844 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content36% 
IMG OID638080957 
Productputative glucokinase 
Protein accessionYP_694540 
Protein GI110800941 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATT ACGTTGTTGG AATAGATCTA GGGGGAACAA AAATTAGCTG TGCTCTTGCT 
GATCTAGAAG GAAATGTTAA AGCTCAACAT ACAACTCCAA CTAATGCTCA TGAAGGAGAG
CAAGCAGTTT TAGATAGAAT TATAGGCTGT GTTGAAACTG TAATATGTGA AGGAAAAGTA
ACTATTGATG AAGTAGAAGC AATAGGTATT GGATCACCAG GACCACTAGA TGCTAGAACT
GGTATAATAA TAACAACTCC AAATTTACCT TTCAAAAACT TCAACTTAGT TTCACCATTA
AAAGCTAAGT TTGGTATTCC TGTTTACTTA GATAATGATG CTAACGTAGC TGCTATAGGT
GAATTTATGT TAGGTGCTGG AAAAGGTACT GAAAATATGA TTTATATAAC TGTAAGTACT
GGTGTAGGTG GAGGAGCAAT CCTTAACGGT AAAATTTACA GAGGAAGTAC TTCAAACGCA
TTAGAAATTG GACATTCAAC TGTTGCACCT GGAACTGTAA GATGTAATTG TGGTAACATG
GGATGTCTAG AAGCTGTATC ATCAGGAACA GCTATTGGTA AAAGAGGAAG AGAGGCAGTT
GCTACAAATG TAGAAACAAG CTTAAAAGAT TACGACAATG TAACTTCATA TGAAGTATTT
GTTGAAGCAG CTAAAGGTGA TAGAGTTGCA AAATCAATAA TAGATGAAGC TTTAAACTAC
TTAGGAATTG GTGTTGCAAA TGCAATAGCA ACTTTTGACC CAGACATGGT TGTTATAGGT
GGAGGAGTTT CAAAAGCTGG AGAAGTTGTT TTTGAAACAG TTCAAGAAGT TGTTAATGAA
AGATGTTTTA AAGCTATGGC TGAGCATTGT AAAATAGTTC CTGCTGGATT AGGAACTGAT
GCAGGAGTTA TTGGAGCAGT AGCTTTAGCA TTATTAGAGT GCAAATAA
 
Protein sequence
MKNYVVGIDL GGTKISCALA DLEGNVKAQH TTPTNAHEGE QAVLDRIIGC VETVICEGKV 
TIDEVEAIGI GSPGPLDART GIIITTPNLP FKNFNLVSPL KAKFGIPVYL DNDANVAAIG
EFMLGAGKGT ENMIYITVST GVGGGAILNG KIYRGSTSNA LEIGHSTVAP GTVRCNCGNM
GCLEAVSSGT AIGKRGREAV ATNVETSLKD YDNVTSYEVF VEAAKGDRVA KSIIDEALNY
LGIGVANAIA TFDPDMVVIG GGVSKAGEVV FETVQEVVNE RCFKAMAEHC KIVPAGLGTD
AGVIGAVALA LLECK