Gene CPF_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1978 
SymboleutD 
ID4203179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2217018 
End bp2218019 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content35% 
IMG OID638082847 
Productphosphotransacetylase 
Protein accessionYP_696411 
Protein GI110799517 
COG category[C] Energy production and conversion 
COG ID[COG0280] Phosphotransacetylase 
TIGRFAM ID[TIGR00651] phosphate acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTTA TGAAACAAAT ATGGGAAAGT GCTAAAAATA ATAGAAAAAA AATAGTACTT 
CCAGAAGGTG ATGAAGAGAG AACTCTTGTC GCTTCACAAA GAATAAAAGA AGAAGGACTT
GCTGACGTTT ACTTAGTAGG TTCAGAACAA GTAATAAGAG AAAAAGCTGA AGCTTTAGGC
GTAAATTTAG AGGGAGTAAA CATAGTTGAT CCTGAAACTT CAGATAAATT AGACACTTAT
ATAAACGAAT TTTATGAATT AAGAAAAGCT AAAGGAATGA CAGTTGAAAA AGCTGGGAAA
ATCGTAAGAG ATCCTTTATA CTTCGGAACA ATGATGGTAA AAATGGGTGA TGCAGACGGA
ATGGTATCAG GTGCTGTACA CACAACTGGT GACCTTTTAA GACCAGGCCT TCAAATAATA
AAAACTGCTC CAGGAGTATC TGTAGTATCA AGTTTCTTCA TAATGATGGT ACCAGGTTCA
CAATATGGAG AAGGTGGAAT GTTATTATTC TCAGACTGTG CTGTTAATCC AAATCCAAAT
GCAGACCAAT TAGCTGCTAT AGCTATAGCT ACTGCTGATA CTGCTAAAAA TCTTTGCAAA
ATGGATCCAA AGGTTGCAAT GCTTTCATTC TCAACAATGG GAAGTGCTGA TCATGATTTA
GTAACAAAGG TAAGAGTAGC AACAGAAAAA GCTAAAGAAT TAAGACCAGA TTTAGATATA
GATGGTGAAT TACAATTAGA TGCTGCTATA GTTGGAAAAG TTGCTTCACA AAAAGCTCCA
AATAGTAAAG TAGCAGGAAA TGCTAATGTT TTAGTATTCC CAGATTTACA AGCTGGAAAT
ATAGGATATA AATTAGTTCA AAGATTTGCT AATGCAGAAG CTATTGGACC AGTTTGTCAA
GGTTTCGCTA AACCAATAAA CGACCTTTCA AGAGGATGTA GTTCAGAGGA TATAGTTAAC
GTTGTTGCAA TAACTGCTGT TCAAGCACAA GCAACTAAAT AG
 
Protein sequence
MELMKQIWES AKNNRKKIVL PEGDEERTLV ASQRIKEEGL ADVYLVGSEQ VIREKAEALG 
VNLEGVNIVD PETSDKLDTY INEFYELRKA KGMTVEKAGK IVRDPLYFGT MMVKMGDADG
MVSGAVHTTG DLLRPGLQII KTAPGVSVVS SFFIMMVPGS QYGEGGMLLF SDCAVNPNPN
ADQLAAIAIA TADTAKNLCK MDPKVAMLSF STMGSADHDL VTKVRVATEK AKELRPDLDI
DGELQLDAAI VGKVASQKAP NSKVAGNANV LVFPDLQAGN IGYKLVQRFA NAEAIGPVCQ
GFAKPINDLS RGCSSEDIVN VVAITAVQAQ ATK