Gene CPF_1226 details


Gene Information

Locus tag: CPF_1226
Symbol:
ID: 4201039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1394596
End bp: 1396506
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 25%
IMG OID: 638082107
Product: hypothetical protein
Protein accession: YP_695672
Protein GI: 110799071
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
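
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, although both reproduce the listed values. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1394596
END_BP = 1396506


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1911 636
```

Both results agree with the Gene Length and Protein Length fields.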


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000692495
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGAAGA AACAAATTAT GTATCTTAAA ATGTTTTTGA TAGTGGTAAT TTCTATACTA 
ATATTTGTTT TATTATTTGG AAAAGAAAAT CTTTTTATTG GATTAGCCGC TGTTATTACA
GTTACTACAA TGTTTGGAGA GGATTATACA ATAAATCCAA TACATCATAC ATTATACTTT
ACTGGAGTTG AACTTTTTGT AGGTTTAGGA GCATATTTTG CAGGATTAAA TCCTATTTTA
GGGGCCATAA TGACTTTAAT TGTATCATTC TTTATATACT TTTCTTTTAC TTATGATACA
AAGCCTACTA AAGCCTTAGG ATTTATACAG TTATATTTAT TTTTATTATA TGAACCAGTT
ACAACTAGCG AATTACCAAA AAGAGTATTT GCTTTAGTTT TTGGTGGTAT AGTCATAATG
ACACTTTACT ATATCTTAGC TAGATATAAT TTTAATAATA TATTTAACAA ATCAATAAAA
GGGTGTATTA ATTCATTAAT TGAGAACTTA AATGAATTGA TAAATACTGG TTGTATAGAA
AAAAATAATA GTGAAAAGGT AGCTTTAATG ATTAAAGAAC TTGAGATAAA GGTTTATGAA
AGATTGGAAT TAGATAAAAA TCAAGTTTAT TCTATATATT CAAAAGATTT AATAGTAGTT
TTTCTAAAGA GAGTATCAAG TCTTTCATCT GAAGCCATGA AGAGAGATGT TAATAAGGAA
CTTATTAAAA AGACCATTGA ATTACTTAAA AATGTGCAAG ACTTTATAGA GGATGGAAAT
AAGGAAAAGT TAAAAAATAT ACTAAATGAA TATTATAATT ATTTAGATTC TTTTAAATTA
AGAGATGATT TAGATAGGTA TAGCTATTAC AATGCTAAAT TAAGTGTAAA GGAGTTTTTA
CAAGGACTTA ATGTGAAAGA AAATCAGATT GAAAGCTATT CAAATTATAT TGCAAATTTT
TTAAAAGCTT TAAAATCTAA TTCAAATAAT TTTAGAAAAA GCCTAAACAT AAGATCCCTA
AGATTTAATT TGGCAGTAAA AGCAGCCATA ACATTGGCTT TTTCAGTATT TATAGTAAAT
TACTTTGAAA TTTTTCAAGG TAAGTGGGCA TTATTCACTA TTTCCTTATT ATTAATTCCC
TATGCGGAAC AAAGTAATAA GAAGGCAAAG GCAAGGGTTT TAGGAACTAT AATAGGAGCA
ATTTTATTTA ATGTAATTTT TTATTTTATT CAGTCAAATG TAATACTGAT ATTTGCTTTT
ATAGTTGTTT GTATATACTT AAGTATGTCT GTTGTTTCTT ATAAAATTAG ATGTATATTT
ATAACATTAA ATGCATTATT AATGGCTGGG GTAATGGACC CATCACATAC TCCTTATTAT
GTGCTAAGTG AATATAGGGT ATTTTTTATC TTAATAGCAT CTATAATTGT TGCTATAGTA
ATGAATTATA TTTTCCCTTA TAAAATGAAG GATGAAACTA ATAAAGCAAT AAAATTATAT
GTTTATTTAA ATGAGAGAAT ACTAGATGGT TTAATAGAAG AAAATTCAGA TAAGGAAAAA
TTAATATTAT CTTTTTTTGA AAGTTATAGA ATATGGAAGA AGATAAATTA TAACAATAAA
GAGATGAAAT CAGAGCAAAT TCAAGAGTTA TTGATATTGC AAAATGATTT TGTATCAGAT
GTAAACTTCT TAATAAAAAG CCCTATGATA CATAATAATA AAAGCTTATT TAAAAAGAGC
TTTGAAGACT TTAAAAATTA TACTAAGAAA AAAGACTTTG AAGATTTAGC AATAGAAGAT
TTAGAGAAGG CAAAAACAGA AGAAGAGAGA TTAATACTTG TATTGCTTTA TAAGCTTTTC
TATGTGATAA AAGAGATGAG AAAGTTAAGT GATTTAATAT GTAAAGAATA A
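
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind that could underlie the GC content field. How the database rounds or computes its 25% figure is not stated, so treat this as an illustration only; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTGGAAGA AACAAATTAT GTATCTTAAA ATGTTTTTGA TAGTGGTAAT TTCTATACTA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` gives a value that can be compared with the reported GC content.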
 
Protein sequence
MWKKQIMYLK MFLIVVISIL IFVLLFGKEN LFIGLAAVIT VTTMFGEDYT INPIHHTLYF 
TGVELFVGLG AYFAGLNPIL GAIMTLIVSF FIYFSFTYDT KPTKALGFIQ LYLFLLYEPV
TTSELPKRVF ALVFGGIVIM TLYYILARYN FNNIFNKSIK GCINSLIENL NELINTGCIE
KNNSEKVALM IKELEIKVYE RLELDKNQVY SIYSKDLIVV FLKRVSSLSS EAMKRDVNKE
LIKKTIELLK NVQDFIEDGN KEKLKNILNE YYNYLDSFKL RDDLDRYSYY NAKLSVKEFL
QGLNVKENQI ESYSNYIANF LKALKSNSNN FRKSLNIRSL RFNLAVKAAI TLAFSVFIVN
YFEIFQGKWA LFTISLLLIP YAEQSNKKAK ARVLGTIIGA ILFNVIFYFI QSNVILIFAF
IVVCIYLSMS VVSYKIRCIF ITLNALLMAG VMDPSHTPYY VLSEYRVFFI LIASIIVAIV
MNYIFPYKMK DETNKAIKLY VYLNERILDG LIEENSDKEK LILSFFESYR IWKKINYNNK
EMKSEQIQEL LILQNDFVSD VNFLIKSPMI HNNKSLFKKS FEDFKNYTKK KDFEDLAIED
LEKAKTEEER LILVLLYKLF YVIKEMRKLS DLICKE
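
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on an external library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene starts with ATG, no special handling of the first codon is needed here. The `translate` helper is hypothetical and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence, copied from the record.
print(translate("ATGTGGAAGA AACAAATTAT GTATCTTAAA"))  # MWKKQIMYLK
print(translate("GATTTAATAT GTAAAGAATA A"))           # DLICKE
```

The two outputs match the first ten and last six residues of the protein sequence above.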