Gene CPF_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0454 
Symbol 
ID4203307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp538113 
End bp540347 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content29% 
IMG OID638081338 
Producthypothetical protein 
Protein accessionYP_694911 
Protein GI110801230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.110382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AAAAATTAAG TATTTTTATA CTATCAGCAG CGATAGTAAC AAGTTCAGTA 
TCATATGGAA TGACAGCGAA AGCTGATGAG GTCAAAGAGA CTAAAGGATA TAATTACAAT
GTAAATTTAT CAGAGCAAAA AGCTGTAGCT AATAATAAAA TTTCAGTGAA AAAAGTTAAT
GGAGAAATTT TAGGCTATGC AAATCCATTT ACTATGTTAC AAATATTAAA TATGAATGGA
AAAACAGTAG AAGTAATTAC TCAAAGTGGC TTAAGAGGGT ATGTAGATGC TAATGAATTC
ACTTTAGTAG AATCTGCTGT TAATGATAAA TTAATTGAGA AAAATATAGA GGGACATGTT
ACTAATGTTT CAACAGTATT AAATTTAAGA AAAGAGCCTA GAATAGGTGC TGAAATAATA
AATAGATTAC TTAATAATAC TAAGGTTAAC ATACTTGGAA AACAAGGTTC TTGGTATAAG
ATTGAGTTAA ACGGACAAAA AGGATATGTT TATGGCATGT TCTTAAATGA AGGAACTATG
AAAGAGAATA GTTCAAAGAC TATAGCAAAC AAAAAATCAG AAGTTAAATC TGAATATAAG
AAAGAGAAAA AATCTTCAGT TAAAAAAGAA GCTAAAAAAG AGGTTGTTTC AGGGGCTAAG
GCTAAAGTAG CACAAAAGCA AAGAGAAGAA GCTAAAGCTA ATATGACTTC AGAAGTAAAA
CCAGCAATAA ACAAAGAAAC AAAATCTAAT AATAAGGTTG AAGTGAAATC AGAAGTTAAA
AAAGAGGCAA AGAATGAAGT TAAAACTGAA ATGAATTCAG TTACAAAGGC TGAAACAAAC
CAAGTAGTAA CTTCTGAAAT TGTACAAGCA GTAAAATCAG AGAAAAATCC AGAAACTAAT
CTTCAAGTAA GACCAGAGAT TAAATCTGAA GAAAAAACTG AGATGATTTC TAAAGAAAAT
TTAAATGTGA ATCCAGAAGC TAAAAAAGAG GGAATTTCAG AAGAAATAAA GCCTGAGGTT
AATACTGAGG TAGAATCTAA GCCTGAAATG AATAAAGAGG TTAAAGTTGA AGAAGCTAAA
GTAGAGGAAA AACAAGAACC TAAGCAAGAG GAAGTTCAAG ATTCTAAGGC TAAAGAAATG
GATGAACTTT TAAAAAGTGA AGAGTATAAA TTCTTAAGAA TGAGTATATC TTACGGTGAA
AGATTAGTTG ATAGAAATGA TGTATACACA AAAGACTCAT TAAAGATTTT AAGAACTGAA
TTAGCAAAAG CTAAAAAAGT ATTCCAAGAT AAAGATAATT TAACTATGAA GTTAGTTCAA
GATACAAGGG ATGATTTAGA AAGAGCTATA GTAGATTTAG TTAAGTTAAG TGATGTTAAG
AAGCCAGTTA AATCAAAAGC TAAGGTGGAA GTGAAGAAAG AGTCTAATAC TGAGAAAAAA
GAAGAAGTTA AACCTGTTGA AGCTTCAGAA GAAAAAGTAG AAGTTAAAGA AGAGCCTGTT
AAGGTAGAAG AAAAAGTAGA GGCTAAGAAA GAGGAAGTTC AAGATCCTAA GGCTAAAGAA
ATGGATGAGC TTTTAAAAAG TGAAGAGTAT AAGTTCTTAA GAATGAGTAT ATCTTACGGT
GAAAGATTAG TTGATAGAAA TGATGTATAC ACAAAAGACT CATTAAAGAT TTTAAGAACT
GAATTAGCAA AAGCTAAAAA AGTATTCCAA GATAAAGATA ATTTAACTAT GAAGTTAGTT
CAAGATACAA GAGATGATTT AGAAAGAGCT ATAGTAGATT TAGTTAAGCT AAGTGATGTT
AAGAAGCCAG TTAAATCAAA AGCTAAGGTG GAAGCTAAGA AAGAGTCTAA TACTGAGAAA
AAAGAAGAAG TTAAAGAACA ACCAGCTAAG GTTGAAGTAA TAGTTCCAGA AGTGAAGGAA
AATAAAGAGA TTAAGAAAGA GGAGCCAGTT AAAGCTAAAG AAGTAGTTAA AGAGAAGAAA
TCTAGAGAAG AAGAAATAGA AGAGATTCTA AAAAGTGAAG AATATAGATT TTTATTAAAA
GATATAGCTT ATGGTGAAAG ATTATCTGAG AAAATTACTG TTTACACAAG AGATTCATTA
AAAGTATTAA ATGCTGCCAT AGAAAAAGCT GGAAGAATAG TTAGACAAAA AGAAAAATTA
ACTGTAAAAT TAGTTCAAGA TACAAGAGAT GATGTAGATA GAGCTATAGT AGATTTAGTT
AAAAAAAATA ATTAA
 
Protein sequence
MKRKKLSIFI LSAAIVTSSV SYGMTAKADE VKETKGYNYN VNLSEQKAVA NNKISVKKVN 
GEILGYANPF TMLQILNMNG KTVEVITQSG LRGYVDANEF TLVESAVNDK LIEKNIEGHV
TNVSTVLNLR KEPRIGAEII NRLLNNTKVN ILGKQGSWYK IELNGQKGYV YGMFLNEGTM
KENSSKTIAN KKSEVKSEYK KEKKSSVKKE AKKEVVSGAK AKVAQKQREE AKANMTSEVK
PAINKETKSN NKVEVKSEVK KEAKNEVKTE MNSVTKAETN QVVTSEIVQA VKSEKNPETN
LQVRPEIKSE EKTEMISKEN LNVNPEAKKE GISEEIKPEV NTEVESKPEM NKEVKVEEAK
VEEKQEPKQE EVQDSKAKEM DELLKSEEYK FLRMSISYGE RLVDRNDVYT KDSLKILRTE
LAKAKKVFQD KDNLTMKLVQ DTRDDLERAI VDLVKLSDVK KPVKSKAKVE VKKESNTEKK
EEVKPVEASE EKVEVKEEPV KVEEKVEAKK EEVQDPKAKE MDELLKSEEY KFLRMSISYG
ERLVDRNDVY TKDSLKILRT ELAKAKKVFQ DKDNLTMKLV QDTRDDLERA IVDLVKLSDV
KKPVKSKAKV EAKKESNTEK KEEVKEQPAK VEVIVPEVKE NKEIKKEEPV KAKEVVKEKK
SREEEIEEIL KSEEYRFLLK DIAYGERLSE KITVYTRDSL KVLNAAIEKA GRIVRQKEKL
TVKLVQDTRD DVDRAIVDLV KKNN