Gene CPF_0394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0394 
Symbol 
ID4203299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp473690 
End bp476701 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content30% 
IMG OID638081278 
Productpolysaccharide lyase family protein 8 
Protein accessionYP_694851 
Protein GI110801222 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAT CTAGAAAAAA ACTTAACAGA CTTATATCAA CGGCAATTGT AAGTTGTATG 
GTTATGGGAG CTTCCTATGA GAATGTTTTG GCACTTGAAA ATAAGAGTCA AAATAATTCT
GAAAAGGTAG TGAAATATGA CAACGAAGCT GAGTATATTA AAAACATTAG ATTAAGATGG
AAGGAGGATT TAGTAGGTAA TTCTTCTTTA GATACAAGTA ATGCTACAAT ATCTAAGAAG
ATAATAAGTT ATACAAATAA TACAGACAAA TTAGTAGCAA AACTTAATAT GGATCCTAAC
GCTCAATGGC TTTGGGAAGA TTTAAAAGAT TATAAACAAA ATCCAGCTAG AATAACATCT
ATGTTCAATA ACTTAGTAAC AATGTCTATG GCATATAGCT TACCTAATAA TAAGTATTAC
AAAAATGAAG ATTTGAAAAA TAAGATAATA TATTCCTTAG ACTGGATCAA TAAAAATGCT
TACAATGAGA ATATTGACCA GTATGGCAAC TGGTGGGATT GGATGATTGG AATACCAGCA
AGATTAAATA ATATTGTTGT TTTAATGTAT GATGATTTAA CTGAAGAACA AGTTAAAAAT
TATATGAATG CAATACAAAA ATTCTTACCA AGTATTGAGC CTGGAAGTAA ATATCATACA
GGAGCAAATT TAGCAGATGT ATGTATGAAT AAGTTATTGC AAGGTGTCAA TGAGAATGAT
CCAGAAAAAA TTAAAGAAGC ATCAGAGGAT ATAGTTGGAG TTTTTGATTA TGTAACTAGT
GGGGATGGAT TCTATAAGGA TGGTTCATAT CTTCAACATG GAATGGTAGC ATACACAGGT
TCATATGGAA ATGTTCTTAT TGAGAAAATA TCTAATATAA TGTTTTTATT AGAAAAAACT
CCATGGTCAA TAAAATCTGA AAGTAAAGAT AACGTTTATA AGTGGATTTT TGAGAGTTTC
AATCCAATTA TATATAAAGG ATACGTTATG GACATGGTTA GAGGAAGAGC AATATCAAGA
TATAATGCTA ATGGATACTT ACAAGCATCT GGAATTATTG AAGGTATGAT TAAAATTGGA
ATGATTTCTG ATGGAGATAA GGCTAGTGAA ATAAATGCTC TAGTTAAAAA ATGGGCTACA
GAAGCTAAGA GTGTATTAGA TTTTGGAGCA AGATTTAAGT CAATTAATGT AATAGGTGAA
TTCTATGGAA TTATGAATAA TGACAATATA AAACCTTTAG AAGAAGGTGA TAAGCATTAT
GCATTAAATA GTATGGACAA GACTGTTCAT AAAAGAGAGA ACTTCGCTTT AGGTATATCA
AGAAGTTCAA GTAGAATTAG TAAATATGAA TTCATGAACA AGGAAAATTT AACACCATGG
TTCCAAGGGG ATGGAATGAC TTATTTATTC AATAATGATT TAAATCAATT CTCAGGAAAT
TTTTGGGCTA CAGTAGATCC ATATAGAATG CCAGGTACAA CTGTAGACAC TAGAAAAAGA
GAACCAAAAG AAATATTACC AGGGTTAGAT CCAGGAGCAT CACAACAAAA TGAAATTTAT
TATGAATTAG GAAAGAGTAA TTGGTCTGGT GGAAGTAAGT TAGGAGCTTA CGGAGTAGCT
GGAATGGAAA TAGATAATAA GTACGATTCC TTAAAAGCTA AGAAATCTTG GTTTATGTTT
GATGATGAAA TAGTTGCCTT AGGTTCAGGA ATAACTAATC CAGAAGATTT TGAAACTGAA
ACAATAGTTG AAAATAGAAA GATAAAAAGT GATGGATCAA ATAAATTTAT AGTAGATGGA
AAAGAAAGAG TAAGTAAATT AAAAGAAAAA GATAAAGTTG ATAATGCAAA ATGGGCTTAC
TTAGAAGGAA ATGTAAGTGG ATCAAATATA GGATATTATT TCCCAGAGGG AGCAAATATT
AATTTAATAA AAGATGAAAG AGAAGGTAAT TGGATTAATG TAAACTCTTC TAAACCAGAA
GCAGATAAGG TGGTTAAAGA TAATTACTTA ACTATGTATA TAGATCATGG AAAAGCTATA
AAAGATCAAA AATATAGTTA TGTATTATTA CCAAATAAGA CTGAGGATAA GGTAAAAGAA
TATTCTGAGA ATCCAAATGT TGAAATTATT CAAAATGATG ATGTAGCTCA TAGTGTTAAG
CATAAAAAAT TAAATATTGA AGCAGCTAAC TTCTGGAAAG AGGGAAAAAA TACTGCTGGA
AATATAACAT CAACAGGAAA ATCATCTATA ATAATAAAAG AAAATAAAGA TAATACCTTA
AGCATAGCTG TGTCAGATCC AACTTTCTTA GAAAAAAATC TTTCTATAGA AATAAATAAA
CCAGCAATGG AAGTAATAAA ATCAGATAAA AGAATATCAA ATATAAATTT AGAAAATGGA
AAAATAAAAT TTGATGCAAA CACAGAAAAC CTTTCAGGGG CACCTTTAGA GCTTCTTGTA
AAATTAGGCG ACAAAAATAA TGGAAACAAT GAAAATAATA ATGAAATTAA AAATGAAGCT
CCTGTAATAG AAGGAGAAGA TGCTAATTTA TTTGTAGGAG ATAAGTGGGA TAAATCTCTT
CACAAACTTA AGGCTAAAGA TAAGGAGGAT GGAGATTTAA CTAAAAATAT TAAGATTAAG
GATAATCAAA TTCCTTTAAA TGATCAATTT GAAGTTACAA AGCCTGGGAC ATATCCAGTT
ACTTTTGAAG TAAGTGATAA TAATGGGAAA AAAGCAGAGA AAAAGCTTAA TGTTTTAGTT
AAAGAAAAGG AAGAGAACAA GCCAGAAAAT AAACCGGAAA ATCAAGAGAA TAAACCAGAG
ATTAAACCAG AGGATCAAGA AAATCAATCA ACAAAACCAG AAAGTGAAGA AAATAAAGGG
GAAAATCCAC AAGCAAATAA TAATACTGAA AAGCTACCTA ACACTGGAGG AGCAAGTAAT
CTAAGCCTTG GAGCAATAGG TGTTCTTCTA GCTACTGTTG GAACAATGTT TACTAAGAAA
AGAAAAAAAT AA
 
Protein sequence
MKESRKKLNR LISTAIVSCM VMGASYENVL ALENKSQNNS EKVVKYDNEA EYIKNIRLRW 
KEDLVGNSSL DTSNATISKK IISYTNNTDK LVAKLNMDPN AQWLWEDLKD YKQNPARITS
MFNNLVTMSM AYSLPNNKYY KNEDLKNKII YSLDWINKNA YNENIDQYGN WWDWMIGIPA
RLNNIVVLMY DDLTEEQVKN YMNAIQKFLP SIEPGSKYHT GANLADVCMN KLLQGVNEND
PEKIKEASED IVGVFDYVTS GDGFYKDGSY LQHGMVAYTG SYGNVLIEKI SNIMFLLEKT
PWSIKSESKD NVYKWIFESF NPIIYKGYVM DMVRGRAISR YNANGYLQAS GIIEGMIKIG
MISDGDKASE INALVKKWAT EAKSVLDFGA RFKSINVIGE FYGIMNNDNI KPLEEGDKHY
ALNSMDKTVH KRENFALGIS RSSSRISKYE FMNKENLTPW FQGDGMTYLF NNDLNQFSGN
FWATVDPYRM PGTTVDTRKR EPKEILPGLD PGASQQNEIY YELGKSNWSG GSKLGAYGVA
GMEIDNKYDS LKAKKSWFMF DDEIVALGSG ITNPEDFETE TIVENRKIKS DGSNKFIVDG
KERVSKLKEK DKVDNAKWAY LEGNVSGSNI GYYFPEGANI NLIKDEREGN WINVNSSKPE
ADKVVKDNYL TMYIDHGKAI KDQKYSYVLL PNKTEDKVKE YSENPNVEII QNDDVAHSVK
HKKLNIEAAN FWKEGKNTAG NITSTGKSSI IIKENKDNTL SIAVSDPTFL EKNLSIEINK
PAMEVIKSDK RISNINLENG KIKFDANTEN LSGAPLELLV KLGDKNNGNN ENNNEIKNEA
PVIEGEDANL FVGDKWDKSL HKLKAKDKED GDLTKNIKIK DNQIPLNDQF EVTKPGTYPV
TFEVSDNNGK KAEKKLNVLV KEKEENKPEN KPENQENKPE IKPEDQENQS TKPESEENKG
ENPQANNNTE KLPNTGGASN LSLGAIGVLL ATVGTMFTKK RKK