Gene CPF_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1645 
SymbollonB 
ID4201241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1859334 
End bp1861046 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content33% 
IMG OID638082522 
ProductATP-dependent protease LonB 
Protein accessionYP_696086 
Protein GI110798596 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR02902] ATP-dependent protease LonB 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACTT ATACATTTAT AATGTTTCTA CAATTACTTA TGTCAATTTT ATTCTATATA 
TATATGAGCA AGTCCTTTGC GAGTAAGAAG AAAGATAATA GTGTTTTAGA AAAAGAAAAT
GAAAAAGAAA TGGAAAAATT AAATAAATTA AGAATGATAA AACTGACAGA ACCTTTAACT
GAAAAAAGTA GACCAAGTAA TTTAGAAGAA ATAATAGGAC AGGAAAAGGG AATAAAAGCT
CTTAAAGCAG CACTTTGTGG GCCAAATCCA CAGCATGTAA TAATATATGG TCCGCCAGGG
GTAGGAAAAA CTGCAGCTGC TAGAATAATT TTAGAAGAGG CTAAGAAAAT GGCAGCATCT
CCTTTTAATA AGGACTCTAA ATTTGTTGAA ATAGATGCCA CAACTTTAAG ATTTGATGAG
AGGGGGATAG CAGATCCACT AATAGGTTCC GTTCATGATC CAATATATCA AGGAGCAGGT
TCCTTAGGGA TTGCAGGGGT TCCTCAACCT AAGCCAGGAG CTGTAACAAA GGCTCATGGA
GGAATACTTT TTATAGATGA AATAGGAGAA CTCCATCCTA TTGAATTAAA TAAACTTCTT
AAAGTTTTAG AGGATAGAAA AGTTTTTTTA GATTCAGCCT ATTATAGTTC AGAAGATCCC
AATACTCCTA GATATATAAA AGAAATATTT GATAATGGAT TACCAGCAGA TTTTAGATTA
ATTGGTGCAA CTACAAGAAG TCCAGAGGAA ATAGTGCCAG CTATAAGGTC AAGGTGCGTA
GAAATATTTT TTAGGGGGCT AACTGTTGAA GAGATTAGAA AAATTGCTTT AAATGCCACA
AATAAGGTTG GTTATAGAAT AAGTGATGAG GGATTAGACA TAGTATCTAG ATATTGTACT
AATGGAAGAG AAGTTATAAA CTTAGTGCAA TTATGTTCTG GCCTTGCAAT AAATGAAAAT
AGAGATTACA TAAAAGAGAG TGATATTTAT TGGGTTATTG AAAATGGTCA ATATAATCCT
AGAATGGAAA GAATGATAAA TGATAAACCT GAAATTGGGT ATGTAAATGG CTTAGCTGTG
TATGGAGCTA ACAATGGAGC TTTAATGGAA ATAGAAGCTA CAGCAAAGCT ATCAAGTAAT
AGTATAGGTA GTATAAAAAT TACTGGAATA GTTGATGATG AGGAACTAGG CGGTGGAGAG
AAGAAAATAA AGAGAAAAAG CACAGCATAT TGTTCTGTAC AGAATGTATT GACAGTATTA
GATAATATAT TTAATTTAAA TTCAAAGGCA TATGATATAC ATGTTAACTT TCCAGGCGGA
ATACCAGTAG ACGGTCCATC TGCTGGAATA AGTATAGCTA CAGCCATATA TAGTGCCATA
AAAGGAGTGC CTGTAAATAA TAGAGTGGCT ATGACTGGTG AGATATCAAT AAAGGGAAAG
GTAAAACCAA TAGGGGGAGT AAATGCAAAG ATATTAGCAG CAAAGAGAGC GGGAGTAGAA
TTGGTAATTG TTCCAAAGGA AAATTTAAGT AGTATAACTA GAGATATTGA TGGAATAAAG
ATAGTTGGTG TTAAGAAAAT TGAAGAGGTG TTAGATCTTG CACTTTATGA AGAAGAATGT
ATAGAAAAAG AGAGTTTAAT AATTAAAGAT AATAGGGCAT TTTTTGGTGC TGGTGCCTTA
AATGCAGAAT CTATAAAGAA AGCTAACACT TAA
 
Protein sequence
MNTYTFIMFL QLLMSILFYI YMSKSFASKK KDNSVLEKEN EKEMEKLNKL RMIKLTEPLT 
EKSRPSNLEE IIGQEKGIKA LKAALCGPNP QHVIIYGPPG VGKTAAARII LEEAKKMAAS
PFNKDSKFVE IDATTLRFDE RGIADPLIGS VHDPIYQGAG SLGIAGVPQP KPGAVTKAHG
GILFIDEIGE LHPIELNKLL KVLEDRKVFL DSAYYSSEDP NTPRYIKEIF DNGLPADFRL
IGATTRSPEE IVPAIRSRCV EIFFRGLTVE EIRKIALNAT NKVGYRISDE GLDIVSRYCT
NGREVINLVQ LCSGLAINEN RDYIKESDIY WVIENGQYNP RMERMINDKP EIGYVNGLAV
YGANNGALME IEATAKLSSN SIGSIKITGI VDDEELGGGE KKIKRKSTAY CSVQNVLTVL
DNIFNLNSKA YDIHVNFPGG IPVDGPSAGI SIATAIYSAI KGVPVNNRVA MTGEISIKGK
VKPIGGVNAK ILAAKRAGVE LVIVPKENLS SITRDIDGIK IVGVKKIEEV LDLALYEEEC
IEKESLIIKD NRAFFGAGAL NAESIKKANT