Gene CPF_1568 details


Gene Information

Locus tag: CPF_1568
Symbol:
ID: 4201671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1785241
End bp: 1786269
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 29%
IMG OID: 638082446
Product: glycosy hydrolase family protein
Protein accession: YP_696011
Protein GI: 110800996
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3757] Lyzozyme M1 (1,4-beta-N-acetylmuramidase)
TIGRFAM ID:
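
The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record says the gene is not spliced). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1785241
END_BP = 1786269


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1029 342
```

Under those assumptions the result agrees with the listed gene length of 1029 bp and protein length of 342 aa.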


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00668816
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAGTA GAAACAATAA TAATTTAAAA GGAATTGATG TATCAAACTG GAAAGGAAAT 
ATAAATTTTG AGAGTGTAAA AAATGATGGC GTAGAAGTAG TTTATATTAA AGCTACAGAA
GGTAATTACT TTAAGGATAA ATACGCTAAA CAAAATTATG AGGGAGCAAA AGAACAAGGA
TTAAGTGTAG GGTTTTACCA TTTCTTTAGA GCTAATAAAG GGGCTAAGGA TCAAGCTAAT
TTCTTTATAG ATTATTTAAA TGAAATAGGA GCTGTTAATT ATGATTGTAA ATTAGCTTTA
GATATAGAAA CTACTGAAGG AGTAGGAGCA AGAGATTTAA CATCTATGTG TATAGAATTT
TTAGAAGAGG TAAAAAGACT TACAGGAAAA GAAGTTGTTG TATATACTTA TACAAGTTTT
TCAAATAATA ATTTAGATAG TAGATTATCT AATTATCCAG TTTGGATAGC ACATTATGGG
GTGAACACTC CTGGAGCTAA TAATATTTGG AGTGAATGGG TTGGATTTCA ATATTCAGAG
AATGGAAGTG TAGATGGTGT AAGCGGTGGA TGTGATATGA ATGAGTTTAC AGAAGAAATA
TTTATTGATT CAAGTAACTT TAATTTAGAT AATGCTACTA CTAAAAATGT AAGCACTAAA
TTAAATATAA GAGCTAAAGG AACTACTAAT TCTAAAGTAA TTGGTTCAAT ACCAGCAAAT
GAAACCTTTA AAATAAAATG GGTTGATGAA GATTATCTTG GTTGGTATTA CGTTGAGTAT
AATGGAATAG TTGGTTATGT AAATGCAGAT TATGTAGAAA AGCTACAAAT GGCTACTACT
CATAATGTAA GTACTTTTTT AAATGTAAGA GAAGAAGGAT CATTAAATTC TAGAATAGTA
GATAAGATAA ATGCAGGTGA TATTTTTAGA ATAGATTGGG TGGATTCCGA TTTTATAGGT
TGGTATAGAG TAACAACTAA AAATGGAAAA GTTGGATTTG TTAATTCTGA ATTTGTTAAG
AAGATCTAA
 
Protein sequence
MQSRNNNNLK GIDVSNWKGN INFESVKNDG VEVVYIKATE GNYFKDKYAK QNYEGAKEQG 
LSVGFYHFFR ANKGAKDQAN FFIDYLNEIG AVNYDCKLAL DIETTEGVGA RDLTSMCIEF
LEEVKRLTGK EVVVYTYTSF SNNNLDSRLS NYPVWIAHYG VNTPGANNIW SEWVGFQYSE
NGSVDGVSGG CDMNEFTEEI FIDSSNFNLD NATTKNVSTK LNIRAKGTTN SKVIGSIPAN
ETFKIKWVDE DYLGWYYVEY NGIVGYVNAD YVEKLQMATT HNVSTFLNVR EEGSLNSRIV
DKINAGDIFR IDWVDSDFIG WYRVTTKNGK VGFVNSEFVK KI
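
The gene and protein sequences above can be checked against each other. The sketch below uses only the Python standard library and assumes the gene sequence is given 5' to 3' in reading frame 1. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and only the first line of each sequence is used to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence_text: str) -> str:
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(sequence_text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein sequence.
first_dna_line = "ATGCAAAGTA GAAACAATAA TAATTTAAAA GGAATTGATG TATCAAACTG GAAAGGAAAT"
first_protein_blocks = "MQSRNNNNLK GIDVSNWKGN"

print(translate(first_dna_line) == clean(first_protein_blocks))  # True
```

Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the listed GC content and the complete protein sequence.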