Gene CPF_0274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0274 
Symbol 
ID4203147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp329040 
End bp330338 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content33% 
IMG OID638081161 
ProductSagA protein 
Protein accessionYP_694734 
Protein GI110799311 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
[COG3883] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA AAATAATTTC AACAGTTCTT GCTGCTACAA TGACATTTGG AGTAGGACAT 
AGTGTATTCG CAACTCCACT TACAGATGAT CAAAAGCAAC AAATGGAACA AAGTCAAGAT
AAATATGCAG ATATAAATAG TAAGATAAGA GAGTTAGAAG ATAAGATAGA TGGGTTATCA
GCTAAAATAG AACCTTTATT TTTTCAAGTA GAAAAAAATA AAGAAGAAAT ATCTAAAACA
GAGGAATCAA TTTCAACAGT TAAAGTTCAA ATTGAGGAAT CAAAGAAGAA AATAGAAAAA
CAACAAGAAG TTTTAGGTCA AAGAATAAGA GCAACATATA AAAGTGGTGG ACAAGCAAAC
TACTTAACTG CTTTATTAGA TTCTAATGGA ATTGGTGACT TCTTATCAAG AGTACAAGCT
ATAAGCAAAG TAATGGGTAT GGATAAGCAA GTTATTGATG AGTTAACTTC TGAAAAGGAA
AAGCTTGATA GCCAAGTTAA AGAATTAGAA GATAAGACTG CCGAATTAAA TAAGCTTAAT
TCAGAAACTC AATCAAAAAT AGATGAGTTA AACAAAATGA AAGCTGAACA AGAAGGCGCT
ATAAAAGATA TGAAAGCTGA GCAAGAAAAA GTTGTTGGAG AATTAGCTCC ATTAGAAAGA
CAGTTAATTG AGCCATGGAC TTCTAAAATA AATTCAAATA GTTCAGTAAA TGATTTAAAT
CAAGCTGTGA CAGCCTTAAG AGGTTTAAGA AATCAAATAA AAACTCCAGA GGTAGATTCA
GAAGCTGTTC AAGCTATAGA AAAAGCTAAA GATTTAATCG AAACTAAAAA AGCAGAAGAA
GCAGTAAGTA GCGCACCTAA CAGAGGTGGA GATGTTAATT CTGGTGGAAG TTCATCTTCA
ACTAGTAATG GAAGTTCATC ATCAAATAGT GGAAGCACAG TAGCTCCTCC AAGTGAAGGG
GCAGCTTCAG CAGTAGTATC ATATGCATAT CAGTTTATAG GAAGACCATA TGTATTTGGT
GCAACTGGTC CAGATTCATT TGACTGTTCA GGATTTACTA GTTATGTTTA TAGAAATGCA
GTAGGTAGAG AAATTACTAG AACTACTTAT ACACAAATAA ATCAAGGTAG ACCTGTATCA
AGAGATCAGT TACAACCAGG AGATTTAGTA TTTACTAATG GTGTAGGACA CGTTGGAATA
TATGTAGGTG GTGGACAAAT GATTCACGCT GCTAGACCAG GTGTTGGTGT AATAGTAGGA
CCTATATATA ATTTCTCATC AGCAAGAAGA ATATTATAA
 
Protein sequence
MKKKIISTVL AATMTFGVGH SVFATPLTDD QKQQMEQSQD KYADINSKIR ELEDKIDGLS 
AKIEPLFFQV EKNKEEISKT EESISTVKVQ IEESKKKIEK QQEVLGQRIR ATYKSGGQAN
YLTALLDSNG IGDFLSRVQA ISKVMGMDKQ VIDELTSEKE KLDSQVKELE DKTAELNKLN
SETQSKIDEL NKMKAEQEGA IKDMKAEQEK VVGELAPLER QLIEPWTSKI NSNSSVNDLN
QAVTALRGLR NQIKTPEVDS EAVQAIEKAK DLIETKKAEE AVSSAPNRGG DVNSGGSSSS
TSNGSSSSNS GSTVAPPSEG AASAVVSYAY QFIGRPYVFG ATGPDSFDCS GFTSYVYRNA
VGREITRTTY TQINQGRPVS RDQLQPGDLV FTNGVGHVGI YVGGGQMIHA ARPGVGVIVG
PIYNFSSARR IL