Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0274 |
Symbol | |
ID | 4203147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 329040 |
End bp | 330338 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638081161 |
Product | SagA protein |
Protein accession | YP_694734 |
Protein GI | 110799311 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) [COG3883] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAATAATTTC AACAGTTCTT GCTGCTACAA TGACATTTGG AGTAGGACAT AGTGTATTCG CAACTCCACT TACAGATGAT CAAAAGCAAC AAATGGAACA AAGTCAAGAT AAATATGCAG ATATAAATAG TAAGATAAGA GAGTTAGAAG ATAAGATAGA TGGGTTATCA GCTAAAATAG AACCTTTATT TTTTCAAGTA GAAAAAAATA AAGAAGAAAT ATCTAAAACA GAGGAATCAA TTTCAACAGT TAAAGTTCAA ATTGAGGAAT CAAAGAAGAA AATAGAAAAA CAACAAGAAG TTTTAGGTCA AAGAATAAGA GCAACATATA AAAGTGGTGG ACAAGCAAAC TACTTAACTG CTTTATTAGA TTCTAATGGA ATTGGTGACT TCTTATCAAG AGTACAAGCT ATAAGCAAAG TAATGGGTAT GGATAAGCAA GTTATTGATG AGTTAACTTC TGAAAAGGAA AAGCTTGATA GCCAAGTTAA AGAATTAGAA GATAAGACTG CCGAATTAAA TAAGCTTAAT TCAGAAACTC AATCAAAAAT AGATGAGTTA AACAAAATGA AAGCTGAACA AGAAGGCGCT ATAAAAGATA TGAAAGCTGA GCAAGAAAAA GTTGTTGGAG AATTAGCTCC ATTAGAAAGA CAGTTAATTG AGCCATGGAC TTCTAAAATA AATTCAAATA GTTCAGTAAA TGATTTAAAT CAAGCTGTGA CAGCCTTAAG AGGTTTAAGA AATCAAATAA AAACTCCAGA GGTAGATTCA GAAGCTGTTC AAGCTATAGA AAAAGCTAAA GATTTAATCG AAACTAAAAA AGCAGAAGAA GCAGTAAGTA GCGCACCTAA CAGAGGTGGA GATGTTAATT CTGGTGGAAG TTCATCTTCA ACTAGTAATG GAAGTTCATC ATCAAATAGT GGAAGCACAG TAGCTCCTCC AAGTGAAGGG GCAGCTTCAG CAGTAGTATC ATATGCATAT CAGTTTATAG GAAGACCATA TGTATTTGGT GCAACTGGTC CAGATTCATT TGACTGTTCA GGATTTACTA GTTATGTTTA TAGAAATGCA GTAGGTAGAG AAATTACTAG AACTACTTAT ACACAAATAA ATCAAGGTAG ACCTGTATCA AGAGATCAGT TACAACCAGG AGATTTAGTA TTTACTAATG GTGTAGGACA CGTTGGAATA TATGTAGGTG GTGGACAAAT GATTCACGCT GCTAGACCAG GTGTTGGTGT AATAGTAGGA CCTATATATA ATTTCTCATC AGCAAGAAGA ATATTATAA
|
Protein sequence | MKKKIISTVL AATMTFGVGH SVFATPLTDD QKQQMEQSQD KYADINSKIR ELEDKIDGLS AKIEPLFFQV EKNKEEISKT EESISTVKVQ IEESKKKIEK QQEVLGQRIR ATYKSGGQAN YLTALLDSNG IGDFLSRVQA ISKVMGMDKQ VIDELTSEKE KLDSQVKELE DKTAELNKLN SETQSKIDEL NKMKAEQEGA IKDMKAEQEK VVGELAPLER QLIEPWTSKI NSNSSVNDLN QAVTALRGLR NQIKTPEVDS EAVQAIEKAK DLIETKKAEE AVSSAPNRGG DVNSGGSSSS TSNGSSSSNS GSTVAPPSEG AASAVVSYAY QFIGRPYVFG ATGPDSFDCS GFTSYVYRNA VGREITRTTY TQINQGRPVS RDQLQPGDLV FTNGVGHVGI YVGGGQMIHA ARPGVGVIVG PIYNFSSARR IL
|
| |