Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0041 |
Symbol | |
ID | 4203684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 46559 |
End bp | 48031 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638080916 |
Product | hypothetical protein |
Protein accession | YP_694508 |
Protein GI | 110799806 |
COG category | [S] Function unknown |
COG ID | [COG0397] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATA AAAATTTTCA ATCAAAAACA GGTTTTAACT TAGAAAACAC TTATTTAACT CTTCCAAATA TATTCTTTAG TGAACAAAAT CCAAAGGGTT CAAAAAATCC TAAACTTATT AAGTTTAATA CTTCTCTTGC TGAAGAACTT GGGCTTAATG AAGAAGTTTT AAACAGCGAT TTTGGACTTA ATATATTTGC AGGAAATGAA ACCTTCCCAG GAATTGTACC AATTGCACAG GCTTATGCTG GGCACCAATT TGGTCATTTT ACAATGCTAG GAGATGGTAG AGCACTCTTA TTAGGAGAAC ATGTAACTAA AGATAGTAAA AGATATGATG TTCAATTAAA AGGGTCTGGT AGAACAATCT ATTCAAGAGG TGGAGATGGA AAGGCTGCCC TTGCACCTAT GCTTAGAGAA TATATAATAA GTGAAGGTAT GCATGGTCTT GGAATTCCTA CAACTAGAAG CCTTGCTGTA GTAAATACTG GTGAGGAAGT TTTAAGAGAA AGATTTGAAC AAGGTGCTAT ATTAACAAGA ATAGCTTCTA GTCACATTAG AGTTGGAACT TTTGCTTATG CAGCTCAATG GGGAACTTTA GAAGATCTTA AAAGTCTTGC TGACTATACT ATTAAAAGAC ACTTTCCTAA TATAGCTAAG AGTGAAAATA AATATATTTT ATTTCTTGAA GAGGTAATAA ATCGTCAAGC TGAACTTATA GTTAAGTGGC AAAGTGTTGG ATTCATTCAT GGGGTTATGA ACACTGATAA TATGGTAATC TCAGGAGAAA CTATAGATTA TGGACCATGT GCATTTATGG ATACTTATGA TACAAACACA GTATTTAGTT CCATTGATTA TGCTGGTAGA TATGCTTATG GAAATCAACC TAACATGGCT TTATGGAACT TAGCTAGATT CTCAGAAGCA CTACTTCCTC TTCTAAACCC TAACCTAGAT GAGGCTGTTA ATATTGCTAA AAAGTCCATA TCAAACTTTT CTAAACTATA TAAAAAATAT TGGTTCAATA AAATGAGAGC TAAACTTGGT CTTTTCACAG AAAAAGAAAA TGATGAATTG CTAATTGAAG GGCTTTTAAG CACAATGCAA AAATATGAAG CAGATTTTAC TAATACCTTT GTATCTTTAA CTCTTAATAA ATTTGAAGAT GAAAAAGTAT TTAGTAGTGA TGAATTCAAA ACTTGGTATG CTCTTTGGCA AAATAGATTA AAAGAAGAAA ATAGATCACA GGAAGAAGTA AGGAATTTAA TGATGAATAA TAATCCTTAT ATAATTCCTA GAAATCACTT AGTTGAAAAA GCTCTTAAAA ATGCTGAAAA AGGTGATTTT ACTTTTATGG ATAATCTATT AGAAGCACTA AAGAATCCTT ATAGTTATTC TAAAGATTTA GAAAAGTACA CTAAGTTACC TGAGAAAAGT GACACTCCTT ATGTAACATA TTGTGGAACT TAA
|
Protein sequence | MDNKNFQSKT GFNLENTYLT LPNIFFSEQN PKGSKNPKLI KFNTSLAEEL GLNEEVLNSD FGLNIFAGNE TFPGIVPIAQ AYAGHQFGHF TMLGDGRALL LGEHVTKDSK RYDVQLKGSG RTIYSRGGDG KAALAPMLRE YIISEGMHGL GIPTTRSLAV VNTGEEVLRE RFEQGAILTR IASSHIRVGT FAYAAQWGTL EDLKSLADYT IKRHFPNIAK SENKYILFLE EVINRQAELI VKWQSVGFIH GVMNTDNMVI SGETIDYGPC AFMDTYDTNT VFSSIDYAGR YAYGNQPNMA LWNLARFSEA LLPLLNPNLD EAVNIAKKSI SNFSKLYKKY WFNKMRAKLG LFTEKENDEL LIEGLLSTMQ KYEADFTNTF VSLTLNKFED EKVFSSDEFK TWYALWQNRL KEENRSQEEV RNLMMNNNPY IIPRNHLVEK ALKNAEKGDF TFMDNLLEAL KNPYSYSKDL EKYTKLPEKS DTPYVTYCGT
|
| |