Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1475 |
Symbol | |
ID | 4203700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1671824 |
End bp | 1673158 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 638082353 |
Product | amidohydrolase domain-containing protein |
Protein accession | YP_695918 |
Protein GI | 110799822 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0938308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TAGTAACTTT AATAAAAAAT GTTACAGTTA TTACTATGAA TGAACATAAA GAAATTATAG AAAATGGTTT AGTTGTTTTT GAAAAAAATA AAATAGTATA TGTTGGAACT GATGTAAGGA CAGAAGAAAA ATTAAAAAGA AGTGGATATA AAGTTGAGGT AATAGATGGT GAAGAGGGAA TATTAATGCC AGGTATGATA AACTGTCATA CTCATGGGTC TATGGTTCCT TTTAGAAGTT TAGCTGACGA TTGTAAGGAT AGATTAAAGA GATATTTATT TCCCTTAGAA CAAAGATTAG TTGATAAAGA ATTGACTTAT ATAGGAGCAA AGTATGCTAT AGCAGAAATG TTATTAGGTG GAGTAACTAC CTTTTGTGAT ATGTATTATT TTGAAGATGA GGTAGCGAAA GCTGCTAAAG AATTAAATAT GAGAGGAGTG CTTTGTGAGA CAATAGTAGA TTTTCCTTCG CCAGATTCAG AAAAGGCTTT TGGAGGGATA GATTATTCCA TAAGATTTAT AGAAAAATGG AAAAATGATG ATTTGATAAC TCCAGGAATA GCTCCTCATG CACCTTATAC AAATACAGAG GAATCATTAA AGGAAGCATA TAAAATTAGT AAAAAATATG ATGTTCCTAT AACTATGCAT TTAGCAGAAA TGGACTATGA GTTAGAGGAA TATAAAAATA AATATAATCT TACTCCAGTT TCTTATTTAG ATAAGCTTGG GGTTTTAAAC TCTAATTTTA TAGCGGCCCA TGCAGTTTTA GTTAATGAAG AAGATATAGA AATTTTAAAG AAGAATAATG TTAATATATC TCATAATATA GGGGCAAATT CTAAAGGAGC TAAAGGAATA GCACCAATAT TAAAGATGAG AGAAAAAGGA ATAAATATTG GACTTGGAAC AGATGGTCCT ATGAGTGGAA ATACTTTAGA TATATTAAGC CAGATGTCAC AAGTGGGAAA AATTCATAAA TTATTTAATA AAGATAGAAC TTTACTACCA TCAATAGATT TAATAGAAAT GGGAACCATA GGTGGAGCTA AAGTTTTAGG AATTGATAAA GAAGTTGGAT CTATAGAGGT TGGTAAAAAA GCTGATTTAA CTTTAATAGA AACAAAATCA GTAAATATGC AACCTATATA TGATTATTAT GCAACAATAG TATATTCTGC TAATTCTAGT AATGTAGAAT TAGTAGTTGT TGATGGTAAG ATTGTTGTTA AGGATAAAAA ATTAGTAAGT GCTAGCTTTT CTGAAGTAAG AAAAGATTTA CTAGGTTTAA CAGAGAAAAT AAAGAAAATT TCTAAAGAAC TTTAG
|
Protein sequence | MSKIVTLIKN VTVITMNEHK EIIENGLVVF EKNKIVYVGT DVRTEEKLKR SGYKVEVIDG EEGILMPGMI NCHTHGSMVP FRSLADDCKD RLKRYLFPLE QRLVDKELTY IGAKYAIAEM LLGGVTTFCD MYYFEDEVAK AAKELNMRGV LCETIVDFPS PDSEKAFGGI DYSIRFIEKW KNDDLITPGI APHAPYTNTE ESLKEAYKIS KKYDVPITMH LAEMDYELEE YKNKYNLTPV SYLDKLGVLN SNFIAAHAVL VNEEDIEILK KNNVNISHNI GANSKGAKGI APILKMREKG INIGLGTDGP MSGNTLDILS QMSQVGKIHK LFNKDRTLLP SIDLIEMGTI GGAKVLGIDK EVGSIEVGKK ADLTLIETKS VNMQPIYDYY ATIVYSANSS NVELVVVDGK IVVKDKKLVS ASFSEVRKDL LGLTEKIKKI SKEL
|
| |