Gene Information
Locus tag | CPF_1002 |
Symbol | |
ID | 4203190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1152107 |
End bp | 1153333 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638081883 |
Product | amidohydrolase family protein |
Protein accession | YP_695448 |
Protein GI | 110799528 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
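The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length = gene_length(1152107, 1153333)
print(length)                  # 1227
print(protein_length(length))  # 408
```

Both results agree with the Gene Length (1227 bp) and Protein Length (408 aa) rows.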
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000070505 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG AAGAATTACT AAAAGAAGCT AATTTACTTC AAGAAACTAT TGTTTCAAAC AGAAGATATT TACACTCACA TGCTGAGACA GGATTTGATT TAAAAAACAC TCTTGCCTTT GTAAAAAAAG AGTTAATTGA TATGGGCTAT GAGCCAATAG AATGTGGAAA AGCAGGACTT ATTGCTTTAG CAGGTGGAAA AAAGGAAGGA AAGGTTTTCT TAATAAGAGG TGACATGGAT GCCTTACCAA TTAAAGAGGA ATCAGATGTT GAGTTTTCTT GCCAAAGTGG AAAAATGCAT GCCTGTGGCC ATGATATGCA CACATCTATG ATGCTTGGTG CTGCAAGATT ATTAAAAAAA CATGAGGATG AAATTGAAGG AACTGTTAAA CTTATGTTCC AACCTGCTGA GGAAATTTTT GAAGGTTCTA AGGATATGAT TAAAGCAGGA GTATTAGAAA ATCCAAAGGT TGATCAGGCA CTAATGATTC ATGTAATGGC AGGAATGCCT TTTAATGCAG GTACTGTTAT TGTTCCTGTA CCTGGAATTG GAGCTCCAGC TGCTGATTAT TTTGAAATAA AGGTTCAAGG AAAAGGATGT CATGGTTCTA TGCCTAATAC TGGAGTTGAC CCACTTAATG TAGCTGCTCA CATTTTAATT GCATTACAAG AAATTCATGC TAGAGAACTT GCCATAAGCG ATCAAGCAAT TTTAACAATT GGAACAATGA ATGCTGGTAT TGCTGCCAAT GTAATCCCTG ACACAGCTAC TATGGGTGGA ACTATTCGTA CTTTTGATGA AGAAACACGC TCATTTATAA AGGAAAGAAT TGAAGAAATT GCAGAATGTA CAGCAAAATC CTTTAGAGCT TCAGCTGAAG TAATTTGGGG AAGTGGATGC CCAACCTTAG TTAACGATAA AGACTTAACT GTATGCTCTG AAAAATACAT AAAAGAACTT TTAGGGGAAG ATAAAACATT CTCTGTTGCC AAACTAAATG CCATGGCTGG AAATCAAAAA TCTGCTAAAA CTTCTGGTTC AGAAGACTTT GCTTACATAA GCCAAAAAAT TCCTGCCATC ATGTTAGTTT TAGCAGCTGG AAACCCAGAT AAGGGCTATC CATACCCTCA ACACCACCCA ATGGTTAAAT TTGATGAAGA AGTTTTATCT AGTGGTAGTG CAGTTTACGC CTACACAGCA ATGCGTTGGC TTCAAGACCA TAAATAA
Protein sequence | MKPEELLKEA NLLQETIVSN RRYLHSHAET GFDLKNTLAF VKKELIDMGY EPIECGKAGL IALAGGKKEG KVFLIRGDMD ALPIKEESDV EFSCQSGKMH ACGHDMHTSM MLGAARLLKK HEDEIEGTVK LMFQPAEEIF EGSKDMIKAG VLENPKVDQA LMIHVMAGMP FNAGTVIVPV PGIGAPAADY FEIKVQGKGC HGSMPNTGVD PLNVAAHILI ALQEIHAREL AISDQAILTI GTMNAGIAAN VIPDTATMGG TIRTFDEETR SFIKERIEEI AECTAKSFRA SAEVIWGSGC PTLVNDKDLT VCSEKYIKEL LGEDKTFSVA KLNAMAGNQK SAKTSGSEDF AYISQKIPAI MLVLAAGNPD KGYPYPQHHP MVKFDEEVLS SGSAVYAYTA MRWLQDHK
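The relationship between the two sequence rows can be checked by translating the gene sequence. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (it begins with ATG although the gene lies on the minus strand), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The name `translate` is a hypothetical helper.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record
print(translate("ATGAAACCAG AAGAATTACT AAAAGAAGCT"))  # MKPEELLKEA
```

The output matches the first ten residues of the protein sequence row; passing the full gene sequence should reproduce the whole protein if the record is internally consistent.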