Gene CPF_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1002 
Symbol 
ID4203190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1152107 
End bp1153333 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content37% 
IMG OID638081883 
Productamidohydrolase family protein 
Protein accessionYP_695448 
Protein GI110799528 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000070505 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAG AAGAATTACT AAAAGAAGCT AATTTACTTC AAGAAACTAT TGTTTCAAAC 
AGAAGATATT TACACTCACA TGCTGAGACA GGATTTGATT TAAAAAACAC TCTTGCCTTT
GTAAAAAAAG AGTTAATTGA TATGGGCTAT GAGCCAATAG AATGTGGAAA AGCAGGACTT
ATTGCTTTAG CAGGTGGAAA AAAGGAAGGA AAGGTTTTCT TAATAAGAGG TGACATGGAT
GCCTTACCAA TTAAAGAGGA ATCAGATGTT GAGTTTTCTT GCCAAAGTGG AAAAATGCAT
GCCTGTGGCC ATGATATGCA CACATCTATG ATGCTTGGTG CTGCAAGATT ATTAAAAAAA
CATGAGGATG AAATTGAAGG AACTGTTAAA CTTATGTTCC AACCTGCTGA GGAAATTTTT
GAAGGTTCTA AGGATATGAT TAAAGCAGGA GTATTAGAAA ATCCAAAGGT TGATCAGGCA
CTAATGATTC ATGTAATGGC AGGAATGCCT TTTAATGCAG GTACTGTTAT TGTTCCTGTA
CCTGGAATTG GAGCTCCAGC TGCTGATTAT TTTGAAATAA AGGTTCAAGG AAAAGGATGT
CATGGTTCTA TGCCTAATAC TGGAGTTGAC CCACTTAATG TAGCTGCTCA CATTTTAATT
GCATTACAAG AAATTCATGC TAGAGAACTT GCCATAAGCG ATCAAGCAAT TTTAACAATT
GGAACAATGA ATGCTGGTAT TGCTGCCAAT GTAATCCCTG ACACAGCTAC TATGGGTGGA
ACTATTCGTA CTTTTGATGA AGAAACACGC TCATTTATAA AGGAAAGAAT TGAAGAAATT
GCAGAATGTA CAGCAAAATC CTTTAGAGCT TCAGCTGAAG TAATTTGGGG AAGTGGATGC
CCAACCTTAG TTAACGATAA AGACTTAACT GTATGCTCTG AAAAATACAT AAAAGAACTT
TTAGGGGAAG ATAAAACATT CTCTGTTGCC AAACTAAATG CCATGGCTGG AAATCAAAAA
TCTGCTAAAA CTTCTGGTTC AGAAGACTTT GCTTACATAA GCCAAAAAAT TCCTGCCATC
ATGTTAGTTT TAGCAGCTGG AAACCCAGAT AAGGGCTATC CATACCCTCA ACACCACCCA
ATGGTTAAAT TTGATGAAGA AGTTTTATCT AGTGGTAGTG CAGTTTACGC CTACACAGCA
ATGCGTTGGC TTCAAGACCA TAAATAA
 
Protein sequence
MKPEELLKEA NLLQETIVSN RRYLHSHAET GFDLKNTLAF VKKELIDMGY EPIECGKAGL 
IALAGGKKEG KVFLIRGDMD ALPIKEESDV EFSCQSGKMH ACGHDMHTSM MLGAARLLKK
HEDEIEGTVK LMFQPAEEIF EGSKDMIKAG VLENPKVDQA LMIHVMAGMP FNAGTVIVPV
PGIGAPAADY FEIKVQGKGC HGSMPNTGVD PLNVAAHILI ALQEIHAREL AISDQAILTI
GTMNAGIAAN VIPDTATMGG TIRTFDEETR SFIKERIEEI AECTAKSFRA SAEVIWGSGC
PTLVNDKDLT VCSEKYIKEL LGEDKTFSVA KLNAMAGNQK SAKTSGSEDF AYISQKIPAI
MLVLAAGNPD KGYPYPQHHP MVKFDEEVLS SGSAVYAYTA MRWLQDHK