Gene CPF_1415 details

Gene Information

Locus tag: CPF_1415
Symbol:
ID: 4202577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1595698
End bp: 1596894
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 33%
IMG OID: 638082295
Product: amidohydrolase family protein
Protein accession: YP_695860
Protein GI: 110799653
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
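
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1595698
end_bp = 1596894

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result matches the listed 1197 bp and 398 aa.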


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.486637
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAA ATTTAATGGA TGAAGCTCAA GAATTAAAAG ACTTACTTGT AGCTTTAAGA 
AGAGATTTTC ATGAAAATCC TGAATTAGGT TTTGAAGAGT GGAGAACTTC AGGAAAAATA
AAGGAATTTT TAGCTAATGA AGGTATTGAA TATATAGAAA CTGCTAAAAC AGGAGTATGT
GGCATAATAA AGGGAACACT AAAGGATGAA TCTAAAAAAG ATAGATGCAT AGCTTTAAGA
GCTGATATTG ATGGTCTTCC TATGGATGAT AAAAAGACTT GTTCATATTC ATCAAAGGTT
AAAGGAAGAA TGCATGCTTG TGGACATGAT GCCCATACAA CAATATTATT AGGTGCAGCT
AAATTATTAA GTAGACATAG AGATAAGTTT AGTGGTACTG TTAAGTTACT CTTTGAACCA
GCAGAGGAGA CAACAGGCGG AGCTCCTATA ATGATAGAAG AAGGGGTTTT AGAAAATCCT
AGAGTAGAAA AAATAATAGG CCTTCATGTT GAAGAAACTT TAGATGCTGG ACAAATAATG
ATAAAAAAAG GAGTAGTTAA TGCAGCATCT AATCCTTTCA CAATAAAGAT AAAAGGAAGA
GGAGGACATG GAGCTTATCC TCACATGGCT GTAGACCCTA TAGTTATGGC TTCTCAAGTT
GTTTTAGGAT TACAAACAAT AGTAAGTAGA GAAATAAAGC CTGTAAATCC AGCAGTTGTT
ACAGTAGGAA GTATAAATGG AGGAACTGCT CAGAATATAA TACCAGATGA GGTTATATTA
AAAGGTGTTA TAAGAACGAT GACCCTAGAA GATAGAGCTT ACGCTAAAGA AAGACTAAGA
GAAATAGCTA CATCTATTTG TACAGCCATG AGAGGAGAAT GTGAAATAGA TATAGAAGAA
AGCTATCCAT GTCTTTATAA TAATAGCTCC GTTGTAGATT TAGTAACTGA AGCTGCAAAA
GGAATTATTG GTTCTCAAAA TGTTAAGGAA CAAGAAGCAC CAAAGCTTGG AGTTGAAAGC
TTTGCATATT TTGCCCTAGA AAGAGATTCA GCTTTTTATT TCTTAGGAGC TAGAAATGAG
GAGAGAAATA TTATTTATTC AGCTCATAAT AGTAGATTTG ATATAGACGA GAATTTATTA
CCAATTGGAG TTTCAATTCA ATGTAAAGCA GCATTAAATT ATTTGACAAG GGAGTAA
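
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how a GC fraction could be computed from such a listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and whether the GC content field of this record is derived in exactly this way is an assumption.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Only the first line of the gene sequence is used here as an example;
# paste every line of the listing to evaluate the full gene.
first_line = "ATGAATATAA ATTTAATGGA TGAAGCTCAA GAATTAAAAG ACTTACTTGT AGCTTTAAGA"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

A single line is not representative of the whole gene, so the value for this fragment need not agree with the record's overall figure.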
 
Protein sequence
MNINLMDEAQ ELKDLLVALR RDFHENPELG FEEWRTSGKI KEFLANEGIE YIETAKTGVC 
GIIKGTLKDE SKKDRCIALR ADIDGLPMDD KKTCSYSSKV KGRMHACGHD AHTTILLGAA
KLLSRHRDKF SGTVKLLFEP AEETTGGAPI MIEEGVLENP RVEKIIGLHV EETLDAGQIM
IKKGVVNAAS NPFTIKIKGR GGHGAYPHMA VDPIVMASQV VLGLQTIVSR EIKPVNPAVV
TVGSINGGTA QNIIPDEVIL KGVIRTMTLE DRAYAKERLR EIATSICTAM RGECEIDIEE
SYPCLYNNSS VVDLVTEAAK GIIGSQNVKE QEAPKLGVES FAYFALERDS AFYFLGARNE
ERNIIYSAHN SRFDIDENLL PIGVSIQCKA ALNYLTRE
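
The protein sequence above corresponds to the gene sequence translated with table 11, as stated in the Gene Information fields. The sketch below shows one way to reproduce that translation for a fragment. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and unknown codons are rendered as `X` by assumption.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed earlier in this record.
first_line = "ATGAATATAA ATTTAATGGA TGAAGCTCAA GAATTAAAAG ACTTACTTGT AGCTTTAAGA"
print(translate(first_line))
```

The output for this fragment can be compared with the first 20 residues of the protein sequence. Passing the complete gene sequence should give the complete protein, with the terminal TAA codon ending translation.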