Gene CPF_1475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1475 
Symbol 
ID4203700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1671824 
End bp1673158 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content29% 
IMG OID638082353 
Productamidohydrolase domain-containing protein 
Protein accessionYP_695918 
Protein GI110799822 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0938308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TAGTAACTTT AATAAAAAAT GTTACAGTTA TTACTATGAA TGAACATAAA 
GAAATTATAG AAAATGGTTT AGTTGTTTTT GAAAAAAATA AAATAGTATA TGTTGGAACT
GATGTAAGGA CAGAAGAAAA ATTAAAAAGA AGTGGATATA AAGTTGAGGT AATAGATGGT
GAAGAGGGAA TATTAATGCC AGGTATGATA AACTGTCATA CTCATGGGTC TATGGTTCCT
TTTAGAAGTT TAGCTGACGA TTGTAAGGAT AGATTAAAGA GATATTTATT TCCCTTAGAA
CAAAGATTAG TTGATAAAGA ATTGACTTAT ATAGGAGCAA AGTATGCTAT AGCAGAAATG
TTATTAGGTG GAGTAACTAC CTTTTGTGAT ATGTATTATT TTGAAGATGA GGTAGCGAAA
GCTGCTAAAG AATTAAATAT GAGAGGAGTG CTTTGTGAGA CAATAGTAGA TTTTCCTTCG
CCAGATTCAG AAAAGGCTTT TGGAGGGATA GATTATTCCA TAAGATTTAT AGAAAAATGG
AAAAATGATG ATTTGATAAC TCCAGGAATA GCTCCTCATG CACCTTATAC AAATACAGAG
GAATCATTAA AGGAAGCATA TAAAATTAGT AAAAAATATG ATGTTCCTAT AACTATGCAT
TTAGCAGAAA TGGACTATGA GTTAGAGGAA TATAAAAATA AATATAATCT TACTCCAGTT
TCTTATTTAG ATAAGCTTGG GGTTTTAAAC TCTAATTTTA TAGCGGCCCA TGCAGTTTTA
GTTAATGAAG AAGATATAGA AATTTTAAAG AAGAATAATG TTAATATATC TCATAATATA
GGGGCAAATT CTAAAGGAGC TAAAGGAATA GCACCAATAT TAAAGATGAG AGAAAAAGGA
ATAAATATTG GACTTGGAAC AGATGGTCCT ATGAGTGGAA ATACTTTAGA TATATTAAGC
CAGATGTCAC AAGTGGGAAA AATTCATAAA TTATTTAATA AAGATAGAAC TTTACTACCA
TCAATAGATT TAATAGAAAT GGGAACCATA GGTGGAGCTA AAGTTTTAGG AATTGATAAA
GAAGTTGGAT CTATAGAGGT TGGTAAAAAA GCTGATTTAA CTTTAATAGA AACAAAATCA
GTAAATATGC AACCTATATA TGATTATTAT GCAACAATAG TATATTCTGC TAATTCTAGT
AATGTAGAAT TAGTAGTTGT TGATGGTAAG ATTGTTGTTA AGGATAAAAA ATTAGTAAGT
GCTAGCTTTT CTGAAGTAAG AAAAGATTTA CTAGGTTTAA CAGAGAAAAT AAAGAAAATT
TCTAAAGAAC TTTAG
 
Protein sequence
MSKIVTLIKN VTVITMNEHK EIIENGLVVF EKNKIVYVGT DVRTEEKLKR SGYKVEVIDG 
EEGILMPGMI NCHTHGSMVP FRSLADDCKD RLKRYLFPLE QRLVDKELTY IGAKYAIAEM
LLGGVTTFCD MYYFEDEVAK AAKELNMRGV LCETIVDFPS PDSEKAFGGI DYSIRFIEKW
KNDDLITPGI APHAPYTNTE ESLKEAYKIS KKYDVPITMH LAEMDYELEE YKNKYNLTPV
SYLDKLGVLN SNFIAAHAVL VNEEDIEILK KNNVNISHNI GANSKGAKGI APILKMREKG
INIGLGTDGP MSGNTLDILS QMSQVGKIHK LFNKDRTLLP SIDLIEMGTI GGAKVLGIDK
EVGSIEVGKK ADLTLIETKS VNMQPIYDYY ATIVYSANSS NVELVVVDGK IVVKDKKLVS
ASFSEVRKDL LGLTEKIKKI SKEL