Gene CPF_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1064 
Symbol 
ID4203119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1210835 
End bp1212460 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content23% 
IMG OID638081945 
ProductEAL domain-containing protein 
Protein accessionYP_695510 
Protein GI110801198 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0102515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GTAAGAGGTT TATAATGATT ATCCTCTTTT TTCTTGTATT TATAATGTTA 
TGTTTATGCT ATGGATCAAT TATAAATAAA GAAAAAAGAA AAACCTTATT AAAAATTGGA
TTTTATGATG ACTATCCTCA TTTTTATATT AATAATAAAG CAAATGTTTG TGGATATTAT
AAGGATATAA CTGAAAATTT AGCTAAAAAA CTTAATTTTA AGGTAGAATA TGTAAATGGA
AATGTGCCAG ATCTTTTAAA AGAACTTAAA AACGGAGAAA TAGATTTAGT ATTTGGAATA
AATAAGCTTC CAGCGAGAGA AGAATCCTTT AAGTTTACAA ATAAATCTAT AAATGATGAG
CTGAATTTTA TATATACAAA TAAGAATATA AAATATGGTG ATTTAGAAGC TTTAAATGGT
ATGAAAATGG GATATATAGA GGGTGAATTA GATAATGAAT GGATATTAGA TTATCTAAAA
AAAAGAAATA TAAATGTTGA ACTAGTTAAT GGATCTTCTT ATAAAGCAGT AAAGACTTTA
TTAATTCATA ATAAAGTAGA TTTTATTGTG GATAATCCAG ATAGTGATAT AAAAAATAAA
GGAAAAAATA TTAAAGAAGT TTTTGAATTT TCATCTGGAG AAAAATATAT TGTAGCAAAT
AAGAATAATA AAGAGTTAAT AAAAAAAATT GATGGAGCGC TTAGTACAAT TAATCTTAAT
GCATATCTTG GTAATAATCC TTATTTTAAA AAAATTGATA ACTTTATTAT TGATACTACT
AATAAGAATG TAGTTATTTT AATTCTTTTT ATTATATGTA TAATTATGTT CAAGAAGGTT
AAAAAGAGAA TAGTTAAAAT ATTTAAAAAG AAAAAAATAT ATAATGATCT AAAAAAAGAC
AATTATACTT TGTATTATCA ACCAATAGTG GATTTTAAAC ATAATAGAGT AAGAAGCGTT
GAGGCTTTAT TGCGTTTAAG AAAAGATGGC AAATTACTAA CTCCATATCA TTTTATGAAG
GATATAGAAG ACGCCAATAT GATGAAAGAA ATTACATTGT GGGTCTTAAA GAGGGTAATT
AAAGATTATA ATATTATAAG GTGTTACGAC AATATCAATG AAAAAGATTT TTATATTTCT
CTAAATGTAT CTTTTAATGA GATAAAAGAT AGAGAGTTTT TAAAGAAAAT AGTGAAAATA
GTTAATGATA ACAAAATAAT AAAAAATAGT ATTTGTCTAG AGATTATAGA AAAGTTTGGA
GTAGAGGAAA TAGAAAAAAT ACAAGAAAAC ATCAAGTTTT TACAGGATAA TGGTATTTTA
ATCGCAATAG ATGACTTTGG TGTGGAGTAC TCAAATTTAG ATTTATTAAA GAAAATAGAT
TCTAATATTA TTAAATTAGA TAAGTTTTTT GCAGATGGAA TTAATGATTC AGAAATAAGC
CTTAAAGTAA TAGACTTTAT ATTAGATATA TGTAGATTAT CAGATAAGTC TATAGTTATT
GAGGGGATAG AGGAAAAAGA GCAGGTTGAT ATAATAAAAA CCTTTCTTTA TGAAAAAATT
TATATTCAAG GATATTATTT CTCAAAGCCA TTAGATATTA AAAGTTTAAA AGCTTATACC
TTTTAG
 
Protein sequence
MKKSKRFIMI ILFFLVFIML CLCYGSIINK EKRKTLLKIG FYDDYPHFYI NNKANVCGYY 
KDITENLAKK LNFKVEYVNG NVPDLLKELK NGEIDLVFGI NKLPAREESF KFTNKSINDE
LNFIYTNKNI KYGDLEALNG MKMGYIEGEL DNEWILDYLK KRNINVELVN GSSYKAVKTL
LIHNKVDFIV DNPDSDIKNK GKNIKEVFEF SSGEKYIVAN KNNKELIKKI DGALSTINLN
AYLGNNPYFK KIDNFIIDTT NKNVVILILF IICIIMFKKV KKRIVKIFKK KKIYNDLKKD
NYTLYYQPIV DFKHNRVRSV EALLRLRKDG KLLTPYHFMK DIEDANMMKE ITLWVLKRVI
KDYNIIRCYD NINEKDFYIS LNVSFNEIKD REFLKKIVKI VNDNKIIKNS ICLEIIEKFG
VEEIEKIQEN IKFLQDNGIL IAIDDFGVEY SNLDLLKKID SNIIKLDKFF ADGINDSEIS
LKVIDFILDI CRLSDKSIVI EGIEEKEQVD IIKTFLYEKI YIQGYYFSKP LDIKSLKAYT
F