Gene CPF_0908 details

Gene Information

Locus tag: CPF_0908
Symbol:
ID: 4203357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1070550
End bp: 1072190
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 24%
IMG OID: 638081790
Product: EAL domain-containing protein
Protein accession: YP_695357
Protein GI: 110800502
COG category: [E] Amino acid transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
        [COG2200] FOG: EAL domain
TIGRFAM ID:
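
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1070550, 1072190)
print(length)                     # 1641
print(protein_length_aa(length))  # 546
```

Both results agree with the Gene Length and Protein Length fields of the record.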


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00157995
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GGGTGTTTTG GGTAAGTATA GTATTTTTAA TAATTATAAC GGTTTTAGGT 
ATTACAATTA AATTTGATGG TAAGAAAGTT AATTGTAACA GAAAAACAGT TAAAGTAGGA
TTTTATGAAT ATTATCCTTA TTATTATCTT AATAAAAATT CTATGCCAGA TGGCTATTAT
AATGAAATAC TAGAATTAAT ATGTAATAAG TTAGATTTAA ATTATAAGTA TGTAGATTGT
AATGTAACAG ATGCTTTAGA AAAGCTTAAA TCTGGACAGA TAGATTTAGT CTTTGGAATA
AGTAAGACTC CTGATAGAGA AAAGGAATAT GAATTTACTG ACCACTATCT AAATAATGAT
AACTTTGCCA TATATACTAA TAAGAATATA AAAAATGGTG ATTTAAAAGC TTTAAATGGA
TTAAAAATGG GATTTTTAAA AGGAGAAGAA AATAATGAGT GGATTTTAAG GCTTTTAAAG
GATAAAGGCA TAAATGTGAA ACCTATAGAT GTTTCTAATT ATCCTGAAGA TGAGGAATAT
TTGTATAATA ATAAAGTGGA CTTTGTAGTA GAAAATACAA GAAGCAATAT AAATTATGAA
AATAAAAATA TTAAAAAGAT TTTTGAATTT TCTTCTGGAC CAGTTTATAT AGTTAGTAGA
AAAGGTAATG AAAAATTAAT TGAAGGAATA GATTCTGTCC TTGGAGAGCT TGAGGAAGAT
GAGGAACAAA AAGATATTAA TTTATATTCT AAGTATTTTG ATGAGCATTT AGATAAATTA
AAAAATGAGA AATTACTAGT TGTAATATTT TTAATTATAA TATCATTATT TATTTATAAA
AAAAGAAAAA ATAAAATATT CGCTATAAGA ACTAAAAGAA AAATTAGAGA CTATATTAAA
AATGATAAAT ATATATTATA TTATCAACCT ATAGTAGATC CAAAGAAAAA TAGAGTAAAG
GGATTTGAAT CTCTTTTGAG ATTAAATAAG GATGGAAAAA TTTTAACTCC CTATAGCTTT
ATAAAGGAAA TTGAAGACAA TAATATGTCT TTAGAGGTTT CTTTATGGCT TTTAAAGAAG
GTTATTTTAG ATTACAGAAT AATAAAAGAT TATGATATGG TTAAAGGAAG AGATTTTTAT
ATATCCTTAA ATGTTTCATT TAAAGAAATA GAAAATCCTA AGTTTTTAAG ATCCTTAATG
AAAATTGCAA AAGATTATAA GATTGATGAT TGTAATATTT GTTTAGAGAT AGTAGAAAAG
TTTGGTATGG AGGATATAGG AAGAATACAA AGTGCAATAA GAATAATAAA GGAATATGGA
TTTTTAATAG CTATAGACGA TTTTGGAGTG GAATATTCTA ATTTAGATTT ATTAAATAAA
ATTGATTCTG ATATAGTGAA GCTAGACAAG TACTTTGCTG ATAATTTAGA CAAGTCTATT
ATAAATGAAA AAACAGTGGA ATTTATATCA GAAATATGTA TCATAGCTAA TAGAACTATA
GTATTTGAAG GGATAGAGGA ACAGTATCAG GTTGACATTG TTAAGGCATT TCCATATGAA
AAAATATATA TTCAGGGATA TTTCTATTCA AAGCCAGTAG ATATTGAGAA TTTAAAGGAT
TTTAAATTTA AGGATAGTTA A
 
Protein sequence
MKKRVFWVSI VFLIIITVLG ITIKFDGKKV NCNRKTVKVG FYEYYPYYYL NKNSMPDGYY 
NEILELICNK LDLNYKYVDC NVTDALEKLK SGQIDLVFGI SKTPDREKEY EFTDHYLNND
NFAIYTNKNI KNGDLKALNG LKMGFLKGEE NNEWILRLLK DKGINVKPID VSNYPEDEEY
LYNNKVDFVV ENTRSNINYE NKNIKKIFEF SSGPVYIVSR KGNEKLIEGI DSVLGELEED
EEQKDINLYS KYFDEHLDKL KNEKLLVVIF LIIISLFIYK KRKNKIFAIR TKRKIRDYIK
NDKYILYYQP IVDPKKNRVK GFESLLRLNK DGKILTPYSF IKEIEDNNMS LEVSLWLLKK
VILDYRIIKD YDMVKGRDFY ISLNVSFKEI ENPKFLRSLM KIAKDYKIDD CNICLEIVEK
FGMEDIGRIQ SAIRIIKEYG FLIAIDDFGV EYSNLDLLNK IDSDIVKLDK YFADNLDKSI
INEKTVEFIS EICIIANRTI VFEGIEEQYQ VDIVKAFPYE KIYIQGYFYS KPVDIENLKD
FKFKDS
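
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example input is only the first line of the gene sequence, so its GC fraction is not the whole-gene value reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAAAAA GGGTGTTTTG GGTAAGTATA GTATTTTTAA TAATTATAAC GGTTTTAGGT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base window only
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick check that no blocks were lost when copying.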