Gene CPF_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0041 
Symbol 
ID4203684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp46559 
End bp48031 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content31% 
IMG OID638080916 
Producthypothetical protein 
Protein accessionYP_694508 
Protein GI110799806 
COG category[S] Function unknown 
COG ID[COG0397] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAATA AAAATTTTCA ATCAAAAACA GGTTTTAACT TAGAAAACAC TTATTTAACT 
CTTCCAAATA TATTCTTTAG TGAACAAAAT CCAAAGGGTT CAAAAAATCC TAAACTTATT
AAGTTTAATA CTTCTCTTGC TGAAGAACTT GGGCTTAATG AAGAAGTTTT AAACAGCGAT
TTTGGACTTA ATATATTTGC AGGAAATGAA ACCTTCCCAG GAATTGTACC AATTGCACAG
GCTTATGCTG GGCACCAATT TGGTCATTTT ACAATGCTAG GAGATGGTAG AGCACTCTTA
TTAGGAGAAC ATGTAACTAA AGATAGTAAA AGATATGATG TTCAATTAAA AGGGTCTGGT
AGAACAATCT ATTCAAGAGG TGGAGATGGA AAGGCTGCCC TTGCACCTAT GCTTAGAGAA
TATATAATAA GTGAAGGTAT GCATGGTCTT GGAATTCCTA CAACTAGAAG CCTTGCTGTA
GTAAATACTG GTGAGGAAGT TTTAAGAGAA AGATTTGAAC AAGGTGCTAT ATTAACAAGA
ATAGCTTCTA GTCACATTAG AGTTGGAACT TTTGCTTATG CAGCTCAATG GGGAACTTTA
GAAGATCTTA AAAGTCTTGC TGACTATACT ATTAAAAGAC ACTTTCCTAA TATAGCTAAG
AGTGAAAATA AATATATTTT ATTTCTTGAA GAGGTAATAA ATCGTCAAGC TGAACTTATA
GTTAAGTGGC AAAGTGTTGG ATTCATTCAT GGGGTTATGA ACACTGATAA TATGGTAATC
TCAGGAGAAA CTATAGATTA TGGACCATGT GCATTTATGG ATACTTATGA TACAAACACA
GTATTTAGTT CCATTGATTA TGCTGGTAGA TATGCTTATG GAAATCAACC TAACATGGCT
TTATGGAACT TAGCTAGATT CTCAGAAGCA CTACTTCCTC TTCTAAACCC TAACCTAGAT
GAGGCTGTTA ATATTGCTAA AAAGTCCATA TCAAACTTTT CTAAACTATA TAAAAAATAT
TGGTTCAATA AAATGAGAGC TAAACTTGGT CTTTTCACAG AAAAAGAAAA TGATGAATTG
CTAATTGAAG GGCTTTTAAG CACAATGCAA AAATATGAAG CAGATTTTAC TAATACCTTT
GTATCTTTAA CTCTTAATAA ATTTGAAGAT GAAAAAGTAT TTAGTAGTGA TGAATTCAAA
ACTTGGTATG CTCTTTGGCA AAATAGATTA AAAGAAGAAA ATAGATCACA GGAAGAAGTA
AGGAATTTAA TGATGAATAA TAATCCTTAT ATAATTCCTA GAAATCACTT AGTTGAAAAA
GCTCTTAAAA ATGCTGAAAA AGGTGATTTT ACTTTTATGG ATAATCTATT AGAAGCACTA
AAGAATCCTT ATAGTTATTC TAAAGATTTA GAAAAGTACA CTAAGTTACC TGAGAAAAGT
GACACTCCTT ATGTAACATA TTGTGGAACT TAA
 
Protein sequence
MDNKNFQSKT GFNLENTYLT LPNIFFSEQN PKGSKNPKLI KFNTSLAEEL GLNEEVLNSD 
FGLNIFAGNE TFPGIVPIAQ AYAGHQFGHF TMLGDGRALL LGEHVTKDSK RYDVQLKGSG
RTIYSRGGDG KAALAPMLRE YIISEGMHGL GIPTTRSLAV VNTGEEVLRE RFEQGAILTR
IASSHIRVGT FAYAAQWGTL EDLKSLADYT IKRHFPNIAK SENKYILFLE EVINRQAELI
VKWQSVGFIH GVMNTDNMVI SGETIDYGPC AFMDTYDTNT VFSSIDYAGR YAYGNQPNMA
LWNLARFSEA LLPLLNPNLD EAVNIAKKSI SNFSKLYKKY WFNKMRAKLG LFTEKENDEL
LIEGLLSTMQ KYEADFTNTF VSLTLNKFED EKVFSSDEFK TWYALWQNRL KEENRSQEEV
RNLMMNNNPY IIPRNHLVEK ALKNAEKGDF TFMDNLLEAL KNPYSYSKDL EKYTKLPEKS
DTPYVTYCGT