Gene CPF_0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0456 
Symbol 
ID4203858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp541443 
End bp543263 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content24% 
IMG OID638081340 
Producthypothetical protein 
Protein accessionYP_694913 
Protein GI110800015 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000703218 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATTA AACAAATGAA TATAAAAATA TTTAAACTAC CTTTAATATT TGCAGTTTTA 
TCTGTAGTAT TAATGGGATT ATACAAAACA ATATTTGGAG TGGAAAACAC TATTATAGGA
TTAATAATAG CAATGGCGTC CTATGCTTTT CTAAGATTAG ATTTAACTTC ATATCCAATT
TATAAGTCTA TGATATTTCT CATATTAAAT TTATTTTTGG CCATAAGTGC CTATATATCC
GCCATAAATC CATTTGTTGG ATTAATAATA AACTTTTTAA TACTTTTTAC AGTATCATTT
ATATATACAA CGGAATTTAA AAATGTTATT TCCTATATCT TTCTATTATT ATATGTGTAT
ATGTGGGAAT ATTCTATTAG TTTGGATGAA TTGCCAAGAA GGTTAGTGGC TATGGGGGTA
GGTGTTTTCA TTATTATTGG AATTCACATA TTATTTAATA GAAGAAATTT CAAAAAGAAT
TCAAATAATA TAATCATAAG ATCTATAAGA AATATTCAGA AGGAAATCTG TCATATAATA
AATGAAAGCT ATAGGGAAAA GGAAAATATC TATATAGATA GCGAATTAAG AAAATTATTA
ATTTTGATTG AAGGAAGAAA TAATAATAAA TTTATAGAAA ATCATAAGGA TGATATTTAT
TTTAATATTG TACTTATATT AGAAAGAATA AATTCAATAA TCAATAAAGT TGGCAAAATT
AATAATAAAT CTAAAGACGT AATAGACTAT TTAAATAGTT TAAATCATGA TTTAGAAAAC
ATAACTTTAT TTTTAGAAAG AAAAGTGGAT TGTATTAATG AGGAGAAGGA TGATTTAAAT
AAGGATTCTA CCATAAATAA TTGGACTGAG AAAGAATATG CTTTTTTAGG GGAATGCACT
GAACTTATAA GACTTTTAGA GAAAAATATA AATAATTTAT ATGAATATAA TAGGAAGAAA
TCAAGGAAGA GAATTAAAGT AAAATTTAAT CTTAAGGAGC TACTAATAGG AAATAGTTCT
TTAAAAATGA AACATTTAAG GGTAGCTTAT TCTTTAAAAC TAGCAATAGC AGTTTCTCTA
ATAATGTTTA TAGTAGATTT ATTTAAAATA CCTCAGGGAA GATGGATTGT TACCAGCGTT
TATGTGGTTA TACAGCCCTA TGAAGAAGAA ACCTTAACAA AAGCAATAAA AAGATTCAAA
GGGACAATAA TAGGAGTGAT AATTTACATT TCTATATTTA CATTTTTCCC GCATATTATT
CCTTTAGAAT TACTTCTATT AATATTAATG TTTCTTTACT TTGTCCAAAA GGATTATGAG
AAAAAGGTTG TATGCACAGC ACTTATGACT CTAAGCTTTG GATTATCTAG AAGTACAGTT
GGATACTTAG CCTTTTATAG ATTTTTATTT GTAATAATAG GAATAGTAAT AGCTTTAGGA
ATTAACAAGA TTATTTTCCC ACAGAGCATA AAAAATTCTA TATATGATTT AAAGGAAAGG
TATTTAGAAT TAACAAGTAA ACTTCTTTAT GAGTTAAAAA GCATATTATA TGAAGAGAAA
TATAATGAAA ATACAATTAA ACTTCTATTA GATTGTAACC TTATTGAGTG TAAGTTAATG
GAAAATAAAT TAATTGCTGA AAATTTAGAG CTTAAAGATT TAGTGTATAA ACAGAGCATA
ATTTTAAGCA AGATAAGATG TCTTATATTA TTTATAAATT ACTCAAATTG GGGAATTTCA
TCAAAACACA TTAATGTAGA TAAAAATTTA TTAAATGTAA TTTTCAATAA AATAGAGGAA
GAACTAAGGG AGATTTATTA A
 
Protein sequence
MTIKQMNIKI FKLPLIFAVL SVVLMGLYKT IFGVENTIIG LIIAMASYAF LRLDLTSYPI 
YKSMIFLILN LFLAISAYIS AINPFVGLII NFLILFTVSF IYTTEFKNVI SYIFLLLYVY
MWEYSISLDE LPRRLVAMGV GVFIIIGIHI LFNRRNFKKN SNNIIIRSIR NIQKEICHII
NESYREKENI YIDSELRKLL ILIEGRNNNK FIENHKDDIY FNIVLILERI NSIINKVGKI
NNKSKDVIDY LNSLNHDLEN ITLFLERKVD CINEEKDDLN KDSTINNWTE KEYAFLGECT
ELIRLLEKNI NNLYEYNRKK SRKRIKVKFN LKELLIGNSS LKMKHLRVAY SLKLAIAVSL
IMFIVDLFKI PQGRWIVTSV YVVIQPYEEE TLTKAIKRFK GTIIGVIIYI SIFTFFPHII
PLELLLLILM FLYFVQKDYE KKVVCTALMT LSFGLSRSTV GYLAFYRFLF VIIGIVIALG
INKIIFPQSI KNSIYDLKER YLELTSKLLY ELKSILYEEK YNENTIKLLL DCNLIECKLM
ENKLIAENLE LKDLVYKQSI ILSKIRCLIL FINYSNWGIS SKHINVDKNL LNVIFNKIEE
ELREIY