Gene CPF_0026 details


Gene Information

Locus tag: CPF_0026
Symbol:
ID: 4202400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 33434
End bp: 34708
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 26%
IMG OID: 638080901
Product: polyA polymerase family protein
Protein accession: YP_694495
Protein GI: 110800095
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID:
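
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in one stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(33434, 34708)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1275 424
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.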


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAATGA AATTACCTAA CAATGTCCAA TATATCTTAG AGAAATTTAA CTCTAATGGT 
TTTGAAGCCT TTATAGTTGG TGGTTGTGTA AGAGATTCTT TATTAAATAA AAAACCTCAA
GATTATGATA TTACAACCAA TGCATTCCCT GAAAAAATAG AAGAGCTTTT TGATAAAACT
ATTCCTACTG GTATTAAACA TGGAACAGTA ACAGTTTTAA TCGATAAAAA TCCTTATGAA
GTAACTACTT ATAGGGTAGA TGGGGAATAT TTAAATAATA GAAAACCTAA AGACGTAAAG
TTCGTTTCTA ATATAGAAGA AGATTTATCA AGAAGAGATT TTACTATAAA TGCAATGGCA
TATAGCCCAT ATTTAGGATT TAAGGATTGT TTTAATGGAA AAGATGATCT AAAAAACAAA
TTAATAAGAT GCGTTGGAGA TCCTGATAAA CGCTTCTCTG AAGACGCCTT AAGAATGCTT
AGAGCAATTA GATTTAGTTG TCAATTAAAC TTTAAAATAG AAAAATTAAC TGCTGAATCT
ATAAGAAAGA ATTTTAAATT AATAAAAAAT ATAAGCATGG AAAGAATTCA AAGTGAATTT
ACTAAAATAA TTCTAAGCAA TGATCCAGAT AGAGGTCTTA TGCTTCTTAG AAAGCTAGGA
TTTTCTGACT TTTTAGTTGA GGAATTTAAG AATTTAAAAC TAATAAATTG TTATGATTTA
TATGATGATA TCCATGATAC TTATGGATTA ATAAATTCAC TTCCTAAAAA GCTTCATGTA
AGATTAGCAG GATTATTCTA TAAAGTTTTT AATTCTGAAA ATGCAGTTGA GAAGTGCAGA
ACTATATTAA AGAAACTTAA ATATGATAAT AATACAATCA ATGATACTTG CAACTTAGTA
GAAAATATAA ATAATATTTC ATGTAATATG ACAAGAAAAA AACTAAAACT ACTTATAAAT
TCAGTTGGAA CTGAAAATAT CTTTGATTTA TTAGATTTAC AAAAATCATA TCTATCTTAC
ATGGATGAAT ATGATACTGA ATGTATAGAT ATATTAAAAA ACAGAGTTTC TGATATATTA
GCTTCAAAAG AACCCATATT TATTAAGGAC TTAGCCATAA CAGGAAATGA CTTAATTACC
GAACTTAATT TTAAACCTGG AAAAAATATA GGTGTTATAT TAAATTTTCT TCTTGAAAAT
GTAATGCAAA CACCAGAGTT AAATAATAAG GAAGACTTAC TAAACCTTAG TAAGCAATTT
TATTCATATA ATTAA
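
The gene sequence above is displayed in space-separated blocks of 10 bases, so the spacing has to be stripped before any computation. As a minimal sketch, the helpers below (hypothetical names, plain Python) clean the displayed text and compute the G+C fraction. Only the first displayed line is used here; passing the full sequence in the same way would give the gene-wide value to compare with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGTTAATGA AATTACCTAA CAATGTCCAA TATATCTTAG AGAAATTTAA CTCTAATGGT"
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```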
 
Protein sequence
MLMKLPNNVQ YILEKFNSNG FEAFIVGGCV RDSLLNKKPQ DYDITTNAFP EKIEELFDKT 
IPTGIKHGTV TVLIDKNPYE VTTYRVDGEY LNNRKPKDVK FVSNIEEDLS RRDFTINAMA
YSPYLGFKDC FNGKDDLKNK LIRCVGDPDK RFSEDALRML RAIRFSCQLN FKIEKLTAES
IRKNFKLIKN ISMERIQSEF TKIILSNDPD RGLMLLRKLG FSDFLVEEFK NLKLINCYDL
YDDIHDTYGL INSLPKKLHV RLAGLFYKVF NSENAVEKCR TILKKLKYDN NTINDTCNLV
ENINNISCNM TRKKLKLLIN SVGTENIFDL LDLQKSYLSY MDEYDTECID ILKNRVSDIL
ASKEPIFIKD LAITGNDLIT ELNFKPGKNI GVILNFLLEN VMQTPELNNK EDLLNLSKQF
YSYN
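
The protein sequence can be reproduced from the gene sequence by translating in frame 1 with translation table 11. The sketch below builds the codon table by hand; table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits. That difference is ignored here, which is safe only because this gene begins with ATG. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases and the final displayed line of the gene sequence.
print(translate("ATGTTAATGA AATTACCTAA CAATGTCCAA"))  # MLMKLPNNVQ
print(translate("TATTCATATA ATTAA"))  # YSYN
```

Both fragments match the corresponding ends of the protein sequence listed above. The final line can be translated on its own because the 21 preceding lines hold 1260 bases, a multiple of 3.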