Gene CPF_1053 details

Gene Information

Locus tag: CPF_1053
Symbol:
ID: 4201747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1201696
End bp: 1202742
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 29%
IMG OID: 638081934
Product: DNA-binding protein
Protein accession: YP_695499
Protein GI: 110801073
COG category: [K] Transcription
COG ID: [COG1476] Predicted transcriptional regulators
TIGRFAM ID:
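
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon (neither is stated on this page). The function name check_lengths is hypothetical.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if the coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1           # inclusive span on the replicon
    if span != gene_len_bp:
        return False
    if gene_len_bp % 3 != 0:               # a CDS should be a whole number of codons
        return False
    coding_codons = gene_len_bp // 3 - 1   # drop the stop codon
    return coding_codons == protein_len_aa


# Values taken from the record for CPF_1053
print(check_lengths(1201696, 1202742, 1047, 348))
```

With the values in this record the span is 1202742 - 1201696 + 1 = 1047 bp, which is 349 codons, or 348 amino acids plus one stop codon.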


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.101358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATAA ATGAGATTAT AAAAGAGAAA AGAAATTCAC AAGGGTTAAC TCAGGAACAA 
GTTGCCATGT ATCTTGGAGT ATCAACACCA GCAGTGAATA AATGGGAAAA GGGAACTTGC
TATCCAGATA TTACATTACT ACCAGCTTTA GCAAGGCTTT TAAAAGTTGA TTTAAATACC
TTACTTTCAT TTAAGGAAGA TTTATCAGAA CAAGAAATAG GAGTTTTTGT TAATGATTTA
GTAAAGATAG CTAATAATGA TGGATTTCAT GTAGCTTTTG AAAAAGCCAT GGACAAAATC
TATGAGTATC CTACCTGTGA TAAATTAATT TTAACAGTTT CTACAGTGAT GCAAGGAAGC
ATTTATATGT TTGGAGCTGA TGATAAGGAG AAGTATGAAA AGCAGATAGA AGAGTTTTAT
ATTAAATTAA TAAGAAGTGA TGATATAGAG ATTAGAAATC AGGCACTTTC TATGCTTATA
AACATATATT TAGGAAGAAA AGAGTATGAG AAAGCACAAG GGTGTGTGAA TAAATTACCT
AATATTACTT ACGATAAAAA AGTATTACAA GGAAATATTT ATAATAAATC TGGTGAATTT
CAAAAGGCTG CTGAAATTTT TGAACAGAAG TTACTTTCAG CCACAACTGA TATATATATA
AGTTTAATTT CTATGATTGA AATAGCCTTA AAAGAAGGGC GCAATGAGGA TGCTAAGTAT
TTTGCAGAAT TTATAGATAA GACAACAAAA TTATATGATT TATGTGAGTA TAATTCATAT
AGTGGGTATT TCGAAGTGTA CAAAGCAGAA AAAGATGTAG AAAACTTTTT ACTTGTATTA
AAGAAAATAT TAGATGTAAT GGGCAAAAAG TGGGAGCCAT CAAAATCTAA ATTATATAAG
CATATAAAAA GTAAAGGAGA CGAAGAGGGA TTTAATGAGC AACTTCTTTC TAACTTTGTA
AACATTCTAA AAAATGATAG CGATGGTGAA TTAGATTTTT TAAAGGATAA TGAGGAATTT
AATAATTTAT TAAATACATC TATATAA
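
GC content is the fraction of G and C bases in a nucleotide sequence. The following is a minimal sketch for computing it from a sequence pasted in the 10-base blocked layout used above. The helper name gc_content is hypothetical, and the rounding convention behind the percentage reported in this record is not known.

```python
def gc_content(seq):
    """Return the G+C fraction (0.0 to 1.0) of a nucleotide sequence.

    Whitespace (spaces, newlines) from the blocked layout is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Small illustrative input, not the full gene
print(gc_content("GGCC AATT"))  # 0.5
```

To check the full gene, pass the complete gene sequence text as a single string and multiply the result by 100.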
 
Protein sequence
MRINEIIKEK RNSQGLTQEQ VAMYLGVSTP AVNKWEKGTC YPDITLLPAL ARLLKVDLNT 
LLSFKEDLSE QEIGVFVNDL VKIANNDGFH VAFEKAMDKI YEYPTCDKLI LTVSTVMQGS
IYMFGADDKE KYEKQIEEFY IKLIRSDDIE IRNQALSMLI NIYLGRKEYE KAQGCVNKLP
NITYDKKVLQ GNIYNKSGEF QKAAEIFEQK LLSATTDIYI SLISMIEIAL KEGRNEDAKY
FAEFIDKTTK LYDLCEYNSY SGYFEVYKAE KDVENFLLVL KKILDVMGKK WEPSKSKLYK
HIKSKGDEEG FNEQLLSNFV NILKNDSDGE LDFLKDNEEF NNLLNTSI
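
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the gene sequence is given in its coding orientation, as the leading ATG and trailing TAA suggest. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon table is used here. The function name translate is hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace from the blocked layout is ignored; a trailing partial
    codon is dropped.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGAGAATAA ATGAGATTAT AAAAGAGAAA"))  # MRINEIIKEK
```

The first ten residues produced this way match the start of the protein sequence listed above.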