Gene CPF_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1111 
Symbol 
ID4202865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1267397 
End bp1268917 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content25% 
IMG OID638081992 
ProductAraC family transcriptional regulator 
Protein accessionYP_695557 
Protein GI110801096 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000128485 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAA TACTATTAGT AGATGATGAG GCTACAGAAC GTGAAGGAAT TGAATTTCTT 
ATAAAAAGAT ATGAATTTCC TTTGAATATA GCTAAAGCTG TAAATGGAAA AGAAGCTTTA
GAATACATAA AGAAAAATCA TATAGATATT CTTTTTACAG ATGTAAAAAT GCCTTTCTTG
GATGGACTAG AATTAGCAAA GGAAACCTTT AAGTATGATC CTAAAATTAG AATAATAATT
TTTAGTGCAT ATAGTGAATT TGACTATGCT AAAAAAGCCT TAGAGGCAAA AGTGGTTGAT
TATTTACTTA AACCTATTGA GGTGGATGAA TTTAAAAGAG TTATGGAAGA GGTAATAAAA
AGCTGCATAA AAAGAAAGGA AGAAGAGAAA GAAAAAGAAT TACTTATGGA ATCAAGCAAG
AAGATGTTAC TTTATAAGCT ATTAGCTAAT CACAACAACC AAAATGATAT TAATGAAAAG
TTAAACGTAT ACAATATACA GTTGAAAGAT AAATATTTAG TATTAATAAA CATAGAGACA
AGAGATAACT TTTTTGAGGA GAAGGAGGAG ATTTTCTTTA ATCTCTTAAA TACATATTTA
AAAATTCCAT ATGAATATAT AAATTTTTAT CCTAATGAAT CATATTTATT ATTGCATAGC
AATTTAAAAA TAAAAGAGGA TTTTTTAGAA GAAGTTTGTT TTAAATTAGC CAGAGAAATT
AAATTATTAG CCAATGAAAA CTCTTCTTTT TTTATTAGTA ATGTATTTTG TGGAATTGAA
AAAATCTATG ATAAGGTAAA GGGATTAAAT AATATAAAGG AAAATATTTA TGATTTTGAA
TCAAGGATTA TTAGGGTTAA TAAGGATAAA ACTAATGATT TATACACACT TGAAATAGAG
AATGTAAAGG AAAATCTTAA AAACTCTATA AATGAAAGAA ATTTAAATGA TATTGAATTT
TACATAAATA AGCTTATAGA ATACATGTTA GAATCAGGTT CACTTAGTAC TATATACATA
CATCACCTTT TTTATGATTT AATGGAAAAA CTTTGTAAAG CCTTTGATAT ATATAATGGT
GAAGTGAAAA AGAGATTTAT AGAAAAGATA TTAAAATGTA ACTCAAGTGA AACCTTAAAA
GCAGCTTTTG AATCTATTAT AAAGGATATT GCAAAGGAAT GTGATAATGA TATTTTAGAC
GAAAAAAGCA TAGCTAACAA AGTGATAAAG ATAATAAAAA ATGAATATAG CAGTGAACTG
AGTTTAGATT ATATTGCAGA TAAGGTTAAT TTTACGCCAA CCTATTTAAG CTATGTTTTT
AAGAAAGAAA CAGGCTCAAA CATAGTTAAA TATATAACTG ATTTTAGAAT GAATAAGGCT
AAGGAGTTTT TAGAAGAAGG TAATATGAAA ATTGTACAAG TTGGTAAGGC CTGTGGATAT
GAAAATCAAT CATACTTTAA CCGTATATTT AAGAATTATT TTGGAGTAAC TCCAAATCAA
TTTAAACGTA AGAATAGTTA A
 
Protein sequence
MLKILLVDDE ATEREGIEFL IKRYEFPLNI AKAVNGKEAL EYIKKNHIDI LFTDVKMPFL 
DGLELAKETF KYDPKIRIII FSAYSEFDYA KKALEAKVVD YLLKPIEVDE FKRVMEEVIK
SCIKRKEEEK EKELLMESSK KMLLYKLLAN HNNQNDINEK LNVYNIQLKD KYLVLINIET
RDNFFEEKEE IFFNLLNTYL KIPYEYINFY PNESYLLLHS NLKIKEDFLE EVCFKLAREI
KLLANENSSF FISNVFCGIE KIYDKVKGLN NIKENIYDFE SRIIRVNKDK TNDLYTLEIE
NVKENLKNSI NERNLNDIEF YINKLIEYML ESGSLSTIYI HHLFYDLMEK LCKAFDIYNG
EVKKRFIEKI LKCNSSETLK AAFESIIKDI AKECDNDILD EKSIANKVIK IIKNEYSSEL
SLDYIADKVN FTPTYLSYVF KKETGSNIVK YITDFRMNKA KEFLEEGNMK IVQVGKACGY
ENQSYFNRIF KNYFGVTPNQ FKRKNS