Gene CPF_1698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1698 
Symbol 
ID4201545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1919946 
End bp1922006 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content27% 
IMG OID638082572 
ProductAraC family transcriptional regulator 
Protein accessionYP_696136 
Protein GI110800462 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000348751 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAAAAG AATATGTTAA TTTCCCTTCA GACATACCTG TAACCATATC TTATGTAAAC 
ATAAAAAACT ACCCCTTGCA TTGGCATGAT GCAATAGAAA TACTTTATGT TCTTAAGGGC
AGCATAAAAG TAGATATAGA TACTGACAGT TATGAAATAC AAGAAGATGA AATTGAAATA
GTAAACACAG AACAAACTCA TAGAATTTAT TCTAATAAGG ATAACAGAGT TTTAATATTT
AAAATAGATC CACACTTTTT CGAAAAATAT TATAGCGATA TAGAAAATAT GTTTTTCTAT
ACTAACACCT CTGATGAAGG TGCTCAAAGT GATGAATCTT ACGATAAACT TAGAGTATTT
TTATCCATTA TATTATGTGA AGAAGCTCAA AAAGTAGATG ATTATGATAA ATATATAGAA
AAATCCCTAG TAGAGCTTTT ATTTCACCTT TTAAATAATT TCCACTATCT TTTATATGAT
AATGATGAAA TTCATGAGAA TAATATGTTA CTAGAAAGGT ACCATAGAAT TTCTAAGTAT
ATATATAATA ACTACAATAA GAATATAACC TTAAAGGATA TTGCTAATAC TGAGTTTCTT
AGTACTCATT ATCTTTCCCA TGAAATAAAA TATGCAACAG GACTAAGTTT TACAGACCTT
CTAAACTTAA CTAGAGTTGA AGAATCAGTA AAACTTCTTT TAGATACTGA TAAGTCCTTA
TCTGAAATAT CTTATGAAAT AGGCTTTTCT CATACTCGTT ATTTTAATAA ACATTTTAAG
GCCTATTACA ATTGCACTCC TCTTCAATTT AGAAAAAAAC ATAAAATAAG TGAAGAGGAA
TATAATAAAC AAAAAGAAAT AACTTACTAT CCCTTAGCTG ATAGTCTTGA GGAACTATCT
TATTACTTAG ATGATTATCC AAGATTTAAT TATGAAGATA AGATTCATAA GCTTACTTTT
AATATGAATA CTGAAGGCAC TGAATTTAAT AAATACTTTA AAGAAGTTCT TAATGTAGGA
GATGCCTTTG ACTTATTATT AGAAGATAAC CAAGATATTG TTGAAGATCT TCAAGATCAT
ATAGGGTATA ACTATATAAG ACTTCTACAT GTATTTTCAT CAGATATGGG CATATTCCCT
GGTTCAAAAT TCTTTAACTG GACTAGAACC TTTGATATTT TTGAATATAT ATCTTCATTA
GATTTAATCC CTTTAATAGT CCTAGATGAT TCTGGATATT CTAAGGATAA TTTCTTAGAT
GTTATAAAAT CCTTTATAGA CTTTTTTAGC GAGGTAGAAA GTTTTGAATT AACAGATTTA
AAATTCCAAT TTACCTCTAC ATTTAATGAG GATCTAAAGA ATTCTTTAAT TGAATTGTTT
GAATCTAAGG ATTTAAATTT AGTAAATGAA CTTTATACTC CAAACAATAA AATAGATTTA
ATATATGACA CAGCTTATAT GCTTCCATTT ATAATTCATA ATACTGTAAG CTCAGGAAGC
AAATTAAACT TTATAAAAGC CTTTGATGCC TTAGATAGAC AAATTGACAT TACAAATGAA
GTTTTCTTTG GCTATCCAGC AATGGTAAAT GATAAAGGAA TTAAAAAACC CTCTTATTAT
GCTTACTATT TTTTAAGCAA ACTAGGAGAC ACACTTCTTT ATAAGGGAGA TGGATATATA
CTAACTAAGT CAGAGGATGA ATATCAACTT CTTGTATATA CCTATAATGA TGAAATTGAT
TCCCTTATAG ATTTTAAAAA CTTTACTAAG TTAAGAGGGG TTAAAGACTT GGTAGATAAA
AAACTTTCCT TAAACTTACT TGATTTAGAT AGCGATGTTA GAATAACTAA ATACACCATA
GGGGAGAACT TTGGCTCTTC ATTTAATTAC TGGCTTTCCA TGGGAAAACC TAAAAGATTA
AGAAAAGCAG AAAAGGATAT ATTATTCCAA GCTTCCTATC CTAAAATAGA ATTTAAGTAC
GCCAAAAAAA ATACTATTTT AAATATACAA ACAACTCTTC AAGGATATTG TGCAGAACTA
TTTATACTAA AAAAAGTTTA A
 
Protein sequence
MRKEYVNFPS DIPVTISYVN IKNYPLHWHD AIEILYVLKG SIKVDIDTDS YEIQEDEIEI 
VNTEQTHRIY SNKDNRVLIF KIDPHFFEKY YSDIENMFFY TNTSDEGAQS DESYDKLRVF
LSIILCEEAQ KVDDYDKYIE KSLVELLFHL LNNFHYLLYD NDEIHENNML LERYHRISKY
IYNNYNKNIT LKDIANTEFL STHYLSHEIK YATGLSFTDL LNLTRVEESV KLLLDTDKSL
SEISYEIGFS HTRYFNKHFK AYYNCTPLQF RKKHKISEEE YNKQKEITYY PLADSLEELS
YYLDDYPRFN YEDKIHKLTF NMNTEGTEFN KYFKEVLNVG DAFDLLLEDN QDIVEDLQDH
IGYNYIRLLH VFSSDMGIFP GSKFFNWTRT FDIFEYISSL DLIPLIVLDD SGYSKDNFLD
VIKSFIDFFS EVESFELTDL KFQFTSTFNE DLKNSLIELF ESKDLNLVNE LYTPNNKIDL
IYDTAYMLPF IIHNTVSSGS KLNFIKAFDA LDRQIDITNE VFFGYPAMVN DKGIKKPSYY
AYYFLSKLGD TLLYKGDGYI LTKSEDEYQL LVYTYNDEID SLIDFKNFTK LRGVKDLVDK
KLSLNLLDLD SDVRITKYTI GENFGSSFNY WLSMGKPKRL RKAEKDILFQ ASYPKIEFKY
AKKNTILNIQ TTLQGYCAEL FILKKV