Gene CPF_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1220 
Symbol 
ID4202635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1384213 
End bp1385850 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content22% 
IMG OID638082101 
ProductGGDEF domain-containing protein 
Protein accessionYP_695666 
Protein GI110799538 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.12727 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA GAAAAAAGAA ATATTTTTTA ATTTTATTAA TTTTTTTAAT AGTTATTAAT 
TTATTTATAT TTTCTTTTTA TAAATTTAAA GTAAATAAAA AAAATGAAAA ATATATAAAA
AAAACTAATG TAATTTTATA TGAATCTAAA TTAGAGATGA AAAAAAGAAA TTTTAAAGAT
TCTGAAAATA AACTTTATTC AATTATAGAT AATAAAGATA TGTTTTATAG TTTACCTGAT
AATTTAAAGT TCGAAATTTA TAATTATTTA GCTATAATAA ATCTTCAACA AGAAAAATTT
TTAAATGCTT TATCTTATTA TGAAGAAGCC TTTAAATATG CTGATAAAGA GTCAAAGATA
ATTATTAAAC TAAACATGAC ATCTGCTTAT AGATATATGG GGGCCTATGT TACTGCTACT
AATATTTTAG ATAAAATGCT AGATTCATCA TTATTATTTA GGAATGAAGA TTCTTACCTT
AAAGAATATA CTTTATTAAA CTTAGCTGAA ACTTATTTTG CTGTAAATGA TATGACTGAT
TTTAACTCTA CAATAGCAAA GGCATCTAAC TCTTATTATT ATGGCCCTGA AAATGATTTA
GAAGACTTAA AAATCCTTTT AGATTCCTAT TTAATAATAA AAGCAATATC AGAAAATAAT
CTTGATTTAG TACCAAACTA TATTTCTGAA ATAGAAGAAT TAGAGATTAA AAATAAAGAT
GTTATATATT CTGAATTAGA AATGATTAAA ACTCGTTCTT ATGGTATGTA CTATAAAAGT
ATTGGAGATT TTGATCTAGC ATTAGACTAC TTTTCAAAAC TAGAAAAATT AGCTGATAAT
GAAGGTGCTT CTTATGTTTC ACTTTTTTCT ATCAGCGAAA GAATATCAAT TTATAGAAAA
CTAAATGATA ATAAGCAGGT AGATTCTTTA ATAAATAAAT ATTATGAAAA ACAAACTTCA
ATAAATGATA TAAATAACTA TGAATTTAAA TATTATATAG ATAATAAAAT TATAAACAAT
CACGAATTAC CATTTTTAAA AGAGACCATT ATTATTTTGA TAATTCTATT TTTAACCTCT
ATTCTATTAG TGCTTTTTTA CTTAAAAAAA GCTAGGGATT CAAAATTAGA TTCTTTAAAA
GATGGGCTTT GCAATATTTA TAACAGGCGT TTTTTAGACT CTTACATAAA TAATTTAAAA
GAAAAAGATT TGCCTATTTC TTTTCTAATG ATAGATGTAG ATTATTTTAA ACTTTATAAT
GATAATTATG GTCATCAAGC TGGTGATTTT ATACTAAAAA GTATAGCCTC TGTACTTGAA
AAAAACTCTC GTAAAGAAGA TATAGTTGCA CGTTATGGAG GGGAAGAATT TTGTGTTTTA
CTAAAAGGTG CTTCTAAGTA TTCTTCTATT AACTACGCTA AAAGAATCAA AGAAAATTTA
GATAATTTAA ATATAAAACA TAAATATTCA AAGACTTCAG ACCATGTAAC CTTTAGTATT
GGAATATATA CTACATATAC TAAAAATGAT CTGAAAAATG CAATTAAATT TTCTGATAAA
GCACTATATA TATCTAAAAC AAGAGGAAGA AATACCTATA CTTATCTAGA AGATAACTCT
TCTGATTCTT CTAATTAA
 
Protein sequence
MKNRKKKYFL ILLIFLIVIN LFIFSFYKFK VNKKNEKYIK KTNVILYESK LEMKKRNFKD 
SENKLYSIID NKDMFYSLPD NLKFEIYNYL AIINLQQEKF LNALSYYEEA FKYADKESKI
IIKLNMTSAY RYMGAYVTAT NILDKMLDSS LLFRNEDSYL KEYTLLNLAE TYFAVNDMTD
FNSTIAKASN SYYYGPENDL EDLKILLDSY LIIKAISENN LDLVPNYISE IEELEIKNKD
VIYSELEMIK TRSYGMYYKS IGDFDLALDY FSKLEKLADN EGASYVSLFS ISERISIYRK
LNDNKQVDSL INKYYEKQTS INDINNYEFK YYIDNKIINN HELPFLKETI IILIILFLTS
ILLVLFYLKK ARDSKLDSLK DGLCNIYNRR FLDSYINNLK EKDLPISFLM IDVDYFKLYN
DNYGHQAGDF ILKSIASVLE KNSRKEDIVA RYGGEEFCVL LKGASKYSSI NYAKRIKENL
DNLNIKHKYS KTSDHVTFSI GIYTTYTKND LKNAIKFSDK ALYISKTRGR NTYTYLEDNS
SDSSN