Gene CPF_1164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1164 
Symbol 
ID4203624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1330169 
End bp1331968 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content27% 
IMG OID638082045 
Productputative sensory box-containing diguanylate cyclase 
Protein accessionYP_695610 
Protein GI110800284 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0260621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAA ATTTATTAGA GGTATTATTA GATAACATAC CTTATTCAGT ATGGTTAATA 
GGAATAGATA GTAGATTTAT TTTTGTAAAT AAATATTACT CAAATGCTTT AAAGTTGAGT
AAAGAAGAGA TAATTGGAAA GAGTTTAAGT GAAATTTATA CTAAAGAAGT GGCAGATGAA
TATATTGGAA ATTATAAGTT AGTAATAGAA GAAGGAAAGC CAAAACTTTT TTCAGGATAT
GAAAATGGAT TAGGATATCC AAATGGAGCT TTTCTAGAAT GTTATTTAGC ACCTATAAAA
GAAAATGGAG AGATAAAGTG TTTTTTAGGA ATTCTTCAAG ATCAAAGTGA AAGAAAAAAG
TATGAAGAAG AGCTCATAAA TCAAAAGGAA CTTTTAAATA CTATTGTGGA ATCTATTCCA
GATGGGATTT ATCATAAAGA TAAGAATGGA AAATATTTAA AATGTAATGA TACTTTAGTT
AATGAATATT ATAAAAAAAA TAGAGAAGAA ATAATTGGGA AAGACATAAA GCGTATTTCT
AAAAAATCTT CTAATAGAAA TAGCCTTTTT AAGGGAGAAA AAATACTAGA TGATTTTATA
GCTCAAGATA AAGAAGTTAT AAATACAAAA AAGAAAGTCA AGGAAAAAAT ACGAGTAAAG
CTTAATAGAA AAGTAAAGTA TATAGAATCT ATTAAAGTGC CTGTTATAGA TAACAATGGC
AGTGTAACAG GTATTGTAGG TGTAGTGAGA GATGTAACTG AGAATGTTGT TTTAGAAAAT
AAATTAAAAA AAATGAGTTA CAGAGATAAA CTTACAGGAC TTTATAATAG AGCTTATTTT
GATGAAAAGC TAGAAGAATT AAATAATGAA GAGTTTTTCC CTTTAAGCTT TGTAATGGGA
GATTTAAATG GACTTAAGGT TATAAATGAT GCTATTGGAC ATTTAGAGGG AGATAAAATT
TTAAAAGAAA TTTCAGGGGT AATAAAAAAT TCTTGTAGAA AAGATGATCT TATATTTAGA
TGGGGTGGAG ATGAGTTTTG TATTCTTTTG CCTAAGACAA CAGAGGAAGA AGCAGAGGCT
ATATGTAATA GAATAAGGAA AAATTGCAAG CTCAATCATA AAACTATAAT TCCTTTAAGT
ATAGCACTTG GAGTATCTAG TAAAAAAGAG GCTAAAAAAC CAATAGATGA AGTTTTAGTA
GAAGCTGAAG ATAAAGTTTA TAGAGAAAAA TTGGTGAATG AAAAACGTAT AAAAAAGAAT
ATAATAGACT CATTAAATAA AGAGCTTTTT TTAAGACATG ATGATATTAA AGATCATATA
AATAGAGTTA AAAAGTATGC TGTTGAACTA GGTAAGAAGA TGAACTTATC AGAAAAGGAA
TTAAAAAAAT TAAAAATGCT AGCTAAACTT CATGATATAG GCAAAGTAGG AATTCCTGAG
GAAATACTAT CTAAGCCAGG TGAGTTAACA AAAGAAGAGT ATGAAATAAT AAAAACTCAT
GCTGAAAAAG GATATAGAAT TGCTATGTTT AATCCAGAGT TTAAAAAAAT AGCACCATGC
ATATTAGCTC ACCATGAAAG ATATGATGGT ACTGGATATC CATTAGGACT AAAAGGTAAT
GATATTCCAC TACTTGCTAG AATTATAAAT GTAGTAGATT CTTATGATGC TATGACAAAT
AAAAGGGTCT ATAAAGGAAG CCTTAGTGCT GAAGAAGCTA AAAAGGAGCT TAAAAAGAAT
TCTGGTACTC AATTTGATCC TATGATTGTA GAGGAATTTT TAGAGTTAAA AAGAATTTAA
 
Protein sequence
MNSNLLEVLL DNIPYSVWLI GIDSRFIFVN KYYSNALKLS KEEIIGKSLS EIYTKEVADE 
YIGNYKLVIE EGKPKLFSGY ENGLGYPNGA FLECYLAPIK ENGEIKCFLG ILQDQSERKK
YEEELINQKE LLNTIVESIP DGIYHKDKNG KYLKCNDTLV NEYYKKNREE IIGKDIKRIS
KKSSNRNSLF KGEKILDDFI AQDKEVINTK KKVKEKIRVK LNRKVKYIES IKVPVIDNNG
SVTGIVGVVR DVTENVVLEN KLKKMSYRDK LTGLYNRAYF DEKLEELNNE EFFPLSFVMG
DLNGLKVIND AIGHLEGDKI LKEISGVIKN SCRKDDLIFR WGGDEFCILL PKTTEEEAEA
ICNRIRKNCK LNHKTIIPLS IALGVSSKKE AKKPIDEVLV EAEDKVYREK LVNEKRIKKN
IIDSLNKELF LRHDDIKDHI NRVKKYAVEL GKKMNLSEKE LKKLKMLAKL HDIGKVGIPE
EILSKPGELT KEEYEIIKTH AEKGYRIAMF NPEFKKIAPC ILAHHERYDG TGYPLGLKGN
DIPLLARIIN VVDSYDAMTN KRVYKGSLSA EEAKKELKKN SGTQFDPMIV EEFLELKRI