Gene CPR_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0996 
Symbol 
ID4205811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1137054 
End bp1138853 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content26% 
IMG OID642565553 
Productsensory box protein 
Protein accessionYP_698319 
Protein GI110802746 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2206] HD-GYP domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.133788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAA ATTTATTAGA GGTATTATTA GATAACATAC CTTATTCAGT ATGGTTAATA 
GGAATAGATG GTAGATTTAT TTTTGTGAAT AAATATTACT CAAATGCTTT AAATTTGAGT
AAAGAATATA TAATTGGTAA GAGTTTAAGT GAAATTTATA CTAAAGAGGT AGCAGATGAA
TATATTAAAA ATTATAATCT GGTAAGAGAT GAGGAAAAGC CAAATCTTTT TTCAGGATAT
GAAAATGGAT TAGGATATCA TGATGGATCA TTTTTAGAGT GTTATTTAGC CCCTATAAAA
GAAAATGGAG AAATAAAGTA TTTTTTAGGA ATTCTTCAAA ATCAAAGTGA AAGAAAAAAG
TATGAGGAAG AGTTAATAAA TCAAAAGGAA CTTTTAAATA CTCTTGTAGA ATCTATTCCA
GATGGAATTT ATCATAAAGA TAAGGATGGT AAGTATTTAA AATGTAATAA CACTTTAGTA
AAAGATTACT ATAAGATAAC TAAAGCAGAA ATAATAGGAA AAGATATAAA GAGTATTTAT
AAAAAGGCTT CCAATAGAAA AGGTATTTTT AAGGAAGAAA AAATACTAGA TAAGCTTATA
CTTCAAGATG ATAAAGTTAT AAATACCAAG AATAAATTAA AAGAAAAAAT AAAGATAGAA
TTAAATAGAA AGATAAGATA CATAGAGTCT ATTAAGGTAC CTGTTATAGA TAAAGGTGGA
GTAGTAACTG GTATTGTAGG GGTAGTTAGA GATGTAACTG AGAATGTTAT TTTAGAAAAT
AAATTAAAAA AAATGAGTTA TAGAGATAAA CTTACAGGGC TTTATAATAG AGCTTATTTC
GATGAAAAGT TAAAAGAATT AAATAATAAA GAGTTTTTTC CTTTAAGTTT TGTAATGGGA
GATTTAAATG GACTTAAGGT TATAAATGAT GCTATTGGAC ATTTAGAAGG AGATAAAATT
TTAAAAGAAA TTTCAAGGGT AATAAAAAAT TCTTGTAGAA AAGATGACCT TATATTTAGA
TGGGGAGGAG ATGAGTTTTG TATTCTTTTG CCTAAAACAA CAGAGGAAGA AGCAGAAGCT
ATATGTAATA GAATAAGGAA AAATTGTAAG CTTAATCATA AAACTATAAT TCCTTTAAGT
ATAGCACTTG GAGTATCTAG TAAAAAAGAA TCTAAAAAAC CAATAGATGA AGTTTTAGTA
GAAGCTGAAG ATAAAGTTTA TAGAGAAAAA TTGGTGAATG AAAAACGCAT AAAAAAGAAT
ATAATAGACT CATTAAATAA AGAGCTTTTT TTAAGACATG ATTATATCAA AGAACATATA
AATAGAGTTA AAAAGTATGC TGTTGAACTA GGTAAGAAGA TGAACTTATC AGAAAAGGAA
TTAAAAAATT TAAAAATGCT AGCTAAACTT CATGATATAG GCAAAGTAGG AATTCCTGAG
GAAATACTAT CTAAACCAGG TAAGTTAAAA AAAGAAGAGT ATGAAATAAT AAAAACTCAT
GCTGAAAAAG GATATAGAAT TGCTATGTTT AATCCAGAGT TTGAAAAAAT AGCACCATGT
ATATTAGCTC ATCACGAGAA ATATGATGGT ACTGGATATC CATTAGGACT AAAAGGTAAT
GATATTCCAC TACTTGCTAG AATTATAAAT GTAGTAGATT CTTATGATGC TATGACAAAT
AAAAGGGTCT ATAAAGGAAG CCTTAGTGCT GAAGAAGCTA AAAAGGAGCT TAAAAAGAAT
TCTGGTACTC AATTTGATCC TATGATTGTA GAGGAATTTT TAGAGTTAAA AAGAATTTAA
 
Protein sequence
MNSNLLEVLL DNIPYSVWLI GIDGRFIFVN KYYSNALNLS KEYIIGKSLS EIYTKEVADE 
YIKNYNLVRD EEKPNLFSGY ENGLGYHDGS FLECYLAPIK ENGEIKYFLG ILQNQSERKK
YEEELINQKE LLNTLVESIP DGIYHKDKDG KYLKCNNTLV KDYYKITKAE IIGKDIKSIY
KKASNRKGIF KEEKILDKLI LQDDKVINTK NKLKEKIKIE LNRKIRYIES IKVPVIDKGG
VVTGIVGVVR DVTENVILEN KLKKMSYRDK LTGLYNRAYF DEKLKELNNK EFFPLSFVMG
DLNGLKVIND AIGHLEGDKI LKEISRVIKN SCRKDDLIFR WGGDEFCILL PKTTEEEAEA
ICNRIRKNCK LNHKTIIPLS IALGVSSKKE SKKPIDEVLV EAEDKVYREK LVNEKRIKKN
IIDSLNKELF LRHDYIKEHI NRVKKYAVEL GKKMNLSEKE LKNLKMLAKL HDIGKVGIPE
EILSKPGKLK KEEYEIIKTH AEKGYRIAMF NPEFEKIAPC ILAHHEKYDG TGYPLGLKGN
DIPLLARIIN VVDSYDAMTN KRVYKGSLSA EEAKKELKKN SGTQFDPMIV EEFLELKRI