Gene CPR_0235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0235 
Symbol 
ID4204021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp287587 
End bp289914 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content25% 
IMG OID642564792 
ProductGGDEF/EAL domain-containing protein 
Protein accessionYP_697569 
Protein GI110801516 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0547392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATG GACCTATGCT CAATAAGATA AAACTTTTTT TTATAGTTCA AATTATATTA 
AGTTTTTTAG TTTTGGCAGG AACATTAATT TCTATATTGG TAATACCATC ACACTTGCCA
GACTACTTTA TATGGGCCAT AGCCTTTTCA TTTTGGGTTT TTCAATGTGG CTTTAGTACA
ATCTTTGCAT TTAGATATAG ATTTTCTAAA AGTTCTCTAA GCACCGTAAA ACTAAGCTTT
ATACTAATTG GAAGCATACA AATTTTAATA ACCTTAGCAA CAAACCTATT TAGTATATTT
GATCTTCCAA TTTTCAATAC ATTAATAATA AAAAACTTTC AACTTTTATT ATTATCATTA
ATATTTATTG TGATGATTTA TAAACTTATA AAAGATATGC TTAAATATAG GGTTTTAAAG
GATTATATTA TTCGTGATTC GTTTTTTACC TTTATACCTT TTGCTTTTAT TATTTATAAG
AACTCTAAAT ACTTTATATC TTCACTATAT TTCTTAGATG ATTCAGTATA TAGTTTAGCT
CTATTAAACA TAAATAGCTT AAGAATTTTT TTAGAAGGAT TTCTTATTAT ATTGATCTTT
GTAAAAACTA TTTATTTAAG AGGAAATAAA AAAAGTAGCT TTCTATTATT AATCTACTTT
TATATATCAA ATCTTAACTG CTATATAGTC TCATATTTTA ATAGTAAGTT GCTTTTTTAT
CATTACTTAT ATCAAGGTAT TGGAATCTAC TTTATATGTC CAATATTATT AATACTTTCC
ACATATTATT CAATTACAGA ACAAGAAAAA ACTCCAGATA AAGTAGAATT TGATGGTTCT
TATATGAATA TAAATAACTC ATGGATAAGT ATTACAATAG GTATTTTTCC AGTATCATTT
TCAGCCATAG ATATAATTGC ATCCTCTCTA GCAAGTGGTG ATTTTAACGG CATAGAAAGC
TTAGTAATTA TACTTCTTGT TATGATTTTA CTAGTATGTA GAGATATTAT ATTAAGAAAA
GATAATCTAA GATTAATTGC AAGTGTTAAG AGGGCTAATG AAATAGACTT TTTAACTAAT
ATGTATAGTA GAAATTATTT TTTAAAGAAA CTTGAAGCCA TGAATACTGA TTATGCACTT
TTCTTTATAG ACATAAAGAG CTTTAAGGAC ATTAATAATA TTTTTGGACA TGTTGTTGGT
GATAAGGTAT TAGTAGAAGT TGCAAATAAA TTTAAAGAAC TTAAACTACT TCTAGGTAAA
AGTGTTTTCT ACTGTAGATA TAGCGGAAAT GAATTTGCCT TAATCTCTAA ATTAGAGCAA
GTTGATCTTA TATTAAAATT TTTTAGAGAC CTTGAAAATA TAAAAATTCA ACATAACTCT
GATTTAATAC ATATAAACTT TAATATTGGA TATTCATTAA ATGATTCAAA TATTAACTTA
GACTTACTAA TTCAGGAAAC AGATTATGCT CTAATGAGAG CTAAGAAGCT TAAAGGTCCT
AATAACTCTA TACTTATGTA TGATGATAAT TTAAAAACGG ACTTAAAACA CATAAATATA
TTAAAAAAGG ACTTTATAAA TGATCTTTTA GCAGATAAGT TTCATTTAGT TTTCCAACCA
AAAGTTTGCG TAAAAACATT AAAAATAGTA GGTTTTGAAA CTCTACTTCG TTGGAAACAT
GATAAACTAG GTGAAATTTC TCCTTGTAAG TTTGTACCTA TAGCTGAAAA TTTAGGTGAA
ATTTCTTCCC TTGACTTGTA TGTTTTTAAA AAATCCTGTG AGTTTCAAAG ATGCTTAATA
GATATGGGAA TAGAAATAAA GTGCAGTGTA AATTTATCAT TAAACACTTT AAAATCCTAT
GAAAAAATAA ATGAGATTCT AAATATTTAT AAAAAATATA AAATTCCAAA ACATTTAATA
ACGGCAGAAA TATTAGAAAA TGTATCACTA AATAATAATG CTAAAACAAT CTCTTATATT
AATCTTCTTA GAGAAAATGG AATTTGTATA TCCATAGATG ATTTTGGTAC AGGCTATTCT
TCTCTTAGCC AAATATCTAA TCTGTATTTT GATGAATTAA AAATCCCAAG AGAGTTTGTA
ATAAATGCAA GAACCTCTAG TAAATTAGCT GTTATTGAAG CAATATCCAT ATTAGCAAAA
AAATTAAATG TAACCTCAGT TATTGAAGGG GTAGAAACTT ATGAAGATTT CAAATTATTT
AGTAGTTTAG GATTTGATAT TGTTCAAGGA TACTATTTCT CTAAACCCCT AACTAAATCT
GAAATAATAG ATTATATAAA AGTTAATATT AATAACAAAA TAAGCTAG
 
Protein sequence
MKHGPMLNKI KLFFIVQIIL SFLVLAGTLI SILVIPSHLP DYFIWAIAFS FWVFQCGFST 
IFAFRYRFSK SSLSTVKLSF ILIGSIQILI TLATNLFSIF DLPIFNTLII KNFQLLLLSL
IFIVMIYKLI KDMLKYRVLK DYIIRDSFFT FIPFAFIIYK NSKYFISSLY FLDDSVYSLA
LLNINSLRIF LEGFLIILIF VKTIYLRGNK KSSFLLLIYF YISNLNCYIV SYFNSKLLFY
HYLYQGIGIY FICPILLILS TYYSITEQEK TPDKVEFDGS YMNINNSWIS ITIGIFPVSF
SAIDIIASSL ASGDFNGIES LVIILLVMIL LVCRDIILRK DNLRLIASVK RANEIDFLTN
MYSRNYFLKK LEAMNTDYAL FFIDIKSFKD INNIFGHVVG DKVLVEVANK FKELKLLLGK
SVFYCRYSGN EFALISKLEQ VDLILKFFRD LENIKIQHNS DLIHINFNIG YSLNDSNINL
DLLIQETDYA LMRAKKLKGP NNSILMYDDN LKTDLKHINI LKKDFINDLL ADKFHLVFQP
KVCVKTLKIV GFETLLRWKH DKLGEISPCK FVPIAENLGE ISSLDLYVFK KSCEFQRCLI
DMGIEIKCSV NLSLNTLKSY EKINEILNIY KKYKIPKHLI TAEILENVSL NNNAKTISYI
NLLRENGICI SIDDFGTGYS SLSQISNLYF DELKIPREFV INARTSSKLA VIEAISILAK
KLNVTSVIEG VETYEDFKLF SSLGFDIVQG YYFSKPLTKS EIIDYIKVNI NNKIS