Gene CPR_1035 details

Gene Information

Locus tag: CPR_1035
Symbol:
ID: 4204943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1177773
End bp: 1179407
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 21%
IMG OID: 642565592
Product: GGDEF domain-containing protein
Protein accession: YP_698358
Protein GI: 110801973
COG category: [T] Signal transduction mechanisms
COG ID: [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
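
The start, end, gene length and protein length fields above are mutually consistent. A minimal sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1177773, 1179407)
print(length, protein_length_aa(length))
```

With the coordinates from this record, the result agrees with the listed 1635 bp and 544 aa.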


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00577049
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AAAAAAAGAA ATATTTATTA ATTTTATTAA TTTTTTTAAT AGTTATTAAT 
TTATTTATAT TTTCTTTTTA TAAATTTAAA TCAAATAAAA AAAATAAAAA ATATATAGAG
AAAACTAATT CAATTTTATA TGAATTTAAA TTATATATGA AAAAAAGAGA TTTTAAAAAT
TGTGAAAATA AACTTTACTC AATTGTAAAT AATAAAGATA TGTTTTATAG TTTACCTGAT
AATTTAAAGT TCGAGATTTA TAATTATTTA GCTATAATAA ATCTTCAACA AGAAAAATTT
TTAAATGCTT TATCTTATTA TGAAGAAGCC TTTAAATATG CTGATAAAGA GTCAAAGATA
ATTATTAAGC TAAACATGAC ATCTGCTTAT AGATATATGG GGGCCTATGT TACTGCTACT
AATATTTTAA ATAAAATGAT AGAGTCATCA TTAACTTTTA AAAATGAAAA TTCTTATCTT
AAAGAATATA CTCTATTAAA TTTAGCTGAA ACTTATTTTG CTGTAAATGA TATAACTGAC
TTTAATTCTA CAATAGAAAC TATATCGAAA TATTATTGTG GGCCTGAAAA TGAGTTAGCA
GATCTAAAAA TACTTTTAGA TTCCTATTTA ATAATAAAAG CAATATCAGA GAATAACCTT
GATTTAGTAC CAAAATACAT TTCTGAAATA GATGAATTAG AAAGTAAAAA TAAAGATCTC
ATATATTCAG AATTAGAAAT GATTAAACTT CGTTCATATG GTATGTATTA TGAAAGCATA
GGAAATTTCA ATTTAGCATT AGATTATTTT TCCAAATTAG AAAAATCAGC TGATAATGAA
GGTGCTTCTT ATGTTTCGCT TTTTTCTATA AGCAAAAGAA TTTCTATCTA TAAAAAATTA
AATGATACTT ATAAAATAAA TTATTTAATA AATAAATATT ATGAAAAACA AACTTCAATA
AATGACATAA ATAATAATGA ATTTAAATAC TATATAGATA ATAAAATTAT AAACAATCAT
GAGTTACCAT TTTTAAAAGA AACAATTATT ATTTTAACAA TTCTATTTTT AATCTCTATT
TTATTAGTGC TTTTTTACTT AAAGAAAGCT CGGGATTCAA AATTAGATTC TTTAAAAGAT
GGACTTTGCA ATATTTACAA TAGGCGTTTT TTAGACTCTT ATATAAATAA TTTAAAAGAA
AAGGATCTAC CTATTTCTTT TTTAATGATA GATGTAGATT ATTTTAAACT TTATAATGAT
AACTATGGTC ATCAAGCAGG TGATTTTGTG CTAAAAAGCA TATCTTCTGT ATTAAAAAGA
AATTCTCGTA AGGAAGATAT AGTTTCACGT TATGGAGGAG AAGAATTTTG TGTTTTACTA
AAAGGAGCTT CTAAACATTC TTCTATTAAT TACGCTAAAA GAATCAAAGA AAATTTAGAT
AATTTAAATA TAAAGCATAA ATACTCAAAA ATTTCTAACA ATGTAACCTT TAGTATAGGA
ATATATACTA CATATACTAA AAACGATCTA AAAAATGCTA TTAAACTTTC TGATAAAGCA
CTATATATAT CTAAAACAAG AGGGAGAAAT ACATATACTT ATCTAGAAGA TAACTCTTCT
GATTCTTCTA ATTAA
 
Protein sequence
MKNKKKKYLL ILLIFLIVIN LFIFSFYKFK SNKKNKKYIE KTNSILYEFK LYMKKRDFKN 
CENKLYSIVN NKDMFYSLPD NLKFEIYNYL AIINLQQEKF LNALSYYEEA FKYADKESKI
IIKLNMTSAY RYMGAYVTAT NILNKMIESS LTFKNENSYL KEYTLLNLAE TYFAVNDITD
FNSTIETISK YYCGPENELA DLKILLDSYL IIKAISENNL DLVPKYISEI DELESKNKDL
IYSELEMIKL RSYGMYYESI GNFNLALDYF SKLEKSADNE GASYVSLFSI SKRISIYKKL
NDTYKINYLI NKYYEKQTSI NDINNNEFKY YIDNKIINNH ELPFLKETII ILTILFLISI
LLVLFYLKKA RDSKLDSLKD GLCNIYNRRF LDSYINNLKE KDLPISFLMI DVDYFKLYND
NYGHQAGDFV LKSISSVLKR NSRKEDIVSR YGGEEFCVLL KGASKHSSIN YAKRIKENLD
NLNIKHKYSK ISNNVTFSIG IYTTYTKNDL KNAIKLSDKA LYISKTRGRN TYTYLEDNSS
DSSN
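
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last few codons of the gene. It assumes the standard amino-acid assignments, which translation table 11 shares for elongation codons, and it ignores alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGAAAAATA AAAAAAAGAA ATATTTATTA"))
print(translate("GATTCTTCTA ATTAA"))
```

The two fragments translate to the first ten residues (MKNKKKKYLL) and the last four residues (DSSN) of the protein sequence shown above.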