Gene CPR_0788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0788 
Symbol 
ID4206396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp912663 
End bp914537 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content26% 
IMG OID642565347 
Productsensory box protein/histidinol phosphate phosphatase family protein 
Protein accessionYP_698113 
Protein GI110803561 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family
[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain
[TIGR01856] histidinol phosphate phosphatase HisJ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0208583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACT ATTATTTAGA AGAGGTTGTG GATGAAATTT ACTATAAAGA AGGTAAAGAA 
AATAGAGAAC TAAAAATGGA TATTTTAAGT TTAAAGATAA AAAATGAATT AAATATCATA
GAAAATTTAT GTGGTAATTG TAAAAGAAAA AGTAATTTAT ATGAAGCTAT AAAGAATGTA
AAAAGCTTAT TAAAACAGTA CTATATAGTT TTTAATGGAA CTCAAGATGC ATTATTTTTA
GTTGAAATGC TTAAAGATGG AAATTTTAAA TATGTAAGAA ATAACAATGC ATATCTTCAG
GATTTTGGAC TTAAAGGAGA AGAAATTATA AATAAAACAC CAAAGGATGT TTTCGGGGAA
GAACTTGGAC AAAGATTCTG CAATTATTAT AAAAAATGCA TAGAGAATAA AAAAGTAATT
GTATTTGAAG ATGATTTGTC TTTAAATGGT AAGAAAAGAG TGTTCTTAAC GAAACTACTT
CCTATTATTG ACGAGGATGA TGGTGTTTTT ATAGTAGGTT CAAGAGAAGA TATTACAAAG
AGAAAAGAAA TGGAAATAGA GTTAGACAGA ATGGCTAATT ATGATGAATT AACTAATATT
CCAAATATGA GATTATTTTT TAAATCTTTT GGAAATACAA TAAATGAAAG TAAAAAGAAA
GGAAAGAAGT TTGCAGTTCT TTTTATAGAT TTAGACTGGT TTAAGGAAAT AAATGATAAC
TTTGGACATG ATGTAGGGGA TGAGGTTCTA GTTTGTGCAG TTAAAAGAAT ATATAAATGT
TTAAGGAAGG GCGATATTTT AGGAAGAATA GGCGGAGATG AATTTGCTGC TATACTAAAG
GATATAAGTG ATAAAGAAGA AATTGAAAAA ATAGTTAAAG ATATTCAAAA CTACCTAAGA
AAAAGAATAA AAATAGGGGA TGTTACATGT AATATTGATT CATCCATAGG TATAACAATA
TTTCCAGAAG ACGGAGAAAA AATAGAAGTA CTTATGAGAA ATTCTGATAA GGCTATGTAT
AAAGTAAAAA ATAGAGAAAA GGGAGGATAT AGATTTTTTA ATAATATGAT TAGAGAATAT
AACTTTAAAA ATAAAGATGG TCATGTACAT ACAAAATACT GTCCTCATGG TAGTGATGAT
AATATTGAAG ATTATATTGA AGAAGCTATA AAATATAAAT TAGATGAAAT AAGTTTTTTA
GAGCATCTTC CTCTTCCTGA AAGTTTTATT GATCCTTCAC CTTTAAAAGA TAGTGCTATT
AAATTAGAAG AAATGGAATC ATATTTAAAA GAAGGAAATA AATTGAGAGA GGGATATAAA
AGTAAGATAA AAGTAAATGT TGGAGTAGAG GTAGACTATA TAGAAGGTTA CGAGATTGAA
ACAGAATTGC TTTTAAATAA ATATGGAAAA CACTTAAATG ACAGTATTTT ATCCGTTCAT
ATGATAAAAG GAAATAAAAG ATATTATTGT ATAGATTTTA GCGAAAGTGA ATTTAAAAAG
ATCATAGATG ATTTAGGTTC TTTAGAAAAG GTATATGATA AATATTATGA TACATTAATT
ATGGCCTTAA AAAGTGATTT AGGACCATAC AAACCAAAAA GAATAGGTCA TTTAAATCTA
GTAAGAAGAT TTAATAAGGA GTTTCCATAT AATTATGAAA AACATATTTC AAAGATAGAG
AAAATATTAG ATATTATAAA AGAAAAAGGC TATGAGCTTG ATTTTAACAT TGCAGGCTTA
AGAAAAAAAG AATGTAATGA ATTTTACATA GAAGGAAAAG TCTTAGAAAT GGCAATTGAG
AAAGATATAC CTATGGTTTT AGGAAGTGAT TCTCATTCTG CTAAATATAT AAAATGTATA
AAAGAGTTTT TATAA
 
Protein sequence
MKNYYLEEVV DEIYYKEGKE NRELKMDILS LKIKNELNII ENLCGNCKRK SNLYEAIKNV 
KSLLKQYYIV FNGTQDALFL VEMLKDGNFK YVRNNNAYLQ DFGLKGEEII NKTPKDVFGE
ELGQRFCNYY KKCIENKKVI VFEDDLSLNG KKRVFLTKLL PIIDEDDGVF IVGSREDITK
RKEMEIELDR MANYDELTNI PNMRLFFKSF GNTINESKKK GKKFAVLFID LDWFKEINDN
FGHDVGDEVL VCAVKRIYKC LRKGDILGRI GGDEFAAILK DISDKEEIEK IVKDIQNYLR
KRIKIGDVTC NIDSSIGITI FPEDGEKIEV LMRNSDKAMY KVKNREKGGY RFFNNMIREY
NFKNKDGHVH TKYCPHGSDD NIEDYIEEAI KYKLDEISFL EHLPLPESFI DPSPLKDSAI
KLEEMESYLK EGNKLREGYK SKIKVNVGVE VDYIEGYEIE TELLLNKYGK HLNDSILSVH
MIKGNKRYYC IDFSESEFKK IIDDLGSLEK VYDKYYDTLI MALKSDLGPY KPKRIGHLNL
VRRFNKEFPY NYEKHISKIE KILDIIKEKG YELDFNIAGL RKKECNEFYI EGKVLEMAIE
KDIPMVLGSD SHSAKYIKCI KEFL