Gene CPF_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0801 
Symbol 
ID4203050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp948869 
End bp950743 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content27% 
IMG OID638081685 
Productsensory box protein/histidinol phosphate phosphatase family protein 
Protein accessionYP_695252 
Protein GI110799120 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family
[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain
[TIGR01856] histidinol phosphate phosphatase HisJ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0107221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACT ACTATTTAGA AGAGGTTGTG GATGAAATTT ACTATAAAGA AGGCAAAGAA 
AATAGAGAAT TAAAAATGGA TATCTTAAGT CTTAAGATAG AAAATGAATT GAAGATTATA
GAAAATTTAT GTGGCAATTG TAAAAGAAAA AATACCCTAT ATGAGGCTAT AAAGAATATA
AAAAGCTTAT TAAAACAATA TTATATAGTT TTTAATGGAA CTCAAGATGC ATTATTTTTA
GTTGAAATGC TTAAAGATGG AAATTTTAAA TATGTAAGAA ATAACAATGC ATATCTTCAG
GAATTTGGAC TTAAAGGAGA AGAGATTATA AATAAAACTC CAAAGGATGT ATTTGGGGAA
GAGCTTGGGA AAAGATTCTG TAATTATTAT AAAAATTGTA TAGAAGGTAA AAGAGTAGTT
GTATTTGAAG ATGACTTGTT TTTAAATGGG AAGAAAAGAG TGTTCTTAAC GAAACTACTT
CCTATTGTTG ATGAGGATGA TGGAGTTTTC ATAGTAGGTT CAAGAGAAGA TATTACAAAG
AGAAAAGAAA TGGAAATAGA GTTAGACAGA ATGGCTAATT ATGATGAATT AACTAATATT
CCTAATATGA GATTATTTTT TAAATCTTTC AGAAATACCA TAAACGAAAG CAAAAAGATG
GACAAGAAGT TTGCAGTTCT TTTTATAGAT TTAGATTGGT TTAAGGAAAT AAATGATAAT
TATGGACATG ATGTAGGGGA TGAAGTTCTA GTTTGTGCAG TTAAAAGAAT ATACAAATGT
TTAAGGAAGG GCGATATTCT AGGAAGAATA GGTGGAGATG AATTCGCAGC TATATTAAAG
GATATAAGTG ATAGAGAAGA AATTGAAAAA ATAGTTAAAG ATATTCAAAA CTCCTTAAGA
AAAAGAATAA AAATAGGAGA TGTTACATGT AATATTGACT CATCCATAGG TATAACAATA
TTTCCAGAGG ATGGAGAAAA AATAGAAGTA CTTATGAGAA ATTCTGATAA GGCTATGTAT
AAAGTAAAAA ATAGAGAAAA GGGAGGATAT AGATTTTTTA ATAATATGAT TAGAGAATAT
AACTTTAAAA ATAAAGATGG TCATGTCCAT ACAAAATATT GCCCTCATGG TAGTGACGAT
AATATTGAAG ATTATATTGA AGAAGCTATA AAATATAAGT TAGATGAAAT AAGTTTTACT
GAGCACCTTC CTCTTCCTGA AAATTTTATT GATCCTTCGC CTTTGAAAGA TAGTGCTATG
AAATTAGAAG AAATGGAATC CTATTTAAAA GAAGGACACG AGTTACAAGA AAAATATAAG
GATAAGATAA AAGTAAATAT TGGAGTAGAA GTAGACTATA TAGAAGGTTA CGAGATTGAA
ACAGAATTGC TTTTAAATAA ATATGGCAAA TACTTAAATG ATGGGATTTT ATCCGTTCAT
ATGATAAAAG GAAATAAAAG ATATTATTGT ATAGATTTTA GCGAAAAGGA ATTTAAAAAA
ATCATAGATG ATTTAGGTTC TTTAGAGAAA GTATATAATA AATATTACGA TACATTAATT
ATGGCCTTAA AAAGTGATTT AGGACCATAC AAACCAAAAA GAATAGGACA CTTAAATCTA
GTAAGAAGGT TTAATAAGGA ATTTCCCTAT AACTACGAAA AACATATTTC AAAGATAGAG
GAAATATTAG ATCTTATAAA AGAAAAAGGA TATGAACTTG ATTTTAACAT TGCAGGTTTA
AGAAAAAAAG AGTGTAATGA ATTCTATATA GAAGGAAAAG TTCTAGAAAT GGCAATTGAG
AAAGGCGTAC CTATGGTTTT AGGAAGTGAT TCTCATTCTG CTAAATATAT AAAATGTATA
AAAGAATTCT TATAA
 
Protein sequence
MKNYYLEEVV DEIYYKEGKE NRELKMDILS LKIENELKII ENLCGNCKRK NTLYEAIKNI 
KSLLKQYYIV FNGTQDALFL VEMLKDGNFK YVRNNNAYLQ EFGLKGEEII NKTPKDVFGE
ELGKRFCNYY KNCIEGKRVV VFEDDLFLNG KKRVFLTKLL PIVDEDDGVF IVGSREDITK
RKEMEIELDR MANYDELTNI PNMRLFFKSF RNTINESKKM DKKFAVLFID LDWFKEINDN
YGHDVGDEVL VCAVKRIYKC LRKGDILGRI GGDEFAAILK DISDREEIEK IVKDIQNSLR
KRIKIGDVTC NIDSSIGITI FPEDGEKIEV LMRNSDKAMY KVKNREKGGY RFFNNMIREY
NFKNKDGHVH TKYCPHGSDD NIEDYIEEAI KYKLDEISFT EHLPLPENFI DPSPLKDSAM
KLEEMESYLK EGHELQEKYK DKIKVNIGVE VDYIEGYEIE TELLLNKYGK YLNDGILSVH
MIKGNKRYYC IDFSEKEFKK IIDDLGSLEK VYNKYYDTLI MALKSDLGPY KPKRIGHLNL
VRRFNKEFPY NYEKHISKIE EILDLIKEKG YELDFNIAGL RKKECNEFYI EGKVLEMAIE
KGVPMVLGSD SHSAKYIKCI KEFL