Gene CPR_1637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1637 
Symbol 
ID4204054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1828902 
End bp1830281 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content31% 
IMG OID642566187 
ProductPhoH family protein 
Protein accessionYP_698952 
Protein GI110801461 
COG category[T] Signal transduction mechanisms 
COG ID[COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.169922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGAAAA CTTATATATT AGATACTAAT GTATTGCTAT ATTCGCCAGG AGCTATATAT 
TCATTTGAAG ATAACAATGT TATTATCCCA GAGGTGGTTT TAGAGGAATT AGACAATATT
AAAAAAATTA ATAATGACTT AGGAGCTAAT GCTAGGCACG TTGCAAGAGA ATTAGATAAG
CTTAGATTAA GTGGGTCATT AAGCGAAGGA GTTGATTTAC CTAAAGGCGG AAAGCTTAAG
GTGGTTACTA ACTTTTATAA TACTGAAATA CCAGAAGCAT GGAATATAAG CAAGCCTGAT
AATAGAATAA TACAAATTTG TAAGGCCTTA AAGGAAAAGG GAGAAGATGT TTGTTTAATA
ACTAAGGATA TATTTGAGAG AATAAAGGCT GATACTGTTG GCATAAAATC AGAAGATTTT
TATGAGGTTG TAGTTCCAGA ATTTGAAGAA CAATATAGTG GAAGAATGGA AGTTTACACA
AGTTCTGAGT GCTTAAGTAA ATTCTTTAAA AATAAAGTTA TGGAGAAAAA AGATTTAACT
TTCTATGATG AAGAAAATAA GTGCTATGTG GAACCAAAAT TAGAGATTAA TCAATTTTTA
ATAATACACT GTAATGATAA TGATAAACAA ACTGCCTTAG GAAGATTTGA TGGAAAAGTT
ATAAGACCTT TACTTTATAA AGATAACAAT AATATTATGG GAATAAGCCC TAGAAATGTT
GGCCAAAAAT TTATGCTAGA ATGTTTGAGC ATGGATGCTA AAAAGGCACC TTTAGTAATT
ATAAAGGGGC CAGCAGGAAC AGCAAAAACT TTATTTTCAT TAGCTGTGGG ATTACAAAAG
ATATTAGAAG AGGAAAGTGG ACAATATAGA AGGATTTTAG TTTGTAGACC TAATGTAACA
ATGGATGAAG AAATAGGGTA TTTGCCAGGA ACCGAGCAAG AAAAAATAGC TCCATTTATG
AGGCCTATAT ATGATAATTT AGAGATTTTA ATAGATTCAG ATGAGAAAGA AAGATATTCA
AATGAGAGGG AGCTAAATGA TAAAATAGAG GAACTTTTTG AAAGAAAGAT AATAACAACA
GAGGCTGTGG CTTATTTAAG AGGAAGAAGT ATAATAAAGA ATTGGATTAT AATAGATGAA
GCACAAAACT TAACACCAAA GCAGGTTAAA GCAATTATAA CAAGAGCGGG AGAAGGATCA
AAAATAATAC TAGTTGGAGA TCCAGAGCAA ATAGATCAAG CTTTCTTAGA TTCAAGAAGT
AATGGACTTT GTTATGCTTC AGAAAAGATG AAAGGAAGTC ATCTTTGCTA CCAAGTTACC
TTAAAATATG ACGAATGTGA GAGAAGTGAA TTGGCTTATG AAGCTGCAAA AAGATTATAA
 
Protein sequence
MKKTYILDTN VLLYSPGAIY SFEDNNVIIP EVVLEELDNI KKINNDLGAN ARHVARELDK 
LRLSGSLSEG VDLPKGGKLK VVTNFYNTEI PEAWNISKPD NRIIQICKAL KEKGEDVCLI
TKDIFERIKA DTVGIKSEDF YEVVVPEFEE QYSGRMEVYT SSECLSKFFK NKVMEKKDLT
FYDEENKCYV EPKLEINQFL IIHCNDNDKQ TALGRFDGKV IRPLLYKDNN NIMGISPRNV
GQKFMLECLS MDAKKAPLVI IKGPAGTAKT LFSLAVGLQK ILEEESGQYR RILVCRPNVT
MDEEIGYLPG TEQEKIAPFM RPIYDNLEIL IDSDEKERYS NERELNDKIE ELFERKIITT
EAVAYLRGRS IIKNWIIIDE AQNLTPKQVK AIITRAGEGS KIILVGDPEQ IDQAFLDSRS
NGLCYASEKM KGSHLCYQVT LKYDECERSE LAYEAAKRL