Gene CPF_1919 details


Gene Information

Locus tag: CPF_1919
Symbol:
ID: 4201089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2154852
End bp: 2156231
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 31%
IMG OID: 638082788
Product: PhoH family protein
Protein accession: YP_696352
Protein GI: 110800819
COG category: [T] Signal transduction mechanisms
COG ID: [COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH
TIGRFAM ID:
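
The start and end coordinates, gene length, and protein length in this record are mutually consistent, assuming inclusive 1-based coordinates and a CDS that ends in a single stop codon. A minimal sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2154852, 2156231)
print(length, protein_length(length))  # 1380 459
```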


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0452134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGAAAA CTTATATATT AGATACCAAT GTATTGCTAT ATTCACCAGG AGCTATATAT 
TCATTTGAAG ATAACAATGT TATTATTCCA GAGGTGGTTC TAGAGGAATT AGACAATATT
AAAAAAATGA ATAATGATTT AGGAGCTAAT GCTAGGCATG TTGCAAGAGA ATTAGATAAA
CTTAGATTAA GTGGGTCATT AAGCGAAGGA GTTAATTTAC CTAAAGGCGG AAAGCTTAAG
GTGGTTACTA ACTTTTACAA TACTGAAATA CCAGAAGCAT GGAATATAAG CAAGCCTGAT
AATAGAATAA TACAAATTTG TAAGGCCTTA AAGGAAAAGG GAGAAGATGT TTGTTTAATA
ACTAAGGATA TATTTGAGAG AATAAAGGCT GATACTGTTG GCATAAAATC AGAGGATTTT
TATGAGGTTG TAGTTCCAGA ATTTGAAGAA CAATATAGTG GAAGAATGGA AGTTTACACA
AGTTCTGAAT GCTTAAGTAA ATTCTTTAAA AATAAAGTTA TGGAGAAAAA AGATTTAACT
TTCTACGATG AAGAAAATAA GTGCTATGTG GAACCAAAAT TAGAGATTAA TCAATTTTTA
ATAATACACT GTAATGATAA TGATAAACAA ACTGCCTTAG GAAGATTTGA TGGAAAAGTT
ATAAGACCTT TACTTTATAA AGATAACAAT AATATTATGG GAATAAGTCC TAGAAATGTT
GGCCAAAAAT TTATGCTAGA ATGTTTGAGC ATGGATGCTA AAAAGGCACC TTTAGTAATT
ATAAAGGGGC CAGCAGGAAC AGCAAAAACT TTATTTTCAT TAGCTGTGGG ATTACAAAAG
ATATTAGAAG AGGAAAGTGG ACAATATAGA AGGATTTTAG TTTGTAGACC TAATGTAACA
ATGGATGAAG AAATAGGGTA TTTGCCAGGA ACTGAGCAAG AAAAAATAGC TCCATTTATG
AGACCTATAT ATGATAATTT AGAGATTTTA ATAGATTCAG ATGAGAAAGA AAGATATTCA
AATGAGAGGG AGCTAAATGA TAAAATAGAG GAACTTTTTG AAAGAAAGAT AATAACAACA
GAGGCTGTGG CTTATTTAAG AGGAAGAAGT ATAATAAAGA ATTGGATTAT AATAGATGAA
GCACAAAACT TAACACCAAA GCAGGTTAAA GCAATTATAA CAAGAGCTGG AGAAGGATCA
AAAATAATAC TAGTTGGAGA TCCAGAGCAA ATAGATCAAG CTTTCTTAGA TTCAAGAAGT
AATGGACTTT GTTATGCTTC AGAAAAGATG AAAGGAAGTC ATCTTTGCTA CCAAGTTACC
TTAAAATATG ACGAATGTGA GAGAAGTGAA TTGGCTTATG AAGCTGCAAA AAGATTATAA
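
The GC content reported for this gene can in principle be recomputed from the sequence block above. The sketch below assumes GC content is the fraction of G and C bases over all bases; the record does not state how its value was derived or rounded. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)
```

Passing the full gene sequence, with or without the spaces and line breaks, gives a value that can be compared against the GC content field.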
 
Protein sequence
MKKTYILDTN VLLYSPGAIY SFEDNNVIIP EVVLEELDNI KKMNNDLGAN ARHVARELDK 
LRLSGSLSEG VNLPKGGKLK VVTNFYNTEI PEAWNISKPD NRIIQICKAL KEKGEDVCLI
TKDIFERIKA DTVGIKSEDF YEVVVPEFEE QYSGRMEVYT SSECLSKFFK NKVMEKKDLT
FYDEENKCYV EPKLEINQFL IIHCNDNDKQ TALGRFDGKV IRPLLYKDNN NIMGISPRNV
GQKFMLECLS MDAKKAPLVI IKGPAGTAKT LFSLAVGLQK ILEEESGQYR RILVCRPNVT
MDEEIGYLPG TEQEKIAPFM RPIYDNLEIL IDSDEKERYS NERELNDKIE ELFERKIITT
EAVAYLRGRS IIKNWIIIDE AQNLTPKQVK AIITRAGEGS KIILVGDPEQ IDQAFLDSRS
NGLCYASEKM KGSHLCYQVT LKYDECERSE LAYEAAKRL
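
The protein sequence corresponds to the gene sequence translated with translation table 11. The gene begins with TTG, which encodes leucine internally but is read as methionine when it is used as a start codon. The sketch below is an illustration under those assumptions, not the database's own pipeline: the codon table is the standard amino acid assignment, and the start codon set is my understanding of NCBI table 11. All names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI "TCAG" order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and dropping a final stop."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 60 bases of the gene sequence in this record.
first_60 = "TTGAAGAAAA CTTATATATT AGATACCAAT GTATTGCTAT ATTCACCAGG AGCTATATAT"
print(translate_cds(first_60))  # MKKTYILDTNVLLYSPGAIY
```

The printed residues match the first 20 residues of the protein sequence above.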