Gene Cphy_3033 details


Gene Information

Locus tag: Cphy_3033
Symbol:
ID: 5743359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3705830
End bp: 3707731
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 36%
IMG OID: 641294134
Product: histidine kinase internal region
Protein accession: YP_001560129
Protein GI: 160881161
COG category: [T] Signal transduction mechanisms
COG ID: [COG2972] Predicted signal transduction protein with a C-terminal ATPase domain
TIGRFAM ID:
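
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the reported values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3705830, 3707731)
print(length)                  # 1902
print(protein_length(length))  # 633
```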


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0284117
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAAAGA TTCTGAAAAT TAATAAAATG AAAAAGCAAA ATAAGAGAAA ATATTTGGGC 
CTAAAAGGCA GGATGCTGCT CGGTATTTTA CAGGTGCTGA TTCCGATTAT GATAGTGATC
ACTATATTAT TCTGGAATAC ACGAAAAGTT ATGAAGCAGG AATATATGCG AACTACCCAA
AGCAGGGTTA CTGAGATTGC TAATAAAATA GATGCTAAAC TTACGGATAT TTACAGTGTT
TCTGATAATT TCGCTGCAAA TGACCAGTTG GATAAATATA TTGAAAAGGT ATATTCACCG
CAAGAGCAGA TTTATAAGAA ACTGGATATT GTTCGTATTT ACAGTAATAT ATTTTCAGCT
TATGATATAT TAAATAAGAG GGAAAGAATC AGTGCAATAT ACACATACAA AGGGGAGTTG
TTTAATTTTC TGGACCCTAA TAAGAATACG AAAGAGGTTA TAGAAAAATT ACAGGATATG
AATATTGAAG ATCCTGACCT TTTGATGAAG TTTCGCTGGT TTCCAGTGCA AGATAATTTT
CTTTTGAGTG ACTATCCGGA GGGGATACGC GAGCAGAAGG CTGTTATGGG TATTCGAAGG
ATTTATTCAT GGGAAAAAGG AAAATATCAA TATGTACAGT TATTTGCCTT GAAAGAAAAA
GAGATATATG AACAGTATGT TCAACTTGCT GAATCTATTC CTGGTGATAT CTATATTTTA
ACAGGAGATG GCAGTTTGAT TTCCTCGAGC AATGAAGAAG TGGTAAAAGC CGGGGAAATA
TCAAGTAAAT TAAAAGATAT GATCTTAGAG CGTACCAAGG ACTCTCAGGA AATGCAAGAT
TCTTCGGGAC ATAAATTGGT AAATGTAAAA GTTTCCGAAG TAAATGACTG GATGACAGTT
ATGATAATTC CTGTAAATGC GGTTACTAGG GATATGGATA TGCTATACCT TAGAATATTC
TTGGTTATGA TGGTTTGTGT CGGACTTTGT GCTATCATGG TATTATATTT ATATAAGAGC
TTTATGGATC CAATCGGCGA GCTCAATGCC TCAATGAAAG AAGTGTATGG TGGTAATCTA
AACGCTTATA TTGAAGTGAA ACAGAAAAAT GAAATGGGTG ATATGATACG CTATTACAAT
TCAATGCTGG AGCAAATTAA TACCCTTTTC ATTGAAGATT TAAAAGCAGA ACGTAAGAAA
AAAGAACTGG AACTTGAAGT ATTGATGAGT CAGATTAACC CTCATTTCCT TTATAATACT
CTGGAGAATA TTGTGTGGAT ATCCAATGAT GCTGGCAGAC CAGACATTGG ACGTATGGCT
GCCTCCCTTG GCAGAATGTA TCGTTTGTCA ATAAGCGGGG GTCAGGTTAT TGTCTTAATG
GAACACGAAA TAGAGCATCT GATGGCCTTT GTCAATATTC AAAAAAACCG CTATAAGGAA
GAGTTTGAAT TTGACCTCCG TACGGATATG CAGCAGATAC ATGGATTGTA TTCTTTAAAG
ATATTGCTAC AACCTGTGGT AGAAAATTCT TTCCTATATG GTATGACTGG ATTAAAACAT
CCAATGCTAA TCAGGGTAAC TATTAAAGAG AAAGATGGAT GGGTCACCAT AAAAGTGATG
GATAATGGCC GTGGAATGGA CAAAGAGCAA TTAAAGGAAA TACGAAACCA AATTCGCTTT
GGAAGGACAG AGAAGGCAGA ACAAGAGAGA AACCGCCGTA GTACCGGTAT CGGGCTCCAT
AGTGTGGAAA TGAGAATTAA GCTGTACTTT GGAGTTGATC ATGCTGTTTC TATATATAGT
AAAAAAGAGG TGGGAACTTT AACTGTCATT CGGATCCCAA AGATAACGAA AGATGATGTT
GACGAACGTG GAAATTTGAT AGAAAATAAG CGAATAAAGT AA
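
The gene sequence is laid out in blocks of 10 bases, 60 per line. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is given here. The helper names are hypothetical, and the short input is a made-up example rather than part of this gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Made-up input in the same layout: 12 bases, 6 of them G or C.
example = "ATGC GGCC\nAATT"
print(clean_sequence(example))  # ATGCGGCCAATT
print(gc_fraction(example))     # 0.5
```

Applying `gc_fraction` to the full listing above and rounding to a whole percent is one way to compare against the GC content field in the record.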
 
Protein sequence
MLKILKINKM KKQNKRKYLG LKGRMLLGIL QVLIPIMIVI TILFWNTRKV MKQEYMRTTQ 
SRVTEIANKI DAKLTDIYSV SDNFAANDQL DKYIEKVYSP QEQIYKKLDI VRIYSNIFSA
YDILNKRERI SAIYTYKGEL FNFLDPNKNT KEVIEKLQDM NIEDPDLLMK FRWFPVQDNF
LLSDYPEGIR EQKAVMGIRR IYSWEKGKYQ YVQLFALKEK EIYEQYVQLA ESIPGDIYIL
TGDGSLISSS NEEVVKAGEI SSKLKDMILE RTKDSQEMQD SSGHKLVNVK VSEVNDWMTV
MIIPVNAVTR DMDMLYLRIF LVMMVCVGLC AIMVLYLYKS FMDPIGELNA SMKEVYGGNL
NAYIEVKQKN EMGDMIRYYN SMLEQINTLF IEDLKAERKK KELELEVLMS QINPHFLYNT
LENIVWISND AGRPDIGRMA ASLGRMYRLS ISGGQVIVLM EHEIEHLMAF VNIQKNRYKE
EFEFDLRTDM QQIHGLYSLK ILLQPVVENS FLYGMTGLKH PMLIRVTIKE KDGWVTIKVM
DNGRGMDKEQ LKEIRNQIRF GRTEKAEQER NRRSTGIGLH SVEMRIKLYF GVDHAVSIYS
KKEVGTLTVI RIPKITKDDV DERGNLIENK RIK
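
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below builds the codon-to-residue mapping from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for elongation codons (table 11 differs in which codons may serve as starts). This gene begins with ATG, so no special start-codon handling is included. The function name and the choice to stop at the first stop codon are assumptions of this sketch.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 42 bases of the gene sequence listed above.
print(translate("ATGCTAAAGA TTCTGAAAAT TAATAAAATG"))                # MLKILKINKM
print(translate("GACGAACGTG GAAATTTGAT AGAAAATAAG CGAATAAAGT AA"))  # DERGNLIENKRIK
```

The two outputs match the first block and the last 13 residues of the protein sequence above.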