Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0201 |
Symbol | |
ID | 5745069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 249113 |
End bp | 251323 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641291291 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_001557327 |
Protein GI | 160878359 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00388216 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA AGAAGGGGAA TCAAGAAGAA TTTAAGGTAA AGCGATTAAA AAGACAATCG AAGAAGGAAG AAAACATTGA GAAGTTATAC AAGCTAAATA AAGAATTAAA GCAAGAGAAA GAAATTAGAC AAGATAGGTA TTCAAAGAAA AGGGAGAAAA AAAGGATCTT TGGAATTCGC TTAAGGTTAG CTTTGGCGTT TATGGTACCA ATCGCCTTCA TTATCATTCT AGGAATTACA TCATATAATA AAGCGTCAGA AGCAATGTTA AAAAGCTATG AAGAATCATC CTTTCAAGCT CTTAAGACTA CAACAAACTA TTTCGAATTA GCAATGCAAA CGGTAGAATT ACGTTTAAAT CAGCTAAAAG GCTATGAGAA TTTAAAGAAT TACTATTCTG GTACATATAA AAGTGATCCA ATAACGGAGA TGACTACATA TAAGAATCTT CAAGTGTATA TTGAAACAAC GACGTATTCG GATAATGTTT TGGCTAACTT ATACGTATTT AGCAAACGTG GTAATAACTT AAGTAACTAC GGGAGCATGG ATTTAAGTGG TGCAGAGTTG GTGGAGGCTT ATCTTGAAAC TGAAGAAGGA AAAAGAGATG AAAAAGAGAA TGGAGCATAT TCTTGGTCTG GCAAGCATCA GTTTGTCGAT GAGCATTTTG TGCAGGATAA GAGTAAAATT ACAATCCCAT ATGCTTTTTC CGTATCCTCA GCCTTTTTTG GGAATGGATT TAAACAAATC GGTTATATGG TAGCAGATAT TAGGTTAGAC TTATTTCAAA AGAAATTATC AGAGCTTGAA CTATCTGAAA ACTCTATCTT TGTAGCTATC TCAAAGGATG GTTATGAAGT GCATAGTAAT GAGGCTGAGG GCGTTTTGAT TGCTGACAAA TCGTTTTATA CCGATGCATT AGCAAGTTCA GAGCTAAGTG GAACTAGCTA TGTAGACTTT AATGGTGAAA AGCACCTATT TTTATATTCG AAAGCTACGA AATCAGGAAT TACCGTATGC GGGCTGGTTC CATACTCCTA TTTGATATCA CAAGCAAAGG CTATTAGAAC ATCAACTGTG ATATATGTTC TTATAGCAGT GGTCAGTGCT TTGTTCATTG CAGTCATTAT GTCAACCGAT ATGGGAAGTG CTATTAATAA AATTATTGTC GCACTAAAAA AGGCTTCGGA AGGAGACTTA ACGGTATCTG TTAATTGCCA AAGAAAAGAT GAATTTGGTA TGTTAGCGGA TAGCGCTAAT AATATGATCT CTAATGTAAA AGGGCTTATT GATAAAGCAC AGAAAGTAAA GGATACTATT TCATTATCTA CGGATGAGGT ATCGGATTCT GCAAAGCAGC TTTTAATAGC GACTCAGAAT ATTTCTACTT CGATTGAAGA AATTCGTCAA GGTATTGTAC AACAAGCGGA AGATTCTGAA AAATGCCTTC GTCAATCAGA CGAATTAACA ACGAGAGTGA ATCAGGTATC CTATAATGTT ACAACGATTG AAAAACTGAC AGAGGACTCT AAAGCGGTCG TACAGCAAGG ATTAGTTTCC ATTGACATTT TGCGAGATAA GTCAGAAGAA ACAACTAAGA TAACAAATAA CATTATCGGG GATATCGAAT CGTTAGAGGC AGAGAGCGAG TCCATTGGTA AGATTATTGG TGTTATTAAT GATATTGCTG AGCAGACGAA TTTGCTATCT TTGAACGCGT CAATTGAAGC AGCTAGAGCA GGGGATGCTG GTCGTGGATT TGCAGTCGTT GCAGATGAGA TAAGAAAACT TGCAGAACAA TCAGTCAGGG CATCTGGTGA GATAGCAACT ATCATTAGTA GCATTCAAGG AAAAACGAAG CTTACCGTAA CCACAGTTCA AAAGTCAGAG AATATTGTAA AATCTCAGGG AAATGCGTTA TTAAATACCA TCGATTTATT CCAGAAAATT GATGCATCAG TTGGTATGAT TGCAAAAGAG CTTACTGAAA TTACGTCTGG TATATCGGGG ATTAAAGTTG CTGAAAATAC TACATTAAAT GCGATAGAGA GTATATCTGC GGTTTCGGAA GAAACAGCTG CATCCTCAGA AGAAGTAGAT AGTGCGGCAA ATCGCCAAGT AGAATCGGTA TCGAAATTAA GAGAGGCAAC TTTGCATTTA GAAGCAGAAA CGAAAGAATT GAATGAAGCG TTGAATCAAT TTCGTATATA A
|
Protein sequence | MMKKKGNQEE FKVKRLKRQS KKEENIEKLY KLNKELKQEK EIRQDRYSKK REKKRIFGIR LRLALAFMVP IAFIIILGIT SYNKASEAML KSYEESSFQA LKTTTNYFEL AMQTVELRLN QLKGYENLKN YYSGTYKSDP ITEMTTYKNL QVYIETTTYS DNVLANLYVF SKRGNNLSNY GSMDLSGAEL VEAYLETEEG KRDEKENGAY SWSGKHQFVD EHFVQDKSKI TIPYAFSVSS AFFGNGFKQI GYMVADIRLD LFQKKLSELE LSENSIFVAI SKDGYEVHSN EAEGVLIADK SFYTDALASS ELSGTSYVDF NGEKHLFLYS KATKSGITVC GLVPYSYLIS QAKAIRTSTV IYVLIAVVSA LFIAVIMSTD MGSAINKIIV ALKKASEGDL TVSVNCQRKD EFGMLADSAN NMISNVKGLI DKAQKVKDTI SLSTDEVSDS AKQLLIATQN ISTSIEEIRQ GIVQQAEDSE KCLRQSDELT TRVNQVSYNV TTIEKLTEDS KAVVQQGLVS IDILRDKSEE TTKITNNIIG DIESLEAESE SIGKIIGVIN DIAEQTNLLS LNASIEAARA GDAGRGFAVV ADEIRKLAEQ SVRASGEIAT IISSIQGKTK LTVTTVQKSE NIVKSQGNAL LNTIDLFQKI DASVGMIAKE LTEITSGISG IKVAENTTLN AIESISAVSE ETAASSEEVD SAANRQVESV SKLREATLHL EAETKELNEA LNQFRI
|
| |