Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_1308 |
Symbol | |
ID | 5744914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1636790 |
End bp | 1640047 |
Gene Length | 3258 bp |
Protein Length | 1085 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641292413 |
Product | APHP domain-containing protein |
Protein accession | YP_001558424 |
Protein GI | 160879456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0025589 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAGTAAGTAA AATTCTAGCG ATGCTTCTTA CAATTATGTT AATAGTGCAA ACAGCTACTA CATTTGTAAT GGCGAAAGAA ATTACATCCA ATGCAACAGT TTATATTAAA AATGCTACCA CAGGTTCTTA TCTTTATGCT TCTGGTTCTA GTGTAAATGC TGGAGGTTTT CAAGAGAAAA ACCCATATTA CATATGGAGT ATAACTAAGA AAACGGACGG TACTTTTTGG ATTCAAAACA TGGGAACAAA CTCGTATTTA TGTTTAGAAC ATGCGAGCGG AAAAGCACTA TTAGAGACTA CAATATACGA AGTATGGATG AGTAGTAAAT GGTACATCGA AGGTAATGCG GATTCTTCTA CAATTGATAA TATGTGGAAG AGCACAAGAA GTAGTGGAAC TGGTGGTTAT TTATATACAA ACAGTGGAAA TTCAGATGTA TTCTACGGAA CAACAGCGCA ACGATGGATT TTTGAAGACT ACAGTTCAAC TTCCAATATT CCAGAGGATG GTGTTAAAGG TAAATTAACA AATAAAGTTG CAGCAGATGC AATCGTAGTT CCTGCTCCAA ACAGCGATGT TTCAAGCATT GGTGCCACGA TGCCATATGT ACGTTATGAT TCCGAATATG CAGTATTAGG TGGTGGCGCA AGGCTTGCAA CCTCAACGAA TTGGGATTTA ACGAATATTG CTAGCCAGGC ATCCAATCAA TCTTATGTCG TTCTTCCATC GAGCGGTTCT TATGCGGAAT GGAGAGTAAA TTCTTCTGGT AATGGTGTTG TTATGAGATT TACCTTACCG GATACTCCGA ACGGAATGGG ACAAAATGGT TCCTTAGACG TATATGTGAA TGGCTCCAAA GTAAAAACAG TTAATTTAAC CTCGTATTAT ATGTGGCAAT ATTTAAATGG TGGTGCAAGT GATACCCCAG GTGGTGGCAC TGCATGTTTC GCATTTGATG AAGCACATTT TCGGTTAGAT AGATCCTTAG TTCCGGGAGA TACAATCCGA ATTCAAAGTT CAGGAGCTTA TGGTCTTACT TATGGAGTTG ATTTCTTAGA AATAGAGGAA ACGGGAGGTC CAATACCTCA ACCAGCTGGT TCTTATTCCG TAGTAGATTT TGGCGCTGTA CCAGACGATG GTATTGATGA TTATGCAGCA ATTACTGCTT GTGTTGCTGC AGCAAATGCG AATGGCAAAG ATGTTTACTT CCCAGCAGGA ACCTATCACA TCAATCAAAT ATGGCGTTTA ACGGCGTCTA ATATGATGAT TACTGGTGCT GGAATGTGGT ATACCAACAT TCAATTTACT AATTCAGCAC AGCAGTCTGG TGGTATTTCT GGAAATGGTG ATGGTAACAC AAAGAATATT GAATTCTGTA ATTTGTACAT CAACTCCAAT CTTCGTTCCC GTTATGGTGA AAATGCAATC TATAAAGGGT TCATGGATGT ATTTGATGGT AAATCCTTGT TCCATGATTT GTGGGTAGAG CATTTTGAAT GTGGATTCTG GATTGCAGAC TATAATGGTG TCTTAGATTA CTCAGATGGA ATTAAAATTG CAAATTGCCG TATTCGAAAC AATCTTGCAG ATGGTGTGAA CTTCTGTCAA GGAACAAGTA ATGCAACAGT GTATAATTGC TCTATCCGAA ACAATGGGGA TGACGGCCTT GCGATGTGGA ATAACAATTT CATGTCAGCG AAGGACTTAA GTAATAATAT ATTCTGTTAT AATACGATTG ATTTAAATTG GCGTGCTGGC GGCATAGCAA TTTATGGTGG AAACAATCAT AAAATTTATA ACAATTATAT TAGAGACTGT TTTATGTCTT CTGGTATTCA TTTAAATACT ACATTTCCTG GATATAAGTT TAATAACACA CAAAGTATCC AATTTGCTAA CAACACAATT ATCCGAAGTG GTTGTAGTTA TGATACTTGG AGAAGTGAGT ATGGCGCAGT TGATTTAACC GGTTCTGTGA AAAACGTAAC ATTTGATAAT ACTTATATTT ACGATGCACA ACATGACGGA ATACGTCTTG GAGATAGTGT TAACAATGTT GTTTTCAACA ATCTTTATAT TTATGGTACT GGTGTTGATG GTAACACCCC GTCCTACTCC TCCTTACCAC ATCTAGGGGC AGCAATCATG TCCTTTGGAG GAAATCCTTC TGTCACTATT AACGGTATGC ATTTAAGAAA GATTGCCTAT CTAGATGTTT ATTATCTTAA CCAGGCAAAT GTTGTTCGTA ATAATGTGAC TAACGAAGGG AATGTGGCAT ATACGATACC GGCTTATCGT GCAGTTGGTT CCTTACAAAC GGATCATGGT AACCATGGTC AGGTACCAAC AATTATACCA ACAGCTACGC CAACAGCAAC ACCAATACCA ACAGCAAGTC CAAGTGTAAC ACCGACTCCA ACAGCTACAC CATCACCAAC AGTAACGCCA ACACCTTCTG TTCCATATGG TAATCCAGAT ATGATTGTTA CAGATATTAC GTGGTCACCA GCAAATCCAA CCAATGGCAA TCAAGTTACT TTTAGTGCAG TTATTAAAAA TATAGGAACT GGGGTAGCGC CACTTGGATC AATTAATGGA GTTCAATTTC AAGTAAATGG TACTAGTGTT TCTTGGTCAG ATAATGACAA AACTCAGATT CAACCAGGTC AATCCATTAC AGTAACAGCC GTAAGTGGGC CTTCTGGCTC TGCAACATGG GCAGCGAAGA CAGGTACCTA TACCATTACT GCATGGGTAA ACGATGTTAA TCGTTATCCT GAGTCTAATA CCAATAATAA TATGTTAACA AAACCACTTA CTGTAACAGA AGGAGCGGTA CAACCAACGG TTACACCGAC ACCTACCCCA ACGGGGCAAC CTTATGGTAA TCCAGATATG GTTGTAACGG ATATTACTTG GTCTCCAGCA AGCCCAGTAA GCGGGAATAA GATCACATTT AGTGCAGTTA TTAAGAATCA AGGAACCGGA GTAGCTCCAG TAGGGGCAAT CAACGGATTA CAATTCCAAG TGAATGGTAC CTGTGTTTCT TGGTCCGATA ACAATACAAC TCAGATAATG CCAGGACAGT CAGTGACGGT AACAGCAAAT AGTGGTCCAG CTGGCAGTCC AACCTATACT GCTACGGCAG GTACCTTTAA TGTTATGGCT TGGGTCAATG ATATTAATCG GTATCCAGAA TCCAATACAG ATAATAACAC TTTGACGAAA ACGATGACAG TACGTTAG
|
Protein sequence | MKKKVSKILA MLLTIMLIVQ TATTFVMAKE ITSNATVYIK NATTGSYLYA SGSSVNAGGF QEKNPYYIWS ITKKTDGTFW IQNMGTNSYL CLEHASGKAL LETTIYEVWM SSKWYIEGNA DSSTIDNMWK STRSSGTGGY LYTNSGNSDV FYGTTAQRWI FEDYSSTSNI PEDGVKGKLT NKVAADAIVV PAPNSDVSSI GATMPYVRYD SEYAVLGGGA RLATSTNWDL TNIASQASNQ SYVVLPSSGS YAEWRVNSSG NGVVMRFTLP DTPNGMGQNG SLDVYVNGSK VKTVNLTSYY MWQYLNGGAS DTPGGGTACF AFDEAHFRLD RSLVPGDTIR IQSSGAYGLT YGVDFLEIEE TGGPIPQPAG SYSVVDFGAV PDDGIDDYAA ITACVAAANA NGKDVYFPAG TYHINQIWRL TASNMMITGA GMWYTNIQFT NSAQQSGGIS GNGDGNTKNI EFCNLYINSN LRSRYGENAI YKGFMDVFDG KSLFHDLWVE HFECGFWIAD YNGVLDYSDG IKIANCRIRN NLADGVNFCQ GTSNATVYNC SIRNNGDDGL AMWNNNFMSA KDLSNNIFCY NTIDLNWRAG GIAIYGGNNH KIYNNYIRDC FMSSGIHLNT TFPGYKFNNT QSIQFANNTI IRSGCSYDTW RSEYGAVDLT GSVKNVTFDN TYIYDAQHDG IRLGDSVNNV VFNNLYIYGT GVDGNTPSYS SLPHLGAAIM SFGGNPSVTI NGMHLRKIAY LDVYYLNQAN VVRNNVTNEG NVAYTIPAYR AVGSLQTDHG NHGQVPTIIP TATPTATPIP TASPSVTPTP TATPSPTVTP TPSVPYGNPD MIVTDITWSP ANPTNGNQVT FSAVIKNIGT GVAPLGSING VQFQVNGTSV SWSDNDKTQI QPGQSITVTA VSGPSGSATW AAKTGTYTIT AWVNDVNRYP ESNTNNNMLT KPLTVTEGAV QPTVTPTPTP TGQPYGNPDM VVTDITWSPA SPVSGNKITF SAVIKNQGTG VAPVGAINGL QFQVNGTCVS WSDNNTTQIM PGQSVTVTAN SGPAGSPTYT ATAGTFNVMA WVNDINRYPE SNTDNNTLTK TMTVR
|
| |