Gene Information |
Locus tag | Cphy_1873 |
Symbol | |
ID | 5743162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2302530 |
End bp | 2305805 |
Gene Length | 3276 bp |
Protein Length | 1091 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641292970 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001558981 |
Protein GI | 160880013 |
COG category | [S] Function unknown |
COG ID | [COG1572] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00027012 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTA GAATAAAAAA AGTTATATCA TATGTGCTAT TTTTTGCTTT GTTGATTCAA TTGTCATCAT TTAAGGTGGT TTATGCTTCC ACGAATCAAT ATGAAGCAGA AAATGCATTA CTTTCAGGGG GCGCAGTTTA CGCAAATGAT CATTCAGGGT ATACAGGAAC TGGGTTTGTT GGTGGGTTTA CAGATAGCAA TAAGGGAAAT GCTAAAGTAG AATTTCAAAT TCCTGCAAGT AAATCTTCTC AGTTTAAGTT AAGTTTACGA TACGCCAATG GAACTGGTAG TGAAAAAACG TTATCTTTAT GGGTAGATGG TAATTTTTAT AAACAGCTTT ATTTTCCAGC TACAGCAAAC TGGGATTCTT GGAATTCCAT AATAGAGACA ATTTATTTAA GCCAAGGAAT GCATTCGATT ACTTATTTTT TTGGGAATTC GGATAGTGGC AATGTCAATA TTGATAATCT TCATGTGGAG TCACTGGAAC CAACAGTAAT CCCTGGTAAA GTGGAAGCTG AAAATTACAT AAGAATGAAT GGTATTGCAA CAGAAGATTG TGCAGAAGGT ACTCAAAATG TTGGTTGGAT ACATAATGGA GATTGGTTAG AATATTCGGT AAATGTTAGT GAATCCGCTA AATATCGCGT TGATTATCGA ATTGCTGGGG TGAATAATTC AACTCAAATA TTAGTTCAGG TAGGTAATAA TACTCTAGCT ACAACAAATA TTGTTAACAC TGGTGGTTGG CAAGTTTATC AAACAGTTTC TTCAAAAACT TTTCAATTAA ATGCTGGAAT TCAAACAATA AGGATTTATT TCACTGGTGA AGGTGTTAAC TTTAACTATT TCAATATAGT AAAAGATAAC CAAACACCTC CAGTAGAAGG TGGAAAACCT GATCTTATTG TAACTGATAT TTCATGGATT CCAAATAATC CAGTGAATGG TGAAGAAGTA ACATTTAAAG CGGTCATAAA AAATCAGGGT GATGGAGCAA GTCCACAAGG AGTTATACAC GGAGTAGCAT TTTTAGTTAA TGGAACGACG GTAAGTTGGA GTGATAATGT TACTACTTCA ATTCCAGCAG GTTCTTCTAT TACGGTAACT GCAAATGGTG GACCATATAA CAAGGGTTCA TGGACGGCAG CGACGGGTAA TTATACAATA CGAGCAATAG TAGATGATAT TAATCGTATT GATGAATCAA ATGAAAATAA TAATACTTAT GATAAAACGA TGGTAGTTGA TGTACCTAAA AAACCGGATT TAACTGTTAC CGATATAACA TGGCAACCTA GTGCGCCTGT TGCAGGGAAT TTAGTAACAT TTTCAGCTGT AGTTGAGAAT AAAGGTACAG TAGCAACCTT AAGTGGAATA CCATGTAATC TCTCTTTTTA TGTTAATGGT ATAAAAGTGA GCTGGGTGTC GAATTATACT TCATCGATTC CAGCAGGGGG GAAAGTAACA CTTACTCAGA CAGGAGGGAA CTTTATCGCA ACCAATGGTA TTCATAATGT GAGAGCTGTT GTCGATGAGG AAAATCTAAT AGATGAAATC GACGAAACCA ATAATTCTTT TGGGAAAAGT CTTTCTATAG GTGTTGTAGA AAACTATGGT GCAACAGTTC CTTATGATAC TTATGAAGCA GAGGCACAGA ACTTAATGGG AGCTTCTCTT ATAAGTTACG ATACGACTTG GTGTAGCGTA GCTTCCGAAG CCTCTGGAAG AAGAGCAGTT AGATTAGAAA AGGGACAGTC TATTGAATTT ACAATAAAAA ATAAAGCACA AGGTATTGTT 
GTTAGATATT CTATGCCAGA CAGTGCTACT GGGGCTGATA TATATCAAAA TCTTAGTGTT TATGTTAATG GTGTTCACAC AACTGATATG ACGTTTACTA ATCGATACGC ACACCAATAT GGGCAATATG GCTCAGATGG GGGTGAGATG CAATGGTCTA ATACCCCTGG AATAAACCAT CATCGTTATT TTGACGAAAG CAGAATACTA CTTGAAAAAG AGTATGCTTC TAGGAGTTCA TTTAAACTTC AATGGGATGC AAATGATTAT AATCCGAATG GTTCTTCCTA TATTATTATT GATTTCATTG AGACGGAAGC AGTGCCGCAA GCATTAACAA AACCATCGAA TTATGTCTCC ATAACAGATT TTGGAGCTAT TCCTAATGAT GGGTTAGATG ATACCGCTGC TATGAATGCT GCAATCAAAG CAGTGAATGG AACAACCATA AAAGGTATTT GGATTCCAGA AGGTACCTTT CATTTTAATA CAGGAACGAG GGGACAAACG AAGATAAAAT TACCGAAAGG TGTTTCAGTA CAAGGAGCAG GCATGTGGCG TTCGACTCTC ACTGGAGCTT TTGCAGGATT CTATCTTCAA GAAGGCAATG TAACATTAGC AGACTTCACG ATAAAAGCAG AAGAAACTTA TCGTTCTAAT GCTTCTGGAA TTGCAGGTTT AGAGGGAAAT GCACAAAACT CTACCATAAT AAATCTATGG ATTCAGCATA CGAAAGTCGG ATTATGGTTA AATGAAGGAA CTGTAAATGC TCTGGTTTCA AAATGCCGAG TTAGAGATAC TTGGGCGGAT GGTATAAATC TAAATGGTGG AACAAAGGAT ACGATAGTGG AGCATTGTAA TTTTCGAAAT ACAGGTGATG ACGGAATGGC TATGTGGTCA AAATCCTTAA ATGGGGGGAG TGGTACTGTA GAAAACTGTA CCTTCCGAAA TAATACAGTC CAAATACCTA ATCTAGCCAA TGGAATCGGT ATTTATGGTG GTAAAAACAA TACGGTTAGT AATAATCTTA TCCTTGATGT GGTAGATAAT GGTTCGGGTA TCCAATTTGG AACCAATCAT GGTCCATCTG CATTTACAGG AACTTTAACG ATAAGTAATA ATAAACTTGT ACGTGCTGGG TCTTGGCATC ATGATTATGG ATATCAAATA GGAGCAATCT GGGGTTATTG GATTAATAAC AATGGTTTAG CACAAAATTT AACCGTTTCA GTAAGTAATA ATCTCATCGA CAGTAGTATC TATTCTGGAA TCTTCACAGA GGAATCGAAT GTAGGTACAA CTGTGAAATT TACAGATAAC AGGATAAATA ATTCTGGGAC TTATGGTGTT CATATCAGAG ATTCAGCTAG AGGCTCAGCG GTATTTCAAA ATAATACGGC AAATGGTAGT GGATTAGTAA ACTTTAAGAA TGATTCACCT AACTTTACAG TAAGTGGAAC CGGAAACAGC TGGTAG
|
Protein sequence | MNRRIKKVIS YVLFFALLIQ LSSFKVVYAS TNQYEAENAL LSGGAVYAND HSGYTGTGFV GGFTDSNKGN AKVEFQIPAS KSSQFKLSLR YANGTGSEKT LSLWVDGNFY KQLYFPATAN WDSWNSIIET IYLSQGMHSI TYFFGNSDSG NVNIDNLHVE SLEPTVIPGK VEAENYIRMN GIATEDCAEG TQNVGWIHNG DWLEYSVNVS ESAKYRVDYR IAGVNNSTQI LVQVGNNTLA TTNIVNTGGW QVYQTVSSKT FQLNAGIQTI RIYFTGEGVN FNYFNIVKDN QTPPVEGGKP DLIVTDISWI PNNPVNGEEV TFKAVIKNQG DGASPQGVIH GVAFLVNGTT VSWSDNVTTS IPAGSSITVT ANGGPYNKGS WTAATGNYTI RAIVDDINRI DESNENNNTY DKTMVVDVPK KPDLTVTDIT WQPSAPVAGN LVTFSAVVEN KGTVATLSGI PCNLSFYVNG IKVSWVSNYT SSIPAGGKVT LTQTGGNFIA TNGIHNVRAV VDEENLIDEI DETNNSFGKS LSIGVVENYG ATVPYDTYEA EAQNLMGASL ISYDTTWCSV ASEASGRRAV RLEKGQSIEF TIKNKAQGIV VRYSMPDSAT GADIYQNLSV YVNGVHTTDM TFTNRYAHQY GQYGSDGGEM QWSNTPGINH HRYFDESRIL LEKEYASRSS FKLQWDANDY NPNGSSYIII DFIETEAVPQ ALTKPSNYVS ITDFGAIPND GLDDTAAMNA AIKAVNGTTI KGIWIPEGTF HFNTGTRGQT KIKLPKGVSV QGAGMWRSTL TGAFAGFYLQ EGNVTLADFT IKAEETYRSN ASGIAGLEGN AQNSTIINLW IQHTKVGLWL NEGTVNALVS KCRVRDTWAD GINLNGGTKD TIVEHCNFRN TGDDGMAMWS KSLNGGSGTV ENCTFRNNTV QIPNLANGIG IYGGKNNTVS NNLILDVVDN GSGIQFGTNH GPSAFTGTLT ISNNKLVRAG SWHHDYGYQI GAIWGYWINN NGLAQNLTVS VSNNLIDSSI YSGIFTEESN VGTTVKFTDN RINNSGTYGV HIRDSARGSA VFQNNTANGS GLVNFKNDSP NFTVSGTGNS W
|
| |
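The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2302530
END_BP = 2305805
GENE_LENGTH_BP = 3276
PROTEIN_LENGTH_AA = 1091


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the derived values with the ones reported in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this entry, which matches the gene sequence ending in the stop codon TAG and the protein sequence ending in W (TGG).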