Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0592 |
Symbol | |
ID | 5743506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 769518 |
End bp | 770888 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641291704 |
Product | extracellular solute-binding protein |
Protein accession | YP_001557718 |
Protein GI | 160878750 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000165278 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTAA CTAAGAAGAT TCTTGCCGTA ATGCTCACAA CTATTTTGGC AGTTAGCTTA GTAGCTTGTG GAAAGGATGC AAAAGACACG GGTAATGGCA AATCCTCAAA GGACGGAAAA GTTAAGATTA CATATTCTAT GTGGGGGAGT GCAGATGAGG GAGCTGTAAC GCAGAAACTA GCAGATAAGT TTAATGCCTC ACAGGATCGA ATCGAGGTCG ATGTTCTCGC AATCCCATGG GAAAACTACA TGACAAAGTT AAATACTATG GCAACTGCAA AAGAATTACC AGATACCGGT ATTATGTCGG AAGCAGGGGT TCTTCAATGG GCAGAACAGG GAATGCTTGC TGATATTAGT ACGATGTATG GAAAAAACGA TAGCAAGCCG CTAGATAGCC TTGCTTTCCG CTATCAAGGA AAAACCGTTG CTTACTCTGC AGCAAACGAA GTTCTTTTGC TTTATTATAA CAAAGACATG TTTGATAAAG CAGGTGCCAC TTATCCTCCA GCGTCTGTTG ATAATGCTTG GACCTGGGAA GAATTTGTTA GCACTGCAAA GAAACTTACC TTAGATAAAA ATGGAAAGCA TCCTGATGAA AAGGGATTTG ATGAGCAAAA TATAGTTCAA TACGGCTGTC TAGTAGAGAA CTTAGCATGG CAACTTGAGC TATGGCCAAT GAGCAATGGA GCTGGATACT ATTCAAGCGA TGGATCAGAA GTAACAATTG ATAATCAAGC TAGTATTGAA GCTATCCAAA AAGTTGCGGA TTTGTACTTA GTAGACCATG TTGCACCTCT CTCGACAGGA GCAACAGATG ATAGTATACA GCGTTCCATT ATTTCCGGTA CTTGTGCTAT GGGAACAGGA GGTGCTTGGA ATGTGGGAAC CTGTCTTGCA TCAGCACGTG AAGAGGGATT AAATTATGGT GTTGCAGTAT TACCTTACAT GAAAGAAAAG CTAACCTTAT GTACCGGTGG ACCAAATGTC GTATTTTCGC AGACAAAACA TCCAGAAGAA GCAATGGAGT GGTTAAAGTG GTATTATCAA GAAGAAAATA GCTGGTCCTT GATTGAAACT GGTATCTGGA TGCCAATCCT CCAAAAATGG TATACGGATG AAACGATGAC TCATAAATGG GTAGACAATA AGAACTTCCC TCCATATGAG GAATATAAGA GTGCAGTTGT TGATTATGCT ATGAAGAATT CTAAGTCTGC GTCCTGGTAT TATGTAAATA ATACAGTTGA TTTTAATAAC TTAGTCACCT CAATTTTAGG TGAAGTTTGG ACTGGTAAAG TCACTGCACA GGAAGCAATT ACAAAAAATA TGGAAGCCTT AAAAACAGCT TATAAGGGAA CGGCAAAGTA A
|
Protein sequence | MKVTKKILAV MLTTILAVSL VACGKDAKDT GNGKSSKDGK VKITYSMWGS ADEGAVTQKL ADKFNASQDR IEVDVLAIPW ENYMTKLNTM ATAKELPDTG IMSEAGVLQW AEQGMLADIS TMYGKNDSKP LDSLAFRYQG KTVAYSAANE VLLLYYNKDM FDKAGATYPP ASVDNAWTWE EFVSTAKKLT LDKNGKHPDE KGFDEQNIVQ YGCLVENLAW QLELWPMSNG AGYYSSDGSE VTIDNQASIE AIQKVADLYL VDHVAPLSTG ATDDSIQRSI ISGTCAMGTG GAWNVGTCLA SAREEGLNYG VAVLPYMKEK LTLCTGGPNV VFSQTKHPEE AMEWLKWYYQ EENSWSLIET GIWMPILQKW YTDETMTHKW VDNKNFPPYE EYKSAVVDYA MKNSKSASWY YVNNTVDFNN LVTSILGEVW TGKVTAQEAI TKNMEALKTA YKGTAK
|
| |