Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_2347 |
Symbol | |
ID | 5745406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2896060 |
End bp | 2897355 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641293437 |
Product | extracellular solute-binding protein |
Protein accession | YP_001559447 |
Protein GI | 160880479 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGA AACTTTTTGG ATTATTTATG GCAACCACAT TAGTAGTTTC CTTAGTTGGT TGTGGTAAGA AAGCAGAAAA TCCATCAACT GATAACGGAA AAACAGAAGC AACACAGACT CCTGGTGCAA CGGAAGCACC TGCAAAAGCA GAGGATGTTA CTTTAAAAGT TTGGGCACCT GAAAATCAGA TTAAAGATGG AACAATGGAT TCTATGACAA AATCCTTCCA GGAATTACAC CCAGAATGGA ACATTAAATT TACTATTGAA ACACAGGGTG AAGATACAGC AAAAGATGAA ATCTTAAAAG ATGTTGGAGC TGCAGGTGAC GTATTCTTCT TTGCTAACGA TCAATTAAAT GAGCTTGTAA ATGCAGGTGC AATTGCAAAG CTTGGCGGAT CTACAGAAGA AATGGTTAAG ACAACTATGG CAGAATCAGT TGTTAATACA GTAAAAGTAA ATGATGCTAT TTATGCAATT CCTTTTACAC ATAATACATT CTTTATGTAC TATGATAAGT CACTTTTAAA CGAAAATGAT ATTAAATCCA TTGAAGGCAT TATGGCAAAA GAAACTCCTT CTAATGTATA CAATTTCTAT TTTGAATCAG CAGGTGGCTG GAAATTAGGT GCTTGGTACT ATGGTGCAGG TTTAACAATC TATGGAGAAA ACCAGACTGA TTTTGCTGCA GGAGCAAATT GGAACAATGA AACAGGCGTT GCTGTAACAA ATTACTTAAT TGACTTAATT AAGAATCCTA AAGCAGCTTT TGATGGTGAA ATTTCCTTAT CCGAATTAGC AGGAGATCAT AGAATCGGTG CTTGGTTTGA CGGTTCTTGG AACTATAAAT TATATAAAGA TGCTTTAGGC GATGACTTAG GTTTAGCAGT AATTCCTACA TTTAATCCAG ATGGCAATGA TTATCAGTTA AAAGGCTTCT ACGGTTCAAA AGCAATCGGT GTTAACTCTC ATGCAGCTAA TCCTGCTGTA GCAGTAGCGT TTGCTGCATA CCTTGGAAGT GAAGAAATGC AAGTACAACG TTTTGAAGAA ACTGGTCAAG TTCCTACAAA CCTTAAAGCT GGTGAATCAG CAGCTGTTCA GGCAGACGAA GTAGCTAAAG TTATCGTTGA AGAAGCTAAT GTTGCATCTA TAATGCAGCC TACATCCTCA GAATTCAGTT CAAGATACTG GGCAAATGCA GGTGGTATTG CTACTGAAAT CAGAAGCGGT GCGTTAAATA AAGATAATGT ACAACAAAAA TTAGATACTT TTGTTTCCTC ATTAAAAGTA GAATAA
|
Protein sequence | MRKKLFGLFM ATTLVVSLVG CGKKAENPST DNGKTEATQT PGATEAPAKA EDVTLKVWAP ENQIKDGTMD SMTKSFQELH PEWNIKFTIE TQGEDTAKDE ILKDVGAAGD VFFFANDQLN ELVNAGAIAK LGGSTEEMVK TTMAESVVNT VKVNDAIYAI PFTHNTFFMY YDKSLLNEND IKSIEGIMAK ETPSNVYNFY FESAGGWKLG AWYYGAGLTI YGENQTDFAA GANWNNETGV AVTNYLIDLI KNPKAAFDGE ISLSELAGDH RIGAWFDGSW NYKLYKDALG DDLGLAVIPT FNPDGNDYQL KGFYGSKAIG VNSHAANPAV AVAFAAYLGS EEMQVQRFEE TGQVPTNLKA GESAAVQADE VAKVIVEEAN VASIMQPTSS EFSSRYWANA GGIATEIRSG ALNKDNVQQK LDTFVSSLKV E
|
| |