Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0291 |
Symbol | |
ID | 5744214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 361234 |
End bp | 362583 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641291381 |
Product | extracellular solute-binding protein |
Protein accession | YP_001557417 |
Protein GI | 160878449 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA AAAAATTAGT AAAAGTACTT AGTTCTCTTC TCGCAGCTTC AGTTATTTTC ACAGGCTGCA GTAGCACAAA AAATAATCAG CCTTCTTCTG CTGATCCAAC CCCAACAAAC AATTCTAGCG AAAGCAAACC TACCACAGAG CCAGAAAAGA ACGAGCTTAG CGGCAGTATT ACATTTTCCG CTTGGGACGT GGATACCCAA ATGCCTTATA TCAAGGATAT GTTAGCTGAA TTTATGAGTC AACATCCTGG AGTTAATGTG GAAATTGTTG ATATTCCTTC CGCTGACTAT GATACAAAAT TAAACATCGA TTTAAACGGT GGAGCTGCTG CTGACGTAAT TTTAGTAAAA AATGCGTCTA CAATGCCATC TATGAATCAA AAGGGTCAAC TTGTCGATTT AAATGAATAC ATAAAACGCG ACAATGTAGA CTTAAGTTCC TATAACGGAT TAGATAAAAG TATTAGCATT AACGGTACTC AGCCAGGTCT TCCTTTCCGT ACAGACTATT ATGTATTGTA CTATAACAAA GACATCTTTG ATACAGCCGG AGTTGCTTAC CCATCAAACG ATATGACTTG GGCAGATTTC GAAGAACTTG CTAAAAAAGT AACTTTCGGT GAAGGTGCAA ACAAGGTTTA TGGTGCACAC CTTCATACAT GGCAAGCATT AGTAGAAAAC TGGGCTATAC AGGATGGAAA GAACACAACT ATGGGTCCTG ATTACGAATT TATGAAACCT TATTATGAGA TGGCTTTAAG AATGCAAAAC GATGACAAGA CCATTATGGA TTATGCAACA CTTAAAACAG CAAACATCCA TTATTCAGGT GTATTCCAAA ATGGTTCTGT TGCTATGCTT CCTATGGGCA CATGGTTCAT GGCTACAATG AGAGATGTTG TAAGCAAAGG TGAATGCAGT GTGAATTGGG GAGTTGCTAC AATACCTCAT CCAGAGGGCT TAGAGGCAGG AAACACTGTA GGTTCCGCAA CTCCAATTTC AATTAACGCA GCTTCTAAAA ATAAAGAATT AGCTTGGGAG CTTATCAAAT TTATGACAGG TGATAGCGGT GCTTCCTATC TAGCTTCCGT TGGTCAATTA CCAGCTCGTA TTAATCCAGA ACTTCTTGAT ACTGTTACAT CTTTAGAAGG TATGCCTGAA GGTGCAAAAG AAGCTTTACA AGTTAAGAAT ATCGTATTAG ATCGTCCAAT CGTAGATAAT GTAAATGAAG TTGATAAGAT GCTTGGTGAG GAACACAGCC TTATCATGCT CGGTGAAGTT ACTATAGACG AAGGAATTAA AGCTTTCACT GAAAACTCTA AGACAATACA GGAACAATAA
|
Protein sequence | MKNKKLVKVL SSLLAASVIF TGCSSTKNNQ PSSADPTPTN NSSESKPTTE PEKNELSGSI TFSAWDVDTQ MPYIKDMLAE FMSQHPGVNV EIVDIPSADY DTKLNIDLNG GAAADVILVK NASTMPSMNQ KGQLVDLNEY IKRDNVDLSS YNGLDKSISI NGTQPGLPFR TDYYVLYYNK DIFDTAGVAY PSNDMTWADF EELAKKVTFG EGANKVYGAH LHTWQALVEN WAIQDGKNTT MGPDYEFMKP YYEMALRMQN DDKTIMDYAT LKTANIHYSG VFQNGSVAML PMGTWFMATM RDVVSKGECS VNWGVATIPH PEGLEAGNTV GSATPISINA ASKNKELAWE LIKFMTGDSG ASYLASVGQL PARINPELLD TVTSLEGMPE GAKEALQVKN IVLDRPIVDN VNEVDKMLGE EHSLIMLGEV TIDEGIKAFT ENSKTIQEQ
|
| |