Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3409 |
Symbol | |
ID | 5743686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 4172661 |
End bp | 4174016 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641294515 |
Product | extracellular solute-binding protein |
Protein accession | YP_001560501 |
Protein GI | 160881533 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000553575 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC GTGTTCTTAC GATACTCATG GCGATTACTT TAGTAATCTC ATTACTTGCA GGATGCAGTG ACAAGAGTGC GAAAACAGAT AATGCTACGA AAGAGAATAA CAAAGCGGAA GCTACGAAAG GGCCTACACA ATCACCGAAC TCAAGTTCTG ATAAAATAAC GTTAAAACTA TTCTCTAATC TTCCGGATAG AAAGAATGGT CAAGGCTTAA TAGAACAGAT GATAATTGAC GAATACACAA ATAAGAATCC TAATGTAACA ATTGAAGTTG AAGCATTGGA TGAAGAAGCG TATAAAACGA AGTTCAGAGC GTACGCAATG AATGGTATGC CAGATGTTGT GAGTATCTGG GGACAGCCAT CTTTCTTAGA TGAAGTTTTG GAAGCTGGAG TTTTAGCAGA ACTTAAAGAG AGTGATTATG CAAACTATGG TTTTGTATCA GGTTCTCTAG AAGGATTTAA GAAGGATGGA AAGTTATACG GTCTACCAAG AAATACAGAC GTTATGGCAT TTTATTATAA CGAGAAAATA TTTAATGATA ACGGTTGGAA GGTTCCAAAT ACATTTGAAG AATTCTTAGA TCTAGCAAAA CAGATAAAAG ATGCCGGTAT GATTCCGGTT GCAATGGACG GCGGCGATGG ATGGCCAATG GCGATTTATT TAACAGATCT TTTAGTTAGA ATCAATGGGA ACTGTAGCGA ACTGATATCG AAAGGGATTC GCAGCGGAGA TTTTTCAGAT CCTGTATTTA AAGAAGCAGC TGAGTTATTA AGAAAATCTG CAGAAGTAGG AATGTTCCAG ACAGGTTACG ATTCTCAAGA TTATGGTACT GCGATGAATT TATTTACCAA TGGACAATCC GCAATGTTTT ATATGGGAAG TTGGGAAGCT TCCATGGCTT TAAATAAGGA TATTAGTGAA GATGTACGGT CAAATGTCAG AGTATTTACG ATGCCAGCTT TAGCAAACGG AAAAGGAAAG CAAACAGATA TTGCTGCATG GAATGGTGGC GGATATGCAG TTTCTGCAAA TTCAGAAGTG AAGGATGAGG CAATTAAATT CTTAAACTTC ATGTATCAGC AGGATAAATT ATCTAAGTAT GGATGGGAAA ATGGGGTTGG TATGTCAGCA CAGGATCAGT CGGCTTATAT GACTGGCAAT GAAACCATTC TTCAAAAACA GTTTACAGAT ATCGTTAAGA ATGCAACAAG TGTATCCGGA ACACCATTTA ATGATTGCGG TACCTCTGCT TTTAAAACAG CAATAGAGAG TGAAATTCAG AGTTTATCCA ATGGAACAAA ATCAGTAGAT GAGTTCTACA AAGCGCTAGG AGAAGCATGT AAGTAA
|
Protein sequence | MKKRVLTILM AITLVISLLA GCSDKSAKTD NATKENNKAE ATKGPTQSPN SSSDKITLKL FSNLPDRKNG QGLIEQMIID EYTNKNPNVT IEVEALDEEA YKTKFRAYAM NGMPDVVSIW GQPSFLDEVL EAGVLAELKE SDYANYGFVS GSLEGFKKDG KLYGLPRNTD VMAFYYNEKI FNDNGWKVPN TFEEFLDLAK QIKDAGMIPV AMDGGDGWPM AIYLTDLLVR INGNCSELIS KGIRSGDFSD PVFKEAAELL RKSAEVGMFQ TGYDSQDYGT AMNLFTNGQS AMFYMGSWEA SMALNKDISE DVRSNVRVFT MPALANGKGK QTDIAAWNGG GYAVSANSEV KDEAIKFLNF MYQQDKLSKY GWENGVGMSA QDQSAYMTGN ETILQKQFTD IVKNATSVSG TPFNDCGTSA FKTAIESEIQ SLSNGTKSVD EFYKALGEAC K
|
| |