Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0619 |
Symbol | |
ID | 5741668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 805147 |
End bp | 806538 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641291731 |
Product | extracellular solute-binding protein |
Protein accession | YP_001557745 |
Protein GI | 160878777 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000037115 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGA AAAAATTATT ATCATTGGCT CTATCCGTAG TCTTAGTAAC TGGTTTAGCC ACAGGATGTG GGAAAAAATC AGACGACATC ACTGATAAGA ACACGCAAGC TCCATCTCCA ACAACTACTG AAGAAGTAAA AGTTACTGAA TCAGGATATG AAAAGAAAAA CATTACTGAT GATAAAGTAA CATTACGTTT TATGTGGTGG GGTGGCGATG AACGAAACAA TGCAACGTTA GAAGTTATTG ATAAATTTAT GGAATTGTAT CCAAACGTAA CCATCGAACC TGAATATGGT GGAAATGAAG GATATAAAGA GAAATTAGTT ACTCAGCTTT ATAGTAATAC TGCTGCTGAT TTAATTCAAT GTGATCCAGC TTGGTTCTCT GAATTAGTTC AAAATGGTGA CTACTTCATC GACTATAATG ATTATTCTGA TTATTTTGAT ATTTCTGGTT TCGACTATAA TTTATTAAAT AACTATGGCG TTGTTGATGG AAAAGTGTTT GCTGTACCAA CTGGAACTGC TGGAGCAGCA CTTGTTTTAA ATACAACATT AGCTGACAAA ATTGGTATTG ATTATAGTAG CCAATTAACA TGGGATTCTT TAATTGAAAT TGGAAAGCAA GTAAATGCTT ATGATCCAAA CATGTATTTT TTAAATCTTG ATACATTACG TATTGCAGAG CAAGTAATTC GTCCTATGAT CATGCAAAAA ACTGGTCATC CATTTATTGT TGATTCTGAA AAGAAGATGA GTTTTACAAA AGAACAGTTA GTAGAAGTGT TAGATTACGT TAAGTTATTA TATGATTCAA AGACAGTTCA ACCAGCACAA GAAAGTGCAC CTTTTTATAA TGCAACTGAG ACTAATACAA AATGGGTATC TGGTGATTTT GTAGCAGCAC TTGGATTTGC ATCTACTGCA AATAACTTAT CGAATTCCTA TGGCGGACAA GATAAAAACT TTATCTCTGT TCAAGTTCCA CTTCCTGAAG GAAGATTAAA CGATGGATAT TTAACTGCAC CACCACAGGT AATGGCAACA GCAAAAACAT GTAAATATCC AGAAGTTGCA GCAGCTTTTT GGGATTTCTT CTTTAATTGT GACGAATCCA TCTTAACATT GAAAGATTTA AGATCAGTAC CAGCAAAAGA ATACAACCGT GTACTTCTTG ATAAAAATCA ATTAATCACA ACTCTTGTTA CAGACGCAGT GAATTATGCA TCTGCTTGCA ATGGTATTTC TGAGCATGGC TATACGACTG GCTCTGAAAT TGCTCAAATT TTAGTTGATA TGGTAGAAAG TATTGCGTAT GGTAACAGCA CTCCTGAAAA AGTAGCTGAT GAGGCAATTG TATTAATAGA GGATTTCTTA TCTCAACAAT AA
|
Protein sequence | MRMKKLLSLA LSVVLVTGLA TGCGKKSDDI TDKNTQAPSP TTTEEVKVTE SGYEKKNITD DKVTLRFMWW GGDERNNATL EVIDKFMELY PNVTIEPEYG GNEGYKEKLV TQLYSNTAAD LIQCDPAWFS ELVQNGDYFI DYNDYSDYFD ISGFDYNLLN NYGVVDGKVF AVPTGTAGAA LVLNTTLADK IGIDYSSQLT WDSLIEIGKQ VNAYDPNMYF LNLDTLRIAE QVIRPMIMQK TGHPFIVDSE KKMSFTKEQL VEVLDYVKLL YDSKTVQPAQ ESAPFYNATE TNTKWVSGDF VAALGFASTA NNLSNSYGGQ DKNFISVQVP LPEGRLNDGY LTAPPQVMAT AKTCKYPEVA AAFWDFFFNC DESILTLKDL RSVPAKEYNR VLLDKNQLIT TLVTDAVNYA SACNGISEHG YTTGSEIAQI LVDMVESIAY GNSTPEKVAD EAIVLIEDFL SQQ
|
| |