Gene Information |
Locus tag | Cphy_1118 |
Symbol | |
ID | 5741953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1410194 |
End bp | 1411819 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641292223 |
Product | extracellular solute-binding protein |
Protein accession | YP_001558235 |
Protein GI | 160879267 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
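The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1410194
end_bp = 1411819
gene_length_bp = 1626
protein_length_aa = 541

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

coordinates_consistent = span_bp == gene_length_bp and span_bp % 3 == 0
protein_consistent = expected_protein_aa == protein_length_aa

print(span_bp, expected_protein_aa, coordinates_consistent, protein_consistent)
# prints: 1626 541 True True
```

Under these assumptions the reported gene length (1626 bp) and protein length (541 aa) agree with the coordinates.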
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000575789 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CGGTAACATT ACTGTTGGTT CTGACCATGG TGGTAAGCTT ATTTGCAGCA TGTGGTAAGA AAAATGGATC AAGCGAAACC GGCACAAAAG ATCCTGTGGC AACAAGCGGT GCAAAAGAAC CTGACAAACA AGATCCAGGC AATAAAGAGC CTGAAAAACA AGACCCTGTT AAAATCAAGA TTTATTACTC TGATAATGCA ACCTTACCAT TTAAAGAAGA TTGGTTAGTT ATAAAGGAAG CTGAGAAGAG ATTTAATGTT GATTTCGATT TCGAAGTAAT TCCAATTGCA GATTATCAAA CAAAAGTTTC TTTAACATTA AATACAGGAA ATAACGCTCC AGATGTCATC CTTTATCAGT CAACGCAGGG AGAGAATGCA TCTCTTGCTC TAAATGGTGC TCTAGTACCA ATCAGTGACT ATGCTGAATA TACACCTAAC TTTAATGCAC GTGTAGAAGA GTTTGGTCTA ACTGGTGCTA TAAACAGATT AAACCTCGCA GACGGAAAAC GTTATTATAT GCCTGGCTTA TTTGACGTTC CTTTCTATGA TGGTGGACTT ATTTTAAGAG AGGATTTCTT AGAGGCAGAA GGATTAGCTG TACCTAAGAC ATTTGATGAT TTATATAATA TCTTAAAGGC ATACAAAGCA AAGAATCCTG ATTCTTATCC TTTAACTATC TTAGCTGGTC CTCGTGTATT ATACCGTATG ACAATGCCAT CCTTTGGTGT TAGTTTAGGT AAGAACGGAG CTGGCGGAAC GAATACCTTA AGTTGGGATT ATGAAAAGGG CGAATATTTT GAAGGTGCTA TCAGTGATGG TTATAAACAA TATATTAGCT ACCTTGCAAA ACTTTACAAT GAAGGATTAC TTGATCCTGA AATGGCAGAC CCAATCGATG GCGATAAATG GTCTCAAAAG ATGGCAAGCG GAAAATCTAT GGCTACCTAT GCATACTATG ACCAGATTGG TGGTGTAAGT GCTTCTACTG AAATCGAAGG CTTTAAATTA CAGATGTACC CATCATTAGA GGGACCTGCT GGTGCTCATC ATCAGCAAAA GAACCGTACT GGTTCCGGTA TTATGTTCCC AGCAGCTACT GCACAAAGAA AAGACTTTGA AAGAGTTGTG AGAACAATTG ATGAAGTATT CTTCTCCGAA GAAGGTGCTA AATTATGGTG CTTAGGAGTA GAAGGCGTAA CATATACAGA AGAAAACGGA GTAATCAAAT ATTCTGATGA GTTAGTAAAT TCAGCAGAAG GTGTTTATAA AACACTTCAA GTAAAATACG GCTGTGGTTC TGACGTTACC CAATTAGTAT GGGTTAACGA ACGTGAAATG ACAAAATATG ATGAGAATTA TGCACGTATC AATAAAGAAG TTGCTGCTAT GGGAGATGTT ATTCAACAGA TACCTCCAAC ACCATTATTT GATGATATGA AAGCAGAAGA TGCGGGCGTT TTGCAAACTC CATTATTTGA TACCTTCAGT GTATGGGCAG ACGCATTTAT AACTGGTAAG AAGAGTGTAG ATAATGATTG GGATGCTTAT GTAAATGAGA TGAAAACATT AAAAATTGAC GAATTCTGTA AGATTTATAA TGATAATCTT AACTAA
|
Protein sequence | MKKTVTLLLV LTMVVSLFAA CGKKNGSSET GTKDPVATSG AKEPDKQDPG NKEPEKQDPV KIKIYYSDNA TLPFKEDWLV IKEAEKRFNV DFDFEVIPIA DYQTKVSLTL NTGNNAPDVI LYQSTQGENA SLALNGALVP ISDYAEYTPN FNARVEEFGL TGAINRLNLA DGKRYYMPGL FDVPFYDGGL ILREDFLEAE GLAVPKTFDD LYNILKAYKA KNPDSYPLTI LAGPRVLYRM TMPSFGVSLG KNGAGGTNTL SWDYEKGEYF EGAISDGYKQ YISYLAKLYN EGLLDPEMAD PIDGDKWSQK MASGKSMATY AYYDQIGGVS ASTEIEGFKL QMYPSLEGPA GAHHQQKNRT GSGIMFPAAT AQRKDFERVV RTIDEVFFSE EGAKLWCLGV EGVTYTEENG VIKYSDELVN SAEGVYKTLQ VKYGCGSDVT QLVWVNEREM TKYDENYARI NKEVAAMGDV IQQIPPTPLF DDMKAEDAGV LQTPLFDTFS VWADAFITGK KSVDNDWDAY VNEMKTLKID EFCKIYNDNL N
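The relationship between the gene sequence and the protein sequence above can be spot-checked in code. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and it only translates the first 30 and last 15 bases copied from the record. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and first block of the protein.
gene_prefix = "ATGAAAAAAA CGGTAACATT ACTGTTGGTT"
protein_prefix = "MKKTVTLLLV"
# Last 15 bases of the gene sequence (in frame) and last 4 residues.
gene_suffix = "GATAATCTTAACTAA"
protein_suffix = "DNLN"

print(translate(gene_prefix) == protein_prefix)  # prints: True
print(translate(gene_suffix) == protein_suffix)  # prints: True
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the reported GC content to be compared in the same way.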