Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0431 |
Symbol | |
ID | 5745191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 550144 |
End bp | 551712 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641291543 |
Product | extracellular solute-binding protein |
Protein accession | YP_001557557 |
Protein GI | 160878589 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA GAAAGCAGTT ATTCTTTTTA GGTCTTCTTC TGAGTTGTAT GATTGCACTC TGCGCATGCA GTGGCAGTAC AGAAAACAAG GGAAATGATC AAAAAGAAGC AGAAAACGGC GAGGCTTCAA AAAATGGTGG CTCTGTTATT GTAGGTATAA CAAACGATTT GGATAGTCTT GACCCACACA AAGCAGTTGC GGCAGGAACT AAGGAAGTTT TGTTTAATAT TTTCGAAGGC TTAGTAAAGC CTGATAAAGA TGGAAATTTA GTTCCTGCTG TAGCAAGCGA CTATAAGATC TCGGAGGATG GAATGACCTA TACGTTTTAT TTACGTGAAG GGGTAAAATT CCACAATGGT GCGCTTGTAA CGGTAGATGA TGTAATATAT TCATTAAAAC GTGCTACCGG CCTGCTTGAA ACTTCCGATC CAGAAGTGAG AGTTGAGTCG GTATTTTCTT GTGTAGAATC GATAAATGCG ATTACTACTG AGGATGGAAA ACAGGCCGTA GAAGTCAAAT TAAATCAACC TAATATTGAA CTTCTATCTT ATTTTACCAT TAGTATAATA CCTAAAGACT ATACCGAGCA GGCGACTAAG CCGGTTGGTA CAGGTCCATT CCGTTTTGTA TCCTATTCGC CGCTGGTTAG CATTGTGATG GAGAAGAATC CTGACTATTA TGTTAAAGGG GTTCCTTATC TTGATGAAGT AACGTTTAAG ATATCTGCAA ATACAGATGC AGCATTTATG GAATTAAGAG CTGGTACTAT CGATATTTTC CAGCAATTAA CCTATGAACA ATCAAATCAA TTAAAGGATT TGTACAACAT TGAGATTGGG CATATGAACC TGGTACAGGC TTTGTTTTTA AATCACAAAG CTGCTCCATT TGATAACTTA AAAGTTCGCC AAGCGCTCAG CTATGCAATT GACAGACAGA TGATTCTTGA TATTGTTGCA GGTGGAAATG GTACGATTAT TGGAAGTAAT ATGTTTCCTG GATTCGGTAA ATACTACGAT GAATCCTTAG CAAGTTACTA CACTTATGAT GTAGAAAAAG CAAAACAACT TCTTAATGAG GCAGGTTATC CGGAAGGATT TCATTTTACA ATCACTGTCC CATCTAATTA TCAAGCACAT GTTGATACAG CACAGGTGAT TGTGGAACAG TTAAAGAAGG TTGGAATTAC TGCGGAAATC AAGCTAGTAG AATGGGCAAG TTGGATTTCT GATGTTTACC AAGGAAGAAA CTATGAATCA ACGATTATAG GACTTGATTC CAATCTTTCT CCTAGTGATA TTGTAAAGCG ATATGAATCT ACATCAAAAA ACAACTTTTT AAATTATTCA AATTCACAAT TTGACAGCCT TTATCCGAAG GCTTATGCAT CGGTTAAGGA AGAAGAAAAA GTAGACCTTT ATCATCAGTT GCAAAGAATA TTAACAGAGG ATGTAGCTTC TGTTTATCTT CAAGATCCAG CTAATTTAGT AGCAGTAAAC AAAAAATTAG CAGGATATCA ATTCTATCCT GTTTATGTAC AGGATATGTC AACAGTTTAT TATAAGTAA
|
Protein sequence | MKRRKQLFFL GLLLSCMIAL CACSGSTENK GNDQKEAENG EASKNGGSVI VGITNDLDSL DPHKAVAAGT KEVLFNIFEG LVKPDKDGNL VPAVASDYKI SEDGMTYTFY LREGVKFHNG ALVTVDDVIY SLKRATGLLE TSDPEVRVES VFSCVESINA ITTEDGKQAV EVKLNQPNIE LLSYFTISII PKDYTEQATK PVGTGPFRFV SYSPLVSIVM EKNPDYYVKG VPYLDEVTFK ISANTDAAFM ELRAGTIDIF QQLTYEQSNQ LKDLYNIEIG HMNLVQALFL NHKAAPFDNL KVRQALSYAI DRQMILDIVA GGNGTIIGSN MFPGFGKYYD ESLASYYTYD VEKAKQLLNE AGYPEGFHFT ITVPSNYQAH VDTAQVIVEQ LKKVGITAEI KLVEWASWIS DVYQGRNYES TIIGLDSNLS PSDIVKRYES TSKNNFLNYS NSQFDSLYPK AYASVKEEEK VDLYHQLQRI LTEDVASVYL QDPANLVAVN KKLAGYQFYP VYVQDMSTVY YK
|
| |