Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_3858 |
Symbol | |
ID | 5744810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 4731867 |
End bp | 4733216 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641294970 |
Product | extracellular solute-binding protein |
Protein accession | YP_001560944 |
Protein GI | 160881976 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000022263 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TACTTGCGTT ACTGTTAACT CTTACAATGG TAGTATCTAT GGCTGCCTGC ACTAAGAAAG AGGATCCAGC TACTAATACC CCTGGTAGTA GTACAGATAG CCCTAAAGCT ACTGCTACAA CGGCTCCTAC GGCTGCACCT AAGAAAGTTA CATTAAATGT TACTACTACA TATGCCGGAA ACGATGGCAA TGCTCAGAAT TACCAAGACG CTGTCGCTAA TTGGGAAAAA TCAACTGGAA ACAAAGTGAA TGATTCCTCA TCAACTTCTG ATGAAACTTT TAAAGCTAGA GTTTTAGCAG ACTTCGAAAC AGGATCTGAA CCAGACGTAT TATTCTTCTT TAACGGTGTG GATTCCAATG CTTTCGTTCA AGCAGGCAAA GTAGTATCTA TCGATGAAAT TCGTCAATCT TACCCTGACT ATGCATCTAA TATGAAAGAT GGCATGTTAG GCGCTTCTCC TGTTGATGGA AAGAACTATT CCGTTCCTGT AAATGGTTAC TGGGAAGGTA TGTTTGTAAA CAAAGAAGTA TGTAAGGCTG CTGGTGTTCA GATTCCAGAT GCTAACACTA CATGGGATCA GTTCCTTGAG ACTTGTCAGA AGATTAAAGA TGCAGGCTTT GCTCCTATCG CTGTTTCTTT AGCAACTGTA CCTCACTACT GGTTTGAGTT CTCTATTTAC AATTTCTTAT CACCATCAAC ACATAATGTT CTTCCAAAGA ACACTACTGA TACTCAGGGA CAGGCATGGG TGAACGGTAT TAACGATATT AAGATGCTTT ATGAAAAAGG TTATTTCCCA GAAAATACTT TAACAGGTAC TGACGATGAA ACTGTTCAAT TATTTATTGA TAACAAAGCA GCATTCTTAA TCGATGGTTC TTGGAAAGTT GGTGGAATCG AAGGTTTAAC AACCGATATT GATAATTTTA CTGTTACTTA TGTTCCAGGA AAGGGCGACA GAAAGACTAC AGATATCATC GGTGGATTAT CTAGCGGATA TTTTATCTCA AAGAAAGCTT GGGAAGATCC AGATAAGCGT GCTGCTGCTG TTGATTTTAT TACTTCCATG ACAAGTGATG AGTTAGTTTC TAAATTCGCA CAAGTATCGG CTACAGCATT AAAGAATGGT CCTACAGTAG ATGAATCTAA ATTATCATCT CTAGCTAAAG ATGGCCTTAA GTATGTAAAA GGCGCTACTG GAATGGCTAG CGCTGTAGAA GATCAGGTTC CAAGAGAGTG TAGAGTTCCT GTTTTTGATG GAATGCCTAA TTTAGTAACT GGTAAAAACG ATATCGCAGA AGCAATACAA AGCGTTTTAG ATTTAACAGC TGCTCAATAA
|
Protein sequence | MKKLLALLLT LTMVVSMAAC TKKEDPATNT PGSSTDSPKA TATTAPTAAP KKVTLNVTTT YAGNDGNAQN YQDAVANWEK STGNKVNDSS STSDETFKAR VLADFETGSE PDVLFFFNGV DSNAFVQAGK VVSIDEIRQS YPDYASNMKD GMLGASPVDG KNYSVPVNGY WEGMFVNKEV CKAAGVQIPD ANTTWDQFLE TCQKIKDAGF APIAVSLATV PHYWFEFSIY NFLSPSTHNV LPKNTTDTQG QAWVNGINDI KMLYEKGYFP ENTLTGTDDE TVQLFIDNKA AFLIDGSWKV GGIEGLTTDI DNFTVTYVPG KGDRKTTDII GGLSSGYFIS KKAWEDPDKR AAAVDFITSM TSDELVSKFA QVSATALKNG PTVDESKLSS LAKDGLKYVK GATGMASAVE DQVPRECRVP VFDGMPNLVT GKNDIAEAIQ SVLDLTAAQ
|
| |