Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0812 |
Symbol | |
ID | 5745292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1038064 |
End bp | 1040445 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641291926 |
Product | extracellular solute-binding protein |
Protein accession | YP_001557938 |
Protein GI | 160878970 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000378939 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGAA AAAAGGAGAG AGAGACTATG AGAAAAAACA AGTTTAAAAA AGCATTGTCT GTTTTTCTTG TTTTATCCAT GTGTGTTGCT TTAGCAGCTT GTGGAAAGAA AAATAGTGGC AATCAGTCAG GGGAGAACCC GAGTGAAGGA AATAATTCAG GTAATTCTAG TGATAAACCT TTAGTAGTTG GTTCCTTACA ATTTAGTGAA AAATTTAGTC CATTTTTTGC TACTACTGGT TACGATAGGG AAATCGCTGA CTTAACACAA GTTCAGATGT TAACTACAGA TAGAACTGGT GGTTTAATTT TAAATTCAAT TGAAGGTGAG ACTGTTCCTT ACAATGGTAC AGATTATCTT TATACAGGTA TAAGTGATAT TGCAGTAAGC CGTGATGAGG CTGCTGATAC CACTACTTAT AATATTAAGA TTCGTGATGA CATTAAGTTC AGCGATGGCA AGGTAATGGA CGCTGATGAT ATTATCTTCT CTTATTATGT ATTATCTGAT ACTAGTTATG ATGGAAGTTC TACATTATAC TCTACACCAA TTGTTGGTAT GTTAAATTAC CGTAAGAACA ATTCTAATGC AGAGAGCACT GTAGTAACTG AGGATGAAAT TGCTAAAGAG CTTGCTTCTT TAAGTGATGC ATCAAAAGAA GCAATCAACA AAGAAATAAT TGCTCCAACA TTAACTTCAG AACTTGACTG GGTTAAAGGT CTCTATGGCA AAGATTCATA CAAAGAGTAC ACAGAAAAAT ACCCAGTAAC AAAAGATTTA TTTGCTCACT TCTATAGTGT AGATGAGAAT TATGATGCTT CTAAGGTAGA AGACGAAGCT AAAGTTTTAG AAGAAATTAC AGCAGCTTAC GGTGCTAATT ATCAAGCACT TGGCGAAGCA TATGCAGGAG ATGAAACTTA CTTTGCTGAC CAGGTTAAGG AAGTTGTATC TAATGTATTA TTAGAAGAGA AACTTTCAAA AGGCGGAGAT GAGGTTCCTA ATATTGCTGG TATTAAGAAA ATCAGCCAGA CTGAGGTAGA AGTTACAACC AATGGTTTTG ACGCTTCTGC TATCTACAAC ATTTGTGGTA TTACTATTTC TCCAATGCAC TACTATGGTG ATGAGGCTCA GTATGATTAT GATAATAATA ACTTTGGTTT TACTAGAGGT GATTTATCTA TTGTAAAAGC AAAGACTACA AAGCCTTTGG GAGCAGGTCC TTATAAATTC GTAAAGTATG AAAACAAAGT TGTTTACCTT GAAGCAAATG AATTTTACTA CAAGGGTGAA CCTAAGACAA AGTACCTTCA GTATAAAGAA ACAACAGAAG CTGATAAGAT TTCTGGTGTT GGTACAGGTA CAATTGATAT TGCAGATCCA TCCGGTAGCG TATCTGCTTT CGATGAAATT GCAAACTACA ATTCAAACAA AGAATTATCC GGTGATAAGA TTATTACAAG TACTGTAGAT AATCTTGGTT ATGGATACAT CGGTTTTAAC GCCGATGCTA TTAAAGTTGG TAATGATCCA GCTTCTGAAG AATCTAAGAA TTTAAGAAAA GCGATTGCTA CTATATTAGC AGTATACCGT GATGTAGCTA TTGATAGTTA CTATGGTGAA GTTGCTAGCG TTATTAACTA CCCTATCTCT AATACATCTT GGGCAGCACC ACAGAAATCT GATGAGGATT ACAAAGTTGC TTACTCACAG GGAGTAGATG GTAACGATAT CTATACTGCA GATATGACAG CTGACCAGAA ATACGAAGCA GCAACAAAGG CTGCAATAGA ATTTTTCAAG GCAGCAGGAT ATACTTTTGA TGATGCAACA GGTAAGTTTA CAAAGGCTCC AGAGGGTGCT AAATTAGAGT ATGAAATTTT AATTGGTGGA GATGGTAAAG GTGATCACCC ATCCTTCGCG ATTCTTGCTA AGGCAAAGGA AACTCTTGCT ACAATCGGTA TCACATTAAC AATTAATGAT CCATCTGATT CTAATGTAAT GTGGGATAAA ATAGAGGCTG GAACTCAAGA AATGTTTGTA GCAGCTTGGA GTGCAACAAT AGATCCAGAT ATGTACCAGA CTAAGTACAG TACAAACATT GTCGGTAAAG GTGGTTCTGA TTCTAATCAC TATCATATTG CTGATCCTCA GTTAGACCAG TTAATTATGG ATGCTAGAAA GAGTGCTGAC CAAGCTTACA GAAAAGCGAC ATATAAGACT TGTCTTGACA TTATGCTTGA TTGGGGTGTC GAAGTTCCTG TATACCAGAG ACAGAACTGT ATTATATTTG CTAAAGATCG TGTAAACACT GATACAGTTA CTCCTGATAT CACAACCTTC TGGAAGTGGA CTAACGATAT TGAAAAGATT GAAATGAAAT AA
|
Protein sequence | MTRKKERETM RKNKFKKALS VFLVLSMCVA LAACGKKNSG NQSGENPSEG NNSGNSSDKP LVVGSLQFSE KFSPFFATTG YDREIADLTQ VQMLTTDRTG GLILNSIEGE TVPYNGTDYL YTGISDIAVS RDEAADTTTY NIKIRDDIKF SDGKVMDADD IIFSYYVLSD TSYDGSSTLY STPIVGMLNY RKNNSNAEST VVTEDEIAKE LASLSDASKE AINKEIIAPT LTSELDWVKG LYGKDSYKEY TEKYPVTKDL FAHFYSVDEN YDASKVEDEA KVLEEITAAY GANYQALGEA YAGDETYFAD QVKEVVSNVL LEEKLSKGGD EVPNIAGIKK ISQTEVEVTT NGFDASAIYN ICGITISPMH YYGDEAQYDY DNNNFGFTRG DLSIVKAKTT KPLGAGPYKF VKYENKVVYL EANEFYYKGE PKTKYLQYKE TTEADKISGV GTGTIDIADP SGSVSAFDEI ANYNSNKELS GDKIITSTVD NLGYGYIGFN ADAIKVGNDP ASEESKNLRK AIATILAVYR DVAIDSYYGE VASVINYPIS NTSWAAPQKS DEDYKVAYSQ GVDGNDIYTA DMTADQKYEA ATKAAIEFFK AAGYTFDDAT GKFTKAPEGA KLEYEILIGG DGKGDHPSFA ILAKAKETLA TIGITLTIND PSDSNVMWDK IEAGTQEMFV AAWSATIDPD MYQTKYSTNI VGKGGSDSNH YHIADPQLDQ LIMDARKSAD QAYRKATYKT CLDIMLDWGV EVPVYQRQNC IIFAKDRVNT DTVTPDITTF WKWTNDIEKI EMK
|
| |