Gene Information |
Locus tag | Cphy_2733 |
Symbol | |
ID | 5743026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 3323144 |
End bp | 3324745 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641293825 |
Product | extracellular solute-binding protein |
Protein accession | YP_001559833 |
Protein GI | 160880865 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
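The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3323144
END_BP = 3324745
GENE_LENGTH_BP = 1602
PROTEIN_LENGTH_AA = 533


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record.
print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the span is 1602 bp, which gives 534 codons, or 533 residues once the stop codon is excluded, in agreement with the listed Gene Length and Protein Length.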
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000105202 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGT CTACACAGAA ATTCTTATCA CTTGGTTTAA GTCTTGCGCT TGTATTTAGC TTAACAGCTT GTGGAAGCAA GAACGATGGA CCAAAGACAG AACCAACAAC AGCTCCAACA AAGGGTGCTG AAAACAATGG TGGCGCAACA ACTACTCCAG AAGCTCCAGT TTCTGAAGTT GAGAAGCCAT CTGTAATCAC AGTAATGATG GATGGTACTG TATTTACAGA GCCTAACGGA CGTGATCAGT TCGAAGCAGC TCTTGAGGCT GAACTTGGTC TTGATATTCA GTTTACACAG CCAGATCACT CTGGTTACTA TGATGCTGTA GGTCTTACAT TTGCAGGAAG TAACTGGCCA GATGTTGTTC TTTTAGGTGG CTCTTACTAT ACTAACTATG CAAGTGCAGG TCTTTTAGCA GACATCACAG AGTACTGGGA TAATTCTGAA TTAAAAGCTT CTGGAAGAAT TACTAATATT GATGCAATGG AATCAATTAA GATTGATGGA AAGCTATACG GCTTTACTCC AGCAAGAGGT AACGGATGTG TTACTTACGT TAAAAAAGCT TGGTTAGATA AAGCTGGCTT ATCAGTTCCT ACAAACTATG ATGAGTACCT TAACATGTTA AAGGTATTTA CAGAGTCTGA TATGAACGAT AGTGGTGATC CAACTAATAC TTATGCAGTT TCTGCAGCTG GTCTTATCGG AAATGAAGCT CCATATACAA ACTATTTACC AGAGTTCTAT CAAGATGCTT ATCCAGATTT CATGAAGGAT AATAGCGGTA AGTGGATTGA TGGTTTCTCT ACAGATGCTA TGAAATCAGC TCTTCAGAGA TTAAAGGATG CTTACTCTGC AGGTTATATT GATAAAGAAT CCATTACAAA TGGTACAAAA GACTGTCGTA ATAAGTACTT TGATGATAAG TTTGGTGTAT TCACATACTG GGCTGGTACA TGGGCAGCTA ACTTAAAGAG CAGTCTTGAA AGTAATGGTA AGGATGGCGA ATTAGTAGTT CTTCCTCCAA TTGCTGAGTT AGGTAACTAT CTTGAGAGAC AGGCTCCAGC TTGGTGTATT ACAACTAAAG CTGCAAATCC TGCTGGTATC TTCAAGTATT TCCTTGAGCC AATGTTAGAT GGCGGTGCTG TTCAGACACT TTGGACATAT GGCGTTAAGG GCGTTCACTG GGATGACAAA GCTGAGACAG TTAAGTTAAC AGAGGAAAAA CAACTTACTT ATACAGAAGG TCAATTCCAC ATGTTACCTT CTTTAGAAAA GCCAGACACA TTATTAACTA AGAATCACAT CGATCCAATG TTAAGCTTTG CAACATTCGC TGATAATGCT GATCCTGGTA TTGCAGCTGT TAATCCAGTT GCTAAAGATG CAGCTGAAAA GTTCAATTCA TGGGCAGTTC CAGCTAATGT AGTAGTAAGT AATGATACTA TTAATGAGTA TAATGCAGAT CTTGTTGATC TTCGTACTCA GATTGTAACT AAGGTTGTAA CTGGCAGTAT GAGCGTTGAG GATGGTATGA ATGAATATAC AAGCCAGTCT GCTGATATGG TACAGGAAAT TTTAGATTCT TTAAACAACT AA
|
Protein sequence | MRKSTQKFLS LGLSLALVFS LTACGSKNDG PKTEPTTAPT KGAENNGGAT TTPEAPVSEV EKPSVITVMM DGTVFTEPNG RDQFEAALEA ELGLDIQFTQ PDHSGYYDAV GLTFAGSNWP DVVLLGGSYY TNYASAGLLA DITEYWDNSE LKASGRITNI DAMESIKIDG KLYGFTPARG NGCVTYVKKA WLDKAGLSVP TNYDEYLNML KVFTESDMND SGDPTNTYAV SAAGLIGNEA PYTNYLPEFY QDAYPDFMKD NSGKWIDGFS TDAMKSALQR LKDAYSAGYI DKESITNGTK DCRNKYFDDK FGVFTYWAGT WAANLKSSLE SNGKDGELVV LPPIAELGNY LERQAPAWCI TTKAANPAGI FKYFLEPMLD GGAVQTLWTY GVKGVHWDDK AETVKLTEEK QLTYTEGQFH MLPSLEKPDT LLTKNHIDPM LSFATFADNA DPGIAAVNPV AKDAAEKFNS WAVPANVVVS NDTINEYNAD LVDLRTQIVT KVVTGSMSVE DGMNEYTSQS ADMVQEILDS LNN
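The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of relating the two is given below, assuming the gene sequence is shown in coding orientation (it begins with ATG although the gene lies on the minus strand) and using the elongation codon assignments of translation table 11, which are the same as those of the standard code. The helper names are hypothetical, and only the first three and the last two nucleotide blocks of the record are used.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G), shared by
# translation tables 1 and 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last two blocks of the gene sequence above.
head = clean("ATGAGAAAGT CTACACAGAA ATTCTTATCA")
tail = clean("TTAAACAACT AA")

print(translate(head))  # MRKSTQKFLS
print(translate(tail))  # LNN*
```

The head fragment reproduces the first block of the protein sequence (MRKSTQKFLS), and the tail fragment ends in LNN followed by a stop codon, matching the last three residues listed. Applying `translate(clean(...))` to the full gene sequence would be the natural next step for checking the whole record.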