Gene Information |
Locus tag | Cphy_0931 |
Symbol | |
ID | 5741803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1186734 |
End bp | 1188368 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641292042 |
Product | extracellular solute-binding protein |
Protein accession | YP_001558054 |
Protein GI | 160879086 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
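The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length does not count the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1186734
end_bp = 1188368
gene_length_bp = 1635
protein_length_aa = 544

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under those assumptions all three checks agree with the record: 1188368 - 1186734 + 1 = 1635 bp, and 1635 / 3 - 1 = 544 aa.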
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0552135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA ATTGGATGAA AATAGCAGCG ATGGGAATGA GTATCGTGCT AGCAGCAGGA GCTTTGACAG GTTGCTCTAG AGGAAATAGC AACAAAGAAG ATTCCAAGGT AGAAGAACAA GGAGTAGATA AGGGTGGACA AGATGTTTCT AAAGAACCTG TAACTATTGA GTGGTTGGCA TATAATACAT ATTCGCAACC GAATACAGAT ACTGAAATAG TAAAACAAAT TGAGAAAAAG TTTAATGTTA AATTTGAATT TTGGTACGTG GATGACCAGA AGTGGGATGA AATTCTTGGT GCAAAACTAT CTTCAGGAGA TATGCCAGAT GTCATGAAAA TAAAGAACAC TGCCAATATC CCAACTTATG TAAAACAGGG AATTCTTGCA GAATTTACAG ATGAAATGTT GGCTAAGATA CCATCCTTTA CAAAACAGGT TGAGGAAGCC AATGTAGAAG GAAATGGTCT TATTGATGCA TATTATGATG GTAAGAGGTA TGCGATTAAA ACACCTTCTA TTTCTGGAAC ATATCCCACT GTTTTGGTAT GGAGAACAGA TTGGTTAAAG AATCTTGGCA TAGAGAAGAT ACCAGCTACT ATTGATGAAA TGGAAGAGGC CATGTATGCT ATTCGCAACA ATGATCCAGA TGGTAATGGA GTAAAAGATA CCTATGGAAT GTCCAATACA GCTATGAATG CAGTATTCGG TGCTTACGGT GCCATTCCGT TGAAGGAATT TAGAGGGACA GGAGCGCAGA ATCTTTTCTT TACAGAAAAA GATGGTAAAA TTGAATTTGC GTGTACACAG CCGGAGATGA AAGCGGCACT TGCTACAATT CAAAAGTGGT ATAAGGAAGG TTTAATTGAT CCAGAATTCA TCACAGGAGA GAATACAGCT GGATACTGGG CAACTTCACA AGCATTTGAA AATGGAAAAG TGGGAGTGAC AGGAATGGCA TTGGCTTCGC ACTGGGCGCC ACCAGTAGAA GAAGGAAAAA AGGGTGGAGC ATGTTATGAG GGATTTGTGG CAATGAATCC AGATGCAAAA TGGGAAGAAA CTGTTAACAT AGGACCAGCA ATTCAGGGGC CAGAAGGAAA ATCAGGGACA CACACTTGGG GAGCTTTCAG CCCTTCTGGA TTTGGTATAA CCACAAAATG TGCAGAGGAT CCACGAAAAG TAGATGCAAT ACTAGCCATG ATTGAAGCAT ACTCCTCAGA TCCAGAATAT GCGCTATTAG CAGGCTGGGG AATCGAAGGT ACACACTATG AGAAAACCGA AGAAGGCGGT GTACGACGTC TTGAACCATT TACGAAACCA TCCGAATATA TACAAGATGG AGTTGGGGTT TTTATGCTTG GAACTAACAC TGAATTTGAT AGAAGCTTGA GCAAAAACGT ATTTGATTTT AGTGATAAGT ATAAGACACC TGGATATCAG GATATTTTAG TACCAGCAAC AGAGGCAGCA AATCAGTACT TAACTGATTT GAAGATTTTT ACACTAGATG CTTATATTAA GATAATGACA GGCGAAGAAA GCGTTGATTA TTTTGATACC TTTGTAAAGG AGTTTAACTC CATGGGTGGA GAACAAATTC TAAATGAAAT CAATGCAGAA ATAGCAAAAA ATTAA
|
Protein sequence | MRKNWMKIAA MGMSIVLAAG ALTGCSRGNS NKEDSKVEEQ GVDKGGQDVS KEPVTIEWLA YNTYSQPNTD TEIVKQIEKK FNVKFEFWYV DDQKWDEILG AKLSSGDMPD VMKIKNTANI PTYVKQGILA EFTDEMLAKI PSFTKQVEEA NVEGNGLIDA YYDGKRYAIK TPSISGTYPT VLVWRTDWLK NLGIEKIPAT IDEMEEAMYA IRNNDPDGNG VKDTYGMSNT AMNAVFGAYG AIPLKEFRGT GAQNLFFTEK DGKIEFACTQ PEMKAALATI QKWYKEGLID PEFITGENTA GYWATSQAFE NGKVGVTGMA LASHWAPPVE EGKKGGACYE GFVAMNPDAK WEETVNIGPA IQGPEGKSGT HTWGAFSPSG FGITTKCAED PRKVDAILAM IEAYSSDPEY ALLAGWGIEG THYEKTEEGG VRRLEPFTKP SEYIQDGVGV FMLGTNTEFD RSLSKNVFDF SDKYKTPGYQ DILVPATEAA NQYLTDLKIF TLDAYIKIMT GEESVDYFDT FVKEFNSMGG EQILNEINAE IAKN
|
| |
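The relationship between the gene sequence and the protein sequence can be illustrated by translating the first three 10-base groups of the gene sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the constant names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons; since this gene starts with ATG, no special start-codon handling is needed here.

```python
from itertools import product

# Amino acids in NCBI "TCAG" codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored so the space-separated 10-base groups used in the
    record can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGAGAAAAA ATTGGATGAA AATAGCAGCG"
print(translate(prefix))  # MRKNWMKIAA
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same helper should, under the assumptions stated, reproduce the full protein sequence.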