Gene Information |
Locus tag | Cphy_2274 |
Symbol | |
ID | 5745333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2803758 |
End bp | 2806796 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641293364 |
Product | extracellular solute-binding protein |
Protein accession | YP_001559374 |
Protein GI | 160880406 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
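The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2803758, 2806796)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reports 3039 bp and 1012 aa, which agrees with the Gene Length and Protein Length rows.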
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAACA AAAAGGAGAA GAAGAAAATG GGAAGAAGAA TGAGGTTTAG AAGAACTCTA TCATTCCTTC TGGCTGCGAC CATGCTATTT ACACCAATTC AAAGTGGAAA GACTATAATT GCTGCCGAAA ATAAAACATT GGCTGATTAC GAAGAAATTG TTGGTTCCTA TTCCGTTGAC AAATCCATTC CGCGATATAC GATTTATAAG GAAGGCTTTA ACAATGTGAA ACCTCCGGAT GAGTACACGA TTGAAGCAAA TGATTATGTT TATTATACAG AGGGCGAACT TGATTCAAAC AAGAAAGAAA TAGCCGCAAT ACCAGAAGAA TTGAACGATT ATGAAGGCAT GGATGGAACC TCGGTTATTA TTTCAGAAAC GGGATTTATT GAATATGAAG TAGATATAAA AAATGCAGGT TTTTATGATA TTGCCTTAAC TTATTATCCA ATTGAGGGAA AGAATTCTGA AATTCAAAGG AGTTTCTTTA TTGATGGAGA ACTTCCCTAC GATGAACTGG CACTTATTGA ATTCTCCAGA GTATGGTCAA ACAACATTAC AGAAACCTAC CTCAATGAGG ATGGAATACC GACAAAGAAA TGGGAAAAAG ATAATCAAGG AAATGACTTA AAGCCGACGC TTTATGAAAC ACCAGAATGG ATTACGAGTT ATTTATATGA CAGTAATGGC TTTGTTATAA ATCAGCTTTC GGTGTATTTT ACTGAAGGAG TACATAAAAT TTCTATGCTT TCTTTAAGAG AACCGATGTT ACTTCGTAGG ATTACCTTAA ATAATACAAC GGAAGTAGTA GACTATAAAA CTAAAAAAGC AGAATGGGAT GGGATGAATG CAAAGGATAC GTCAGGCATT CTTGTAAGGG TCGAAGCAGA ATCTGTTACG AAAACGTCTT CGCAAATGTT GTATCCAAAA CAAGACCAAT CCTCCCCTGC GGTTTATCCT TCAAGTACCA AAGAACTGCT AAACAACACG ATTGGAGGCA ATTCCTGGCG TTTGGTAGGC CAGTGGATGG AATGGGATTT TGAGGTACCA GAGAGCGGTT ATTACAAAAT TGGGTTCTAT GCGAAACAAA ATTTCGTAAG AGGTATCTAT GTTTCGCGAA AAATTACAAT TGACGGTAAT GTTTTGTTCA ATGAACTAAG CGATTATGGC TTCCAATATC AATCCAATTG GAGATTTGAT ACCTTAATGG ATGAAAATAA TGATGCCTAT AAGATTTATT TAGAAGCAGG AAATCATACC CTTCGTATGC AAAACGTATT AGGCGAATTC TCTAGTATCA TCGGAGAGGT TCAAGAATCC TTATCAAAAC TAAACTCTAT CTATCGAAAG ATCATCCGTA TTACGGGTGT GAAACCGGAT ATTTATAGTG ATTATCAGAT TGAAGCAAAA TTACCAGAGC TAGAAGCTGA GCTAATCGAT GTACATAATC AACTCGATTT CGCAATCAAA CATTTACGTG AGGTAGCAGG AGGAAGTAGT GATAAGGAAG CTGTTTTAGT AACGATGCGT GATCAGTTAG AGGACTTAAT AAAAGATCAG GAATATTTTA AAAAGATTGT GACTTCTTAT AAGATTAACG TTCGTGCGGT AGGAACTTGG TTATCAGGGG CAGTTTCGCA ACCTTTACAG TTAGATGCCA TTTACATCTA TTCGCCAGAT GTGGATGCGA ATATATCGAG AAGTTCTTTC TGGTCGAAAT TATGGTACGA GATTTGTAGA CTTTTTTATT CCTTTGTTAT CGATTACAAC CAGATTGGTA ATGTTTCAGA AAGAGGGAAA 
GAAAGCCATA CCATTACGTT ATGGGTTGGT ACTGGACGTG ACCAGGCAAA TGTAATCAAA TCCTTGATTG ATGAGACCTT TACAAAGAGA ACAGGAATTA ATGTAAATGT AATGCTCGTT GATATGGGAA CTTTACTACA AGCAACGCTT GCTGGCCAAG GACCCGATGT CGCGATTCAG CTTAATATAT CAAATCCAAC TTATAATAGC GCAATCCAAA GCTCGAATGA TATGCCTATG AATTATGGAC TTCGTAATGC GGTTGCTGAC TTAAGTCAGT TTAGTGACTT AAAGGAAGTT AGGGAGCGCT TTTTTGATAG CGCACTGGTT CCGTTCACCT ATGACAATCA TACCTTCGCT CTCCCAGAGA CACAAACATT CCCGATGATG TTTTATCGAA AAGATATCTT AAAAGAGTTG GGACTTACCT TACCACAAAC TTGGGATGAT GTGAAAGTAA TCATGTCAGT ACTTGCTAAA AATCAGATGG AATTTGGTAT GTTACCAAAT GAGTTAAATT TCTTAATGCT ACTAAATCAG TACGGTGGCC AGTATTATAA CGAAGATGCA ACAAAATCTG CACTAGATAG TGATGAAGGA ATTAACGCAT TTAAGGAGTA TTGTAGTCTC TATACCGAAT ATAAACTCGA TAAAGTTACC AGTGTTGAGG ACCGATTTCG TACTGGGGAA TGTCCAATCA TTATTGCGGA TTATTCTGTT TATAATAATT TTCAAGTTTC TGCTCCCGAC ATCAAAGGGC TGTGGGGATT TGCACCGGTC CCTGGTATGA GGAAAGAAGA CGGGACAATA AATCGAAATG TAGCTAGTGT TGGTTCAGCC TGCGTTATTA TGGAGTCGTC AAAATATAAA GAAGACTCCT GGGAATTTTT AAAGTGGTGG ACTAGCGCAG AGATACAAAC ACTTTATGGA AAAGAGATGG AAAGTTTAAT GGGAGCTTCT GCGCGTGTAG CAACTGCGAA TAAAGAAGCA TTTGAAAGCC TTCCATGGCC ATCTGCAGAT TACAATGCTT TAAAGAAACA GTTTCAAAGT GTTGTTGGTA CAAGACAGGT ACCAGGTGGA TACTTTACTT GGAGAAACAT CGACAATGCA TTCTATAAAG TAACCACGAA TACGGATTCT GCATCTGCAA GAGAGTGCCT CATGGATAAT ATCATCTATA TTAATGATGA AATTAATTAC AAGAGAAAAG AATTTCATAT GCCATTGTCG AATGACTAA
|
Protein sequence | MNNKKEKKKM GRRMRFRRTL SFLLAATMLF TPIQSGKTII AAENKTLADY EEIVGSYSVD KSIPRYTIYK EGFNNVKPPD EYTIEANDYV YYTEGELDSN KKEIAAIPEE LNDYEGMDGT SVIISETGFI EYEVDIKNAG FYDIALTYYP IEGKNSEIQR SFFIDGELPY DELALIEFSR VWSNNITETY LNEDGIPTKK WEKDNQGNDL KPTLYETPEW ITSYLYDSNG FVINQLSVYF TEGVHKISML SLREPMLLRR ITLNNTTEVV DYKTKKAEWD GMNAKDTSGI LVRVEAESVT KTSSQMLYPK QDQSSPAVYP SSTKELLNNT IGGNSWRLVG QWMEWDFEVP ESGYYKIGFY AKQNFVRGIY VSRKITIDGN VLFNELSDYG FQYQSNWRFD TLMDENNDAY KIYLEAGNHT LRMQNVLGEF SSIIGEVQES LSKLNSIYRK IIRITGVKPD IYSDYQIEAK LPELEAELID VHNQLDFAIK HLREVAGGSS DKEAVLVTMR DQLEDLIKDQ EYFKKIVTSY KINVRAVGTW LSGAVSQPLQ LDAIYIYSPD VDANISRSSF WSKLWYEICR LFYSFVIDYN QIGNVSERGK ESHTITLWVG TGRDQANVIK SLIDETFTKR TGINVNVMLV DMGTLLQATL AGQGPDVAIQ LNISNPTYNS AIQSSNDMPM NYGLRNAVAD LSQFSDLKEV RERFFDSALV PFTYDNHTFA LPETQTFPMM FYRKDILKEL GLTLPQTWDD VKVIMSVLAK NQMEFGMLPN ELNFLMLLNQ YGGQYYNEDA TKSALDSDEG INAFKEYCSL YTEYKLDKVT SVEDRFRTGE CPIIIADYSV YNNFQVSAPD IKGLWGFAPV PGMRKEDGTI NRNVASVGSA CVIMESSKYK EDSWEFLKWW TSAEIQTLYG KEMESLMGAS ARVATANKEA FESLPWPSAD YNALKKQFQS VVGTRQVPGG YFTWRNIDNA FYKVTTNTDS ASARECLMDN IIYINDEINY KRKEFHMPLS ND
|
| |
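The gene and protein sequences above can be compared by translating the gene sequence and computing its GC content. The following is a rough Python sketch, not the method used by the database. It assumes the gene sequence is already given on the coding strand (it begins with ATG and ends with TAA), that the space-separated blocks of ten bases are display formatting only, and that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the usual TCAG ordering; the amino acid
# assignments are those of the standard code (assumed equal for table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of ten)."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence shown in this record.
print(translate("ATGAATAACA AAAAGGAGAA G"))
```

The first seven codons translate to MNNKKEK, matching the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content row.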