Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0739 |
Symbol | |
ID | 4810357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 900407 |
End bp | 902131 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106156 |
Product | hypothetical protein |
Protein accession | YP_001037167 |
Protein GI | 125973257 |
COG category | [S] Function unknown |
COG ID | [COG1520] FOG: WD40-like repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0166259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAA AAACATCCAA AATCATACTT ACTATAAATA TAATAATACT GCTTTTGGCT GTTCCGGGGT TTATTATTCT GGATAAATAT CTTGAACGCT CAAATTCCAA TCCGGACAAT ATCGTTGCGC CACCTTCTCC GGATGTGGCT TCCAATCCCG GTGAAACTAC CATACCGGAC GTTACCGCAA GTCCGGAAAA CACTGACACC AAAAATTGGA CAGCAATAAC TCCTGATGAA ATCGATATTG TAAAAGAGTG TCTTCCCGAA AGCCACAATT TTAAATGGGA CGTTATTAAA GACGGCAAGA AACTTGATAC ATATTCAAGA GACAATACCG TTGTTTTTAA ACCGGCTGAT GAATACAATG AAATTGACGG TGTAACCACC TTCAGAGGAA ACAACTACAG AAACTCCGCA AGCTTTGGTT CTGCAAACGT CAGGGAAGAA AAACTCGAGA AAGTTTGGAG TATCAAAATA GGATATATAG ATACATGGAC AGGTGTTGGC TGGAACGGTC AGCCGGCAAT TGTAAAATGG AGCAACGAAC TTCGAAAAAA AATGAATTTG TTTCAGGATA AAAAGGATAA AAATGATCTC AAGGAAGTCA TATATGCCAC TTTGGACGGA AAAATATATT TTCTCGACCT TGACGACGGT TCCTACACCA GAAATCCAAT TAATGTAGGT GCTCCGCTCA AAGGAAGTGT AACCGTGGAC CCCAGGGGTT ATCCCCTTCT CTATTCAGGT CAAGGCATTG ACGAAGTAAA AGGCCAAAAG GTTTCGATAG GTTTTCGCAT ATACAGCCTT CTGGATCAAA AACTTCTCTA CTTTATAAAC GGCCTTGACA ATACTGCTTT CAGATACTGG GGAGCTTTTG ACTCTTCCCC TCTTTTGCAC AAAGAAACCG ATACGCTGTT TTTATGCGGG GAAAACGGCC TTTTGTATTC CATAAAGCTA AATACGGATT ATGACCCTGC ACAACCTGCT ATTTCAATAA AGCCTGATAT TGTAAAATAC AGATATGTTT CTCCCGTCAA CGGCAGACTT GGAACTGAAA ACTCCATAGC CGCTTTCAAA AATTTCGGCT ACTTTGCTGA CAACAGCGGA ACTCTCCAGT GCGTTGACTT AAACACTCTG TCTCCGGTAT GGATAAGAAA CATAACCGAT GATACGGACA GTACAATGGG TATTGAGGAT TTAGGAGGAA ACAACGTTTA TATCTATATT GCAAACGAAG TTGACCTCCA GGGAGAAAAC GGATACAGTT ATGTCAGAAA AATAAATGCT TTGACAGGAA GTCTTGTATG GGAAAAGAAA TACAAATGCT CATATAACGC AGATACAAAC GGCGGAACAT TGGCCTCTCC CGTAATTGGA AAAAACGAAA TCAGCAATCT GGTTATATTC AGCATAGCCA AATCCTATAA GAAAAACGGC GGAAAGCTAA TTGCCTTTGA CAAAAATACC GGCGACGAAG TATGGGTTAT AGATTCGGAT TTCTACAGCT GGAGTTCACC GGTTGACGTA TATACTGAAG ACGGCAAAGC TTATATCATT CATTGCGATT CCGCTGGGTA TATGAACCTC ATTGAAGGCA AAAGCGGCAA AATCCTTGAC AAAATACCTC TTGGCGGAAA TATTGAAGGT TCACCCGCAG TTTATGACAA TATGATTGTA GTAGGCACAA GAGGTCAGCA AATATATGGA ATAAGAATAA AATAG
|
Protein sequence | MNTKTSKIIL TINIIILLLA VPGFIILDKY LERSNSNPDN IVAPPSPDVA SNPGETTIPD VTASPENTDT KNWTAITPDE IDIVKECLPE SHNFKWDVIK DGKKLDTYSR DNTVVFKPAD EYNEIDGVTT FRGNNYRNSA SFGSANVREE KLEKVWSIKI GYIDTWTGVG WNGQPAIVKW SNELRKKMNL FQDKKDKNDL KEVIYATLDG KIYFLDLDDG SYTRNPINVG APLKGSVTVD PRGYPLLYSG QGIDEVKGQK VSIGFRIYSL LDQKLLYFIN GLDNTAFRYW GAFDSSPLLH KETDTLFLCG ENGLLYSIKL NTDYDPAQPA ISIKPDIVKY RYVSPVNGRL GTENSIAAFK NFGYFADNSG TLQCVDLNTL SPVWIRNITD DTDSTMGIED LGGNNVYIYI ANEVDLQGEN GYSYVRKINA LTGSLVWEKK YKCSYNADTN GGTLASPVIG KNEISNLVIF SIAKSYKKNG GKLIAFDKNT GDEVWVIDSD FYSWSSPVDV YTEDGKAYII HCDSAGYMNL IEGKSGKILD KIPLGGNIEG SPAVYDNMIV VGTRGQQIYG IRIK
|
| |