Gene Information |
Locus tag | Cthe_0914 |
Symbol | |
ID | 4810535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1094627 |
End bp | 1097179 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106333 |
Product | hypothetical protein |
Protein accession | YP_001037341 |
Protein GI | 125973431 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
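The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Cthe_0914.
START_BP = 1094627
END_BP = 1097179

length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp)                     # 2553, matches "Gene Length"
print(protein_length_aa(length_bp))  # 850, matches "Protein Length"
```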
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACA ATTTAAAAAT TATTCTAGCA AAGCAGAAAG AAATCATAAC AAGTCTTGAA AATGAAATTG CCGGCATCGA AGCCAACGAC CTTGTAAAAG AAAATCAAAA ATTGAAAGAA GAGCTGGCAA AACTTAAATC AAGTTTGGAA AAGGAAAAAA TTGAGAACGA AAAACTCTCA AAAGAAAACA AAAATTTGAG AAACATTTTG TATGAACAGT TCTACAACGA AAAAATCAGC CTTTTGAACT CAGCGGAAAA AAGAATGGAC GTTTATTACA GGGCTCATGT GGAAGGAGAA ATAAACAGGC TGACAAGATT TGAATTGTCG GTAAAAAGAC GTATTGATGA AATGACAAAA GTACTTCGGG CCAACAGAGT AAGCCTTCAG GATGAAATTT ACATAAAGCT TGAGGAGCTT AGAAATCTTC TTAACGTTAA AATAACCAAA GCCCGTGAAG AAATCATGCA ACAGACCGGA GCATTTATTC AAAACAGAAA CGAGGGCCTT GCAAGGCTAA GACAGGAACA GGTTACGGAG CAGGAAATAA AAGCCCGGGC AAAGCAAAAC AACATTGAAT CCATTATAGG ACTTAACATA ATCAACAAAG TGGGTATATT TTTACTCATT GTAGGAGTAA TAACCGCGGC GCAATTTACA TATTTCAGGC TGCCCGATAC CTTAAAAAGC GTTTTTACCT TTGCAGTGGG CGTTGTTCTG CTCATAGCCG GTGAAATATT AAACCGCAAA AAGCCCAACG TTTTCTCACT GGGCATAACC AGCGGAGGTA TTGCAATACT GTATGTGGCT CTTTGCCTCA GTTATTTTCA ATTTAAACTG CTTGAAACAT ATCCGGCTTT AGGATTGTGT ATTCTCATAA CGGCAGGAAC TTTTGTCCTT TCGCAGAGGT ACAATTCCCA GACCATATCG GCTTTTGCAC TGATCGGGGG ATATCTTCCC ATCTTCTCGA TAACCGGTAT TAGTGCAATA GCATACGGTG CCATGGTTTA CTTTGTAATA CTGAATCTTT TGGCTCTTAT TATTGCTGTC AACAAAAAAT GGGCTGTCAC TGCATACATA GGATTTGTGC TGAACGTCAT AGGTTCCGTA TATATAGCAT CAATAATGTT TGGAGGACTT TTCGCACCAT CGGAATTTTC ATTCGATTCC GTCATTACCA TAATTTATAT TTTGTTCGCT TTTGTAATTT ATACTCTGAT ACCAATAGCG GGAACCTTCA GGCAAAAATT GAGTTTTAAA ACTTCTGACA TTGTACTGCT CGCATTAAAC ACATTTATCA GCAGTGTACT TCTTTACTGG TCATTTTATG CATCCGACCT CGAAGACTTT ACGGGACTTC TTGCCCTCAC CTTTGCAATT ATTTATCTTG CCCTTGGAAG GTTTATTGAA AAAAACATGC CAAAGGAAAA GCAGGTTACC ACTTTGTTTT ATCTTACAGG ACTTACCTTT GTCGTACTTA TAATACCTTT CCAGTTCGGC AAAGTGTGGC TGTCCCTGGG TTGGCTGATT GAGGGTGTTG CCCTGCTTTC CTACGGTATA TGCAAAGAAA TAAAAAGATT TAAAAAAGCC GGAATTGCAA TTTCATTTTT GTGCCTGTTA ACTTTCCTTT CCTTTGACGT GTCACTGATT CAAGATTCTC TTTTTACCTT TAAATATTTT GCAATAACTT TGGGCAGCAT CATCATTTTA GGAGCCCTCA TATACAAAAA GAATCTGGCA AGCAAGGAGT CAAAACTGTT CAAGTATGCC ACATCCATAA ATCTGCTGAT CTTTTTGCTA 
TACATAATCA GCAACGAACT AAAACCATTT TTATCCCAAT ATGTTCAGGA CACAAAATAT GTTCAGGACA CAAAATTTGA CCTGGATTAC TTGATCTGCT CGGCAATGAT TTTAACAAGC TTTCTTGTGG CATACACCCT GCCAAGGATA AAAGTTTTGT GCGACAATGT TATAAAAGGC ATTTCAATGT CTATTTATGC TTTGGCACTG CTAACCCTCT TTTCCCTGAA TTTCACCTCC CCCGTGAAAG GATATTTGGT CGAAATGCCT TTGGCTATAA GTATTGTCGG AACCCTGGAA CTTGCTTTAA TAGCTTTCCT GTCCATCCTT GCCGTAAGGG ATTTGGTATT GTATTTTGTA ATTGACCACA AACTGGGTAT TGAATGGTAC CCACTTGCAA TATCTCTTTA CTTTGTAATA ATACTGACTC AAAATTTAAT TACCCAGTAC AGGCTTGAAT TCAGCAACGC GGCAATAAGC ATTATTTACC TTGCCGCAGC TCTCACATGG ATAATATTCG GTTTTGCCAA AAGGTATGTA TTCATCCGCC GTTTCGGACT TGCCATGTCC ATGCTTTCGG TGGCAAAGCT GTTCATTATC GACCTTGCTT TCCTGACCCA AGGTTACAGA ATAGTATCCT ATTTTGTGTT CGGCATAATA CTCCTTGCAA TATCTTTTGT ATATCAGTAT TTCAACAAAA AATTAGAAAA TATATGCGAG GTAATGCCTG ATGATAAAAA GAATAGTAAT TAG
|
Protein sequence | MTNNLKIILA KQKEIITSLE NEIAGIEAND LVKENQKLKE ELAKLKSSLE KEKIENEKLS KENKNLRNIL YEQFYNEKIS LLNSAEKRMD VYYRAHVEGE INRLTRFELS VKRRIDEMTK VLRANRVSLQ DEIYIKLEEL RNLLNVKITK AREEIMQQTG AFIQNRNEGL ARLRQEQVTE QEIKARAKQN NIESIIGLNI INKVGIFLLI VGVITAAQFT YFRLPDTLKS VFTFAVGVVL LIAGEILNRK KPNVFSLGIT SGGIAILYVA LCLSYFQFKL LETYPALGLC ILITAGTFVL SQRYNSQTIS AFALIGGYLP IFSITGISAI AYGAMVYFVI LNLLALIIAV NKKWAVTAYI GFVLNVIGSV YIASIMFGGL FAPSEFSFDS VITIIYILFA FVIYTLIPIA GTFRQKLSFK TSDIVLLALN TFISSVLLYW SFYASDLEDF TGLLALTFAI IYLALGRFIE KNMPKEKQVT TLFYLTGLTF VVLIIPFQFG KVWLSLGWLI EGVALLSYGI CKEIKRFKKA GIAISFLCLL TFLSFDVSLI QDSLFTFKYF AITLGSIIIL GALIYKKNLA SKESKLFKYA TSINLLIFLL YIISNELKPF LSQYVQDTKY VQDTKFDLDY LICSAMILTS FLVAYTLPRI KVLCDNVIKG ISMSIYALAL LTLFSLNFTS PVKGYLVEMP LAISIVGTLE LALIAFLSIL AVRDLVLYFV IDHKLGIEWY PLAISLYFVI ILTQNLITQY RLEFSNAAIS IIYLAAALTW IIFGFAKRYV FIRRFGLAMS MLSVAKLFII DLAFLTQGYR IVSYFVFGII LLAISFVYQY FNKKLENICE VMPDDKKNSN
|
| |
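As a rough illustration of how the gene and protein sequences relate, the sketch below translates the first 30 bases of the gene sequence and compares the result with the first 10 residues of the protein sequence. It assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard genetic code (the tables differ in which start codons are permitted). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. The GC fraction shown is for this short prefix only and is not expected to equal the 38% reported for the whole gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence, copied from the record.
GENE_PREFIX = "ATGACAAACA ATTTAAAAAT TATTCTAGCA"
# First 10-residue block of the protein sequence, copied from the record.
PROTEIN_PREFIX = "MTNNLKIILA"

print(translate(GENE_PREFIX))                    # MTNNLKIILA
print(translate(GENE_PREFIX) == PROTEIN_PREFIX)  # True
print(gc_fraction(GENE_PREFIX))                  # GC fraction of the prefix only
```

Applying the same functions to the complete gene sequence (both lines joined) should reproduce the full protein sequence, provided the sequence is copied without loss.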