Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1141 |
Symbol | |
ID | 4810809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1354697 |
End bp | 1358758 |
Gene Length | 4062 bp |
Protein Length | 1353 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106563 |
Product | hypothetical protein |
Protein accession | YP_001037566 |
Protein GI | 125973656 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.485146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATAT GGGCTCCTTT CGGTACCAGT GATTTTCCTT TAAATCAGCC GATGGTAGGG CAGAAAGAGT TTTATGAAAT TTTCAAAGGC TTTACGAAGA CTATGAAAAG TGCAGGGATG GCAACTATTT TCCCGCTGAT TTCAAAGTGG GGTGTAGGTA AATCCCGTAT CGGTTTCGAA CTTATTTCAG AACCTCTTGG CATGGATAAG GGCTGGATTA TTAATGAAGA TGGCGTACAA AAGGAAGTGA GAATTTTTAA GCCGAATTTC GAAGATAAGG TTCTTCCATT ATACATAAGG TATTCCCAAA TGTGCCATCC TGATTTAATT GGAGATAACT GGGTGGCTTA CGGCATTTAT ACAGCGCTGT CCTATTTAAG CAGAGAATCC GATGGAAGCA TTCAAGGAAA GATTATGGAA GCCATTCAAG ATGCCTTATC TCCTGTAGGG TTTGATAGAA ACATCCTTGG AGATTTACTG CAAGTAGACA GGGTAAATTT AAATGAACTG GTTATAAACA AAGAGAAGTT GGATGAATTA ACCAGAAAAG GAATGGAGTA TGTTAAGCAA TTCGGGATTG AACATTTTCT TATTGTCTGC GATGAATTAG AAACAGCGGG GGAAATTGCA AAGTATGGAA TTGAAAAAGA GAAGGAACTG GTAAACAAAA TAGATGGAGA AGCAATTCAG GTAATTACAA GTGCTATAAA ACATGAAGAC CCGAGAAAGA AATATCCAGA GGTTTCCTAT CTTCTTTTGT GTTCTACAGT TATAGGAGGT AGTATCCAGG GGATAGGTGC TTTAGACAGA AGAACGGAAA TGTATGAAAT GCTTCAGAAT TCCTTTGCAG ATATATCGGA TTATATCGCT TATCTTAACA AGAAGGGCAT GATACCCGAT TATCCAAAAG GACTGATAGA AGCAGCCTAT ACCATTGCAG GAGGTAATTT TGGCTGGTTT AACGTTATTA TGTATAACGT AGACCAAAAG ATGGAAGACA GTTCTGTCAA AAAAGAAACA GGATATATTT TTGAAGCCAT ATTAAACTCC AGCAACCGCT TTAAAGAAAG TTTGATTGAT AAACCTGCTT TTGACTATAT TCAGTGTGAT GATAGATTCA GGCAGACTAT TAAGCATGCT CTTTTAGAGC AAATCCCTAA GAAGAAAACA AACTATTCTC CTGAAGAAAT AAATGCCATG ATGGATGCAA AGGCAGAAGA TGGCGAAAAG TTATTCAAAG AGTTTTACTG TGTGAAATTA GAAAAGGATG ATTTGGCGGT TTACCTCAAT TCTCAAGGCT ACAAAAGAGA AACGGGAGAT ATATTTGTTA ATAATTTTGG AAGCAGTTTT GATTTAGGCA TACTTTTGAG AAGCCTAAAA ACTTTTTCTC TCAATGTAAA AGAAAATGAG TATATTGTGG GGAAGGAAGA AGAAACATTT TTAGACCAGG TCAGGATGTT GTATCCGAAG GATGATATTG AAGAATCAGC AAGATATATC TATGGATATA TCCTTGAAAA AATAGAAAAA GAAGATATAA AAGAAGCAGA ATATATCGGA CCAAACTTTG CCTATTTAAG CCGCCTTGAT AAAAGGTATA GAGTAGAAAA AGATGATTTT GGTTACGTTC CAGATACTGA AAAAAACAAA GAGATTGAAG CGTTAATAAA AGAAAGAAGC AAAAATAGAA AAGAGGAAGT AAAAAGGGTT CTTTCAGGTG CCTGCAGGGC ATTGGAAATA AACTACCCAG AAGAAAGTTT CTATACCATC AATGGCGTTG AATGTGTCAG GACCAAGGTA GAAGGAGGAC CTTATCTTGA TGTTCACGAG GACAGAATAG TAGATATTAT CTGGGGCAAG GATGAAGATA GATTAAAAGA TGCACTGTTA GATAGCAGAC TTTTAAAAGA AGGAGTTCAT CCTGTTATTG TAATCTCTGA TTCTGTTATT AGTGCAGAGT ACACCGATAA ATTTGTAAAA GAAAAGTATG AAGGCATAGG AAAGTGCTTG ATATTTGTCA ATATAACCAG ACTTCAAAAG GATATTTTAG AGGTCATGTC CATAGATAAG GACATATTGG ATATAAGGGA GAACAGGAAT GTTATTACTT CTACCTTTAG GGATAGAATC AGAAAAATCA GGGACCATTT TAATTTAAAG GCAAGAGAAT GGTTTGAAAA GTTAGATGAG GAAGGCTGGA TATTAAGACC TATCATCTAC AAGAAACATG ATGAGAAACA AATAGTGCTG CTGTCAAAGG CTTATAAGCG AATGCTCATC CACAACCTTG ATTTTGAGGA ACTTGGAAGC AAAAAAGATG TGAGATTGTC TGATGCAGAA TATACGGAGT TAAAGAACGT TTTAAAATCA ACCGCCATAG GCAGAATGAA TGAGGGAAAA GGATATAAGG AAACAGGTTT ATTCATTCAT GAGGATGAAA ATAGATACAG CATAAATATA CCCGCATGTA TGAACAGAAT ACTGAAGTTT AATGGAAACA GCAATAAATC TATGGCTGAT TACAGCAATA AATTTTTCTT CAGCCTTATA CACGAGGTAA AGCCGAAACG CATTCTAGAA CAGTGGATAG AGTTTATGAT GGGGCTAAAT CTTTTGCTAA AAACCAAGGA CGGTTTTATC GAGAGAATAT CCAATTATGA GTTGGACGGC AGGTATTCTG TAGTTAAAAA ATGGCTGGAA GATGACTGCA AAAAAGAAAT TGACAGCATG AAGAAGGTTA TAAACGGTCC TTATTTGGAT GTCCTTGAGA AGAATCAGGT ACCTTATTAT AAAGTACAAC TTCAAGAGGC TGAAAAAATT AAGGACAGCA TTAATGTTGA TAAGTTATCA GAAAAAGATA ATAAGAATAT GGAAAACTTC AGAGATGTTA TTTCGAAGAT TGAGGAATTC CTGGAGATTT GCTACCAAGT ATATGACAGT GATGGATGGA ATAACATCAA AACCTATAAT CCAAACATCA TAAAGGATAT TAAAATTGAT GATAAGGAGA AGCCTTTGTG GTTTAGGGTA AGGCATATAA GATTGTTTAT AGACTATATT AATACCTTAA AGGACCCAGC AGTAAAGAGC ATTACGGATA AAATTGCAGA AATCAAGAAA AATTGTGAGT ATGGCGGATT CATGCTTCCT ATTTCTCCGA TAACGAATAT ACTGCAAAAG TATTGTAATG AACTGGAGAA CTCAACTGAT TATGAAAAGA TGACTATATC GGGTACAGGG ACTATGGTTT CCTATATAAA TACATTAGCA TATAAGTTAA AAGATGGAGA TTTTAGTGGA GCATATAAGC GAATTGAAGA AATTCTTGAT GCCTGCGGAT TAGAGCCTAT TGAGAACAAT GAGTTAAAGT GGGCTAATGA TAGAGGAATA ATAGGTGAGT ATAAGGCTAT TTATAAGAAT TTTACTTCTA TCATTGACTG TTATAAGAAA TTACCTGAAA GCAAAAGGTG GGTTGATTAT TTTTCGGATG CACCAGAAAA ATTAAGGAAT CATACCGAGG TGAAGAATTT AAGCAACTGT ATTAAGGAAA TAGAGTTATT TGTTAACGGC GGTCTTGAAC AAGAGATTGA AGATAACGAG CCTGTTATGC TGAGTAAACC TGATGAATTT TTGGCGCTTC TTAAAAAGAG GGTAGAAGAT ATGCAGCAGT ATGTAGGGCT TATTGAAGGC TATAAAACAA ATGTAATGAA TTTAGCAAGG ACAAAGAAGA ATGAGTTTTA TGATAATTTG CTGATAAGTA CCATAGATAA GATATGCAGA GTACAGGGGA AACCTCCCGT ATCCGCACAG ATAAATACGG AAGAGTATCC AAAAGAGGAA ACTTATGCTG CCACAAAAGA AGCGATACAG AATAAAATGA ATTTGCTTGC AAGTGAAGGT GAATCATTCT TTAAAAACTC ACCATTGATT AAGAAAACTA CCTTTAGTTT CTTTAAGCAC GTAGTAGAAA AGGATGGGGA TATTAACTGG GATGACTACC TTGAGGAAAA GCGTGAGTTA GAATCGGTGA AACTCATTCG GACAAAGGTT GAGGTACTGT GA
|
Protein sequence | MSIWAPFGTS DFPLNQPMVG QKEFYEIFKG FTKTMKSAGM ATIFPLISKW GVGKSRIGFE LISEPLGMDK GWIINEDGVQ KEVRIFKPNF EDKVLPLYIR YSQMCHPDLI GDNWVAYGIY TALSYLSRES DGSIQGKIME AIQDALSPVG FDRNILGDLL QVDRVNLNEL VINKEKLDEL TRKGMEYVKQ FGIEHFLIVC DELETAGEIA KYGIEKEKEL VNKIDGEAIQ VITSAIKHED PRKKYPEVSY LLLCSTVIGG SIQGIGALDR RTEMYEMLQN SFADISDYIA YLNKKGMIPD YPKGLIEAAY TIAGGNFGWF NVIMYNVDQK MEDSSVKKET GYIFEAILNS SNRFKESLID KPAFDYIQCD DRFRQTIKHA LLEQIPKKKT NYSPEEINAM MDAKAEDGEK LFKEFYCVKL EKDDLAVYLN SQGYKRETGD IFVNNFGSSF DLGILLRSLK TFSLNVKENE YIVGKEEETF LDQVRMLYPK DDIEESARYI YGYILEKIEK EDIKEAEYIG PNFAYLSRLD KRYRVEKDDF GYVPDTEKNK EIEALIKERS KNRKEEVKRV LSGACRALEI NYPEESFYTI NGVECVRTKV EGGPYLDVHE DRIVDIIWGK DEDRLKDALL DSRLLKEGVH PVIVISDSVI SAEYTDKFVK EKYEGIGKCL IFVNITRLQK DILEVMSIDK DILDIRENRN VITSTFRDRI RKIRDHFNLK AREWFEKLDE EGWILRPIIY KKHDEKQIVL LSKAYKRMLI HNLDFEELGS KKDVRLSDAE YTELKNVLKS TAIGRMNEGK GYKETGLFIH EDENRYSINI PACMNRILKF NGNSNKSMAD YSNKFFFSLI HEVKPKRILE QWIEFMMGLN LLLKTKDGFI ERISNYELDG RYSVVKKWLE DDCKKEIDSM KKVINGPYLD VLEKNQVPYY KVQLQEAEKI KDSINVDKLS EKDNKNMENF RDVISKIEEF LEICYQVYDS DGWNNIKTYN PNIIKDIKID DKEKPLWFRV RHIRLFIDYI NTLKDPAVKS ITDKIAEIKK NCEYGGFMLP ISPITNILQK YCNELENSTD YEKMTISGTG TMVSYINTLA YKLKDGDFSG AYKRIEEILD ACGLEPIENN ELKWANDRGI IGEYKAIYKN FTSIIDCYKK LPESKRWVDY FSDAPEKLRN HTEVKNLSNC IKEIELFVNG GLEQEIEDNE PVMLSKPDEF LALLKKRVED MQQYVGLIEG YKTNVMNLAR TKKNEFYDNL LISTIDKICR VQGKPPVSAQ INTEEYPKEE TYAATKEAIQ NKMNLLASEG ESFFKNSPLI KKTTFSFFKH VVEKDGDINW DDYLEEKREL ESVKLIRTKV EVL
|
| |