Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1094 |
Symbol | |
ID | 4811392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1300520 |
End bp | 1302439 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640106516 |
Product | hypothetical protein |
Protein accession | YP_001037519 |
Protein GI | 125973609 |
COG category | [C] Energy production and conversion |
COG ID | [COG1032] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0145194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTTT TACCTATAAC AAGGGAAGAT ATGAAAAACA GAGGATGGGA TGAACTGGAT TTTCTTTATA TCAGCGGAGA TGCGTATGTG GATCATCCCA GCTTTGGGCA TGCCATAATA ACGAGACTTT TGGAAAGTGA AGGCTACAGG GTGGGGATTG TCGCCCAGCC GGACTGGAGA AAGGACGATG ACTTTCTGGC ATTGGGAAAG CCACGTTTGG CCGTGCTTAT ATCATCGGGA GTAATAGACT CCATGGTAAA CCATTATACT GCGAGTAAAA AGCCAAGAAG CGATGATTTG TACAGTCCGG GAGGAAAAAG CCACAGACGG CCGGACAGGG CGGTGATTGT ATATACCAAC AAAGCGCGGC AGCTTTTCAG GGATGTGCCG GTGATTATCG GCGGGATTGA GGCAAGCCTG AGGAGATTTG CCCATTATGA TTATTGGGAT GACAGGGTCA GACGTTCCAT TCTAGTTGAC TCGAAAGCTG ACCTTTTAAT TTACGGAATG GGAGAAAAAC CGATACTTGA GATTGCCCGG TATCTTTCCA TGGGAGTGCC GATAAAGAAG ATCCAGAATG TAAGGGGAAC CGCTTTTCTG GCAAGAAAAG AGGACTTGCA TGGAGAGTTG AGAAAATTTA TTGATAATTC GGAAGACAAG CCGGAAAAAG GTTATATTCT GCTTCCGTCA TTTGAAGAGG TGTCCACGAG CAAAAGAAAA TATGCCGAGG CTTTTATGAT TCAGTACAAT GAGCAGGACC CTTACACCGG AAGCGTTCTT GTGCAGCCTC ACGGTGACAG GTTTGTGGCT CAGAATCCGC CGGCTTATCC CCTTTCCGAA AAGGAGATGG ACAGGATATA TTCTCTTCCG TATGAAAGGA CTTATCATCC TGTCTATGAC AAAGACGGCG GAGTTCCTGC CATAGAGGAG GTACAGTTCA GCATAACAAG CCACAGAGGC TGTTATGGCG GTTGTTCCTT TTGCGCGTTG AATTTCCACC AGGGCAGGAT AATTCAAAAA CGCAGCCAGG CTTCAATAAT AAATGAGGCA AGAAAGCTTA CATGGCTTCC GGGCTTTAAA GGCTATATTC ACGATGTGGG AGGACCCACG GCCAACTTTA GGAACAAGGC CTGCAAAAAG CAGGAAATTT CCGGTGCGTG CAAGGAAAGG CAATGCCTTT ACCCCAAGCC TTGCAAAAAC CTTATAGTTG ACCACAGCGA ATACCTGGAG CTTTTAAGAA AGCTTCGGGA AATACCGGAA ATAAAAAAGG TTTTTATTCG TTCGGGTATA AGATATGATT ATCTGATGCT GGATAAAAAC GACGATTTCT TTGTCGAACT TTGCCGGCAT CATGTCAGCG GGCAGCTTAA AGTTGCGCCG GAGCATGTGG TGGACCGGGT GCTTGAGAAG ATGGGAAAGC CCCAAAGGGA GGTGTATGAC AGATTCGTCA AAAAGTTTTA TGAGATAAAC AGAAAAATAG GCAAGGAACA GTACCTTGTT CCCTATTTGA TTTCAAGTCA TCCGGGAAGC GACCTTAATG CGGCGATAGA GCTTGCCGAG TACCTGAGGG ATATAAATTA CACGCCTCAG CAGGTACAGG ATTTTTATCC CACGCCCGGG ACATTGTCCA CCTGCATGTT TTATACCGGG CTGGACCCAA GGACGATGAA AAAGGTGTAT GTTCCAAGGT CGCCGAAGGA AAAGGCAATG CAAAGGGCTC TCTTGCAATT TAGAAGGAAG GAAAACTACA AGCTGGTGTA TGAGGCTTTA AAACTTGCCC ACAGAGAGGA TTTAATCGGT TACGGCAGGA AATGCCTCAT AAAGCCTCCG GCTAATCTTT CAAAAAACAA TTTGAAAAAA GACAGCTCAA AAAGAAAATT AAAAAAAGCC GGAAAAAGCA GAAGAAAGAG CTCAAGATAA
|
Protein sequence | MAFLPITRED MKNRGWDELD FLYISGDAYV DHPSFGHAII TRLLESEGYR VGIVAQPDWR KDDDFLALGK PRLAVLISSG VIDSMVNHYT ASKKPRSDDL YSPGGKSHRR PDRAVIVYTN KARQLFRDVP VIIGGIEASL RRFAHYDYWD DRVRRSILVD SKADLLIYGM GEKPILEIAR YLSMGVPIKK IQNVRGTAFL ARKEDLHGEL RKFIDNSEDK PEKGYILLPS FEEVSTSKRK YAEAFMIQYN EQDPYTGSVL VQPHGDRFVA QNPPAYPLSE KEMDRIYSLP YERTYHPVYD KDGGVPAIEE VQFSITSHRG CYGGCSFCAL NFHQGRIIQK RSQASIINEA RKLTWLPGFK GYIHDVGGPT ANFRNKACKK QEISGACKER QCLYPKPCKN LIVDHSEYLE LLRKLREIPE IKKVFIRSGI RYDYLMLDKN DDFFVELCRH HVSGQLKVAP EHVVDRVLEK MGKPQREVYD RFVKKFYEIN RKIGKEQYLV PYLISSHPGS DLNAAIELAE YLRDINYTPQ QVQDFYPTPG TLSTCMFYTG LDPRTMKKVY VPRSPKEKAM QRALLQFRRK ENYKLVYEAL KLAHREDLIG YGRKCLIKPP ANLSKNNLKK DSSKRKLKKA GKSRRKSSR
|
| |