Gene Information |
Locus tag | Cthe_1481 |
Symbol | |
ID | 4810631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1800208 |
End bp | 1802520 |
Gene Length | 2313 bp |
Protein Length | 770 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106902 |
Product | membrane protein-like protein |
Protein accession | YP_001037903 |
Protein GI | 125973993 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats |
|
|
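The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, not something the record states. A minimal sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1800208, 1802520)
print(length, protein_length_aa(length))  # prints: 2313 770
```

Both values match the Gene Length (2313 bp) and Protein Length (770 aa) fields of this record.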
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0291099 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTA AAATGATAAA AATCATTTCT CTTTGCCTGT GTATAGCCAT CCTTGCCAGC GGAGCGGCTT ACGCCCTCGC ATCAGGTAAA GACAAAGCTG AAAATGTTAA AACAGCGGAA AACATTACGG AAAACACTGC AGAAAAACAA CATGAAGAAG CTGCGGCAAA AAGCGATACT TTCAAAGATG AAACAGTATA TGTTCTTGCA AACCCTGACG GTACGGTGCA AAAAATTATT GTAAGCGACT GGATAAAAAA CACGCTTTCA AGTGAAAAAA TCACTGATGT AACCGAACTT GAAAATACAG AAAATATCAA AGGTGACGAA AGTTATACCC TGGGAGGTAA CAACACCCGT GTATGGGATG CGCAGGGAAA TGACATCTAT TATCAGGGCA CAATTGAAAA AGAACTGCCG GTGGATCTTT CGGTTTCTTA CAAACTTGAC GGAAAAACAG TATCCGCAGA CGAGCTCATA GGCAAAAGCG GCAAAGTAAC CATCCGCTTC GACTACAAAA ACAAGCAATA TGAAACCGTA AAAATAAACG GCAAAGATGA AAAAATCTAC GTTCCCTTTG TAATGCTGAC GGGAGTGCTC CTTGACAACG ACAACTTTAC CAATGTTCAG GTTTCAAACG GAAAAATCAT TAACGATGGA AACAAAACGG CAGTTTTAGG GTTTGCTTTA CCGGGGCTTC AGGAAAATCT TGCAATAGAC AAGGAAAAGT TTGAAATACC GTCTTATGTG GAGATAACAG CCGATACCAC CGATTTTTCT CTGGGAATTA CCGTTACTGT AGCAACCAAC TCATTATTTA ACAATATAGA TGTGGAAAAA ATTGATTCGA TATCTGATTT GACCGATTCA ATGAATGAAC TTGATGATGC CATGACAAAA CTTCTTGACG GTTCTTCTTC GCTGTACAAC GGCATCTGCA CGCTTCTTGA CAAGTCAAAG GAACTTGTTG AAGGAATTAA TCAACTTGCA GAGGGTGCCG GAAAGCTGAA AGAAGGAGCA TATTCCCTTG ACGAGGGAGC GAAAAAACTG TATACCGGAG CGGCAGCGTT GTCACAGGGC CTTGATACGC TTTCGGCAAA CAATGATACA TTAAACGACG GTGCAAAAAA AGTTTTTGAA ACTATCCTTT CCACCGCCGA GACACAATTA AAAGCATCCG GGCTTACAGT TCCTGAATTG ACCATCGAAA ATTACGCTTC AGTACTCAAT GAAATTATTG AATCCCTTGA CAGTACAAAA GTATATAATC AGGCTCTCGA ACAGGTTACA GCAGCCGTTG AAGAAAAACG GGACTATATC AAATCACAGG TCGTTGAAGC CGTGCGTTCG GAAGTCGAAG CCAATGTCAC CGCCGCAGTT ACGGAACAGG TTAAAGCAGA GGTTACAAAA GCAGTAAAAG AACAAGTGTC GGAAAAGGTC ACTGCAAATG TTCGTGAGAA CGTTGAAGAA CAAGTAATTC TTGCTGCAAC AGGTATGAAC AAGGCAAGCT ATTATGCCGC AGTTTCAGCC GGACGCGTGG ATGCGGCAAC ACAAAAGGCC GTCAAATCCG CCATTGACAA CAAAATGGCA AGTAAAGAAA TTACCGATTT AATCTCATCA AATATTGACA AACAAATGCA GAGCGAGCAA GTTTCCGCAA TGATTTCGCA AAAAGTCGAT GAACAGATGC AGACAGAAGA AATTAAGAAT ACCATTAAGA AAAATGCTGA GCTAAAAATG GCAGAAAAGG ATATTCAGCA ACTGATTGAA CAAAATACAG AAGCACAGAT ACAAAAAGTC 
ATATCCGAAA ACATGGCAAG TGAAGAAGTG CAATCAAAAC TTGCTGCAGC ATCGGAAGGG GCAAAATCAG TTATTGCGCT CAAAACTTCC CTTGACGACT ACAATTCATT CTATCTTGGA CTTATGAAGT ATACTGCAGG TGTCGCCCAG GCGGCAAGCG GTGCTTCTGA TTTGAAAAAC GGAGCAAATG AACTGCAGAA CGGAACGGGT ACTTTATATA AGGGTGTTTG CTCCCTTTAT GACGGAATAC TGACAATGAA AAACGGTCTG CCTGCACTCG TAGACGGTAT CACACAGCTT AAAGACGGAG CAATGCAGCT TTCAGACGGG CTTTCAAGGT TCTATGAAGA AGGTATTCAG AAATTAATCG ATACGGTCGA TGATGCCGAA AATCTTATCG AACGGCTGAA AGCTACCGTA ACCGTATCAA AGAATTACAA GTCTTTTGCC GGAATAAGCG AAAACGCTGA CGGTAATGTC AAGTTTATTT ACCGCACAGA CGAGATTAAA TAG
|
Protein sequence | MKSKMIKIIS LCLCIAILAS GAAYALASGK DKAENVKTAE NITENTAEKQ HEEAAAKSDT FKDETVYVLA NPDGTVQKII VSDWIKNTLS SEKITDVTEL ENTENIKGDE SYTLGGNNTR VWDAQGNDIY YQGTIEKELP VDLSVSYKLD GKTVSADELI GKSGKVTIRF DYKNKQYETV KINGKDEKIY VPFVMLTGVL LDNDNFTNVQ VSNGKIINDG NKTAVLGFAL PGLQENLAID KEKFEIPSYV EITADTTDFS LGITVTVATN SLFNNIDVEK IDSISDLTDS MNELDDAMTK LLDGSSSLYN GICTLLDKSK ELVEGINQLA EGAGKLKEGA YSLDEGAKKL YTGAAALSQG LDTLSANNDT LNDGAKKVFE TILSTAETQL KASGLTVPEL TIENYASVLN EIIESLDSTK VYNQALEQVT AAVEEKRDYI KSQVVEAVRS EVEANVTAAV TEQVKAEVTK AVKEQVSEKV TANVRENVEE QVILAATGMN KASYYAAVSA GRVDAATQKA VKSAIDNKMA SKEITDLISS NIDKQMQSEQ VSAMISQKVD EQMQTEEIKN TIKKNAELKM AEKDIQQLIE QNTEAQIQKV ISENMASEEV QSKLAAASEG AKSVIALKTS LDDYNSFYLG LMKYTAGVAQ AASGASDLKN GANELQNGTG TLYKGVCSLY DGILTMKNGL PALVDGITQL KDGAMQLSDG LSRFYEEGIQ KLIDTVDDAE NLIERLKATV TVSKNYKSFA GISENADGNV KFIYRTDEIK
|
| |
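The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. The sketch below is illustrative only: the function names are hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). It also strips the spaces that separate the 10-base blocks in the record.

```python
from itertools import product

# Codon order follows the NCBI genetic-code layout: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing symbols other than A, C, G, T.
    """
    dna = clean_sequence(dna_text)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAAAGTA AAATGATAAA AATCATTTCT"))  # prints: MKSKMIKIIS
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence (both lines joined) to `translate` would be the way to compare against the complete 770-residue entry.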