Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0823 |
Symbol | |
ID | 4810441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1000407 |
End bp | 1001921 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640106240 |
Product | hypothetical protein |
Protein accession | YP_001037251 |
Protein GI | 125973341 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAA TCAAGGATAT CAGAAAAATG TATTTTGAGG AAGGGAAAAA TATCAGCCAG ATAGCCAGAG AAACCGGCCA TGATCGCAAA ACGGTGAGAG CATATCTTGA CAAAGTGGAC TGGAACCAGA AGCCACCGAA AGTGAAGAAG GAAACAGCCT TTCCGAAACT TAATCCATAC AAGGATGACA TTGACACATG GCTAAACGAG GATAAAAAGG CCAGGCGCAA GCAAAGACAT ACAGCAAAAC GAATATACAA CCGGCTGGTG GAAAAGTACG GAGAACGCTT CAACTGTTCC TACAGGACCG TAGCAGGATA TGTAGCTGTG AAGAAAAAAG AGATATTCAA CGCAAGGGAA GGATTCCTGC CTTTAGAGCA CGTACCAGGT GAAGCCCAGG CAGACTTTGG CGATGCTGAC TTTTATGAAA ATGGCAGGCA CTACAGGGGT AAAAGTCTGA CTTTATCATT TCCCCACAGC AACAAAGGAT ATACCCAGCT ATTCAAGGGA GAGAACCAGG AATGCCTGTT CGAGGGCTTG AAGGCGATAT TTGAGCACAT AGGTGGAGTG CCGCCAAGGA TATGGTTTGA TAATGCCAGC ACCATAGTAG CTAAGGTAAT AAAGGGCGGA GGCAGGAACC TGACAGATGA TTTCATGCGT TTCATGGAGC ATTACCGTTT CAAAGCAGTA TTCTGCAATG TAGATGCCGG GCATGAAAAA GGCAATGTGG AGAACAAGGT CGGCTATCAC AGGAGAAACA TGCTGGTGCC GGTACCACGT TTTGAAGACA TTAGTGAATT CAACAAAGAA CTCCTGATTA GGTGTGAAGA AGATGCCAAA AGGCAGCATT ACCGAAAGAA CGGTACGATC GAAGAACTAT ACAGGGATGA TAAGGCAGCC CTGCTGGAGC TGCCCAAGAC AACTTTTGAT ACAAGCAAAT ACATAACAGT GAAGACAAAC GGATATGGCA AATTTCTGCT CAACAAAGGC CTGCACGAAT ATTCCTCAGC GCCAAAATTC GCAAACAAAT ATGTACTGGT CAGGCTGACT GCCTTTCATG TAACAGTGCT TGACGAAAGC CATCGGGAGA TAGTGCGTCA TGAGAGACTC TACGGCGACT ACAAGCAGCA AAGCATGCAA TGGCTGCCAT ATCTGACTCA GCTGGCACGG CGACCGGGGG CATTGAAATA CACAGGTATA TATCAGATGC TGCCACAGCC TGTGAAAGAA TACATGGAAG AGCTAAGCAA GCAAGACAGA GGGAAAGTAT TAAGAGTAAT TGCTGATCTG ACACAGAAGA GCAGCTTCGA AAAGGCCATT AAGACTGTCA GTACTGCCCT GTCCTATGGT GCTGCCGATG TGGACAGCCT GATAAATCTG CACAGATATT TGTATGAAAA AGTGCTGCAG CTGGAGCCGA TACATTTGCC CGAGCATATA CCTCACTTAA ACAGATATGT GCCTGATTTT ATGGCATATG ACAGAAGTCT CAAGGCAGGT GAAGAAAAAT GCTGA
|
Protein sequence | MTQIKDIRKM YFEEGKNISQ IARETGHDRK TVRAYLDKVD WNQKPPKVKK ETAFPKLNPY KDDIDTWLNE DKKARRKQRH TAKRIYNRLV EKYGERFNCS YRTVAGYVAV KKKEIFNARE GFLPLEHVPG EAQADFGDAD FYENGRHYRG KSLTLSFPHS NKGYTQLFKG ENQECLFEGL KAIFEHIGGV PPRIWFDNAS TIVAKVIKGG GRNLTDDFMR FMEHYRFKAV FCNVDAGHEK GNVENKVGYH RRNMLVPVPR FEDISEFNKE LLIRCEEDAK RQHYRKNGTI EELYRDDKAA LLELPKTTFD TSKYITVKTN GYGKFLLNKG LHEYSSAPKF ANKYVLVRLT AFHVTVLDES HREIVRHERL YGDYKQQSMQ WLPYLTQLAR RPGALKYTGI YQMLPQPVKE YMEELSKQDR GKVLRVIADL TQKSSFEKAI KTVSTALSYG AADVDSLINL HRYLYEKVLQ LEPIHLPEHI PHLNRYVPDF MAYDRSLKAG EEKC
|
| |