Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1269 |
Symbol | |
ID | 4809774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1543014 |
End bp | 1544297 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106692 |
Product | hypothetical protein |
Protein accession | YP_001037694 |
Protein GI | 125973784 |
COG category | [S] Function unknown |
COG ID | [COG2006] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCAA AGGTAGCAAT TGTCAGAACC AAACCGTCCA CGGTACTTGA AGATTATCAT AGACTGATGA ATCTTGCCGA ATATCAAAAC TATATTTCCA AAGACGTTGA CACTGCACTT AAAATAAATA TAAGCTGGCA TTTTTTCTAC CCGGGAAGTT CAACAACGCC CTGGCAGCTG GAAGGTGTTA TCCGGGCTTT AAAACGTGAC GGATATAATC CTGAGCTTAT TCACGGCTGC CACAACAGAA CGGTGGTAAT TGACGCTCAC CTGGGAGAGC GGGAAAACAA ACAGATTAAT GTTATAAAAG CCCACAATCT TCGAAACATT CATCTCTACG AAGGCGAGGA ATGGATTGAC ATAAGGGAGG CAGTGGGAGA TCTTACAAAA AAATTTTTGT GTCTGAACGA AGTTTATCCG AAAGGATTTT CCATACCCAA AAGGTTTATT GGTGAAAACA TTATACATCT TCCCACTGTG AAAACCCACG TATTTACCAC CACTACCGGA GCAATGAAAA ACGCTTTCGG CGGACTTTTA AATGAAAAAA GGCATTGGAC TCACCCGGTG ATTCATGAAA CTTTGGTGGA TCTGCTTATG ATACAAAAAA AGATCCACAA AGGGATTTTT GCAGTTATGG ACGGAACTTT TGCAGGAGAC GGACCGGGTC CCCGCTGTAT GGTTCCCCAT GTAAAAAACG TGCTTTTGGC TTCGGCGGAT CAGGTAGCTA TCGATGCCGT GGCGGCAAAA CTGATGGGTT TCGACCCGCT AAAGGACTGT AAATACATCC GTTTAGCCCA TGATGCAGGC CTTGGCTGTG GCGACGTAAG ACAAATCGAA ATTGTCGGAG ATGTTGACGC CTTGAATGAA AACTGGAATT TTGTCGGCCC CTATAAAAAA ATGACCTTTG CAAGCAAATG CCAGCACCTG ATTTACTGGG GACCTTTGAA AAAGCCGGTG GAATGGACAT TAAAAACAAT CCTGGCCCCC TGGTCCTACA TTGCCAGTGT TGTTTATCAT GATATGTACT GGTATCCGAA AAACTATGGC AGGGTCGAGG AAATTCTAAA TTCGGACTGG GGACGGTTGT TTGCAAACTG GGAGCAGCTT CAGCTGCCTG CGGATGACCT GTCGGTTCCG GGCTGGGAGC ACGTCGGAGA CAAACCGCTA AAACTTGACA AAGAAACAAG TAAAATGATA CGCAAAGCTT TCAGAGTTCT TGGAACCGCA ATCAGGGAAG CTCCGGAGTT TAGCGCAAAG AAATCAAAAA AAGCATGCAA ATAA
|
Protein sequence | MKSKVAIVRT KPSTVLEDYH RLMNLAEYQN YISKDVDTAL KINISWHFFY PGSSTTPWQL EGVIRALKRD GYNPELIHGC HNRTVVIDAH LGERENKQIN VIKAHNLRNI HLYEGEEWID IREAVGDLTK KFLCLNEVYP KGFSIPKRFI GENIIHLPTV KTHVFTTTTG AMKNAFGGLL NEKRHWTHPV IHETLVDLLM IQKKIHKGIF AVMDGTFAGD GPGPRCMVPH VKNVLLASAD QVAIDAVAAK LMGFDPLKDC KYIRLAHDAG LGCGDVRQIE IVGDVDALNE NWNFVGPYKK MTFASKCQHL IYWGPLKKPV EWTLKTILAP WSYIASVVYH DMYWYPKNYG RVEEILNSDW GRLFANWEQL QLPADDLSVP GWEHVGDKPL KLDKETSKMI RKAFRVLGTA IREAPEFSAK KSKKACK
|
| |