Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1830 |
Symbol | |
ID | 4809814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2169800 |
End bp | 2171014 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107244 |
Product | hypothetical protein |
Protein accession | YP_001038244 |
Protein GI | 125974334 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.034164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTT TAAAAGAACT GCTTAAAAAG AAAGATGGGT CGTTAACTGT AGAGGCTGCA ATTTCATTGC CGCTGTTTAT GTGTGTGTTT TTATCCATAG CCTTTTTCAT GAAGGTTGTG TATATACATA ATAATGTGCA ATATGCAATA AACGGCGCTG CCAATGAAGT TGCAACCTAC AGCTATCTTT ATTCAATTTC CGGCCTCCAG AAGGTGAATG ATGCTATTAC GGAAACAACG GATGAATATG GAACGACTGC ATCAGAGCAT ACAAAAGAGA TTCTGGAAGC TTTTGATGCC TTGGGAGATA TTTCACAGCA GAGTCTGGAA TCGTTTAAAG GACTGGCGGC AGGTGATACA ACACAAATAG ACAAGTTAAA GGAGTTGTAT GAAGAAGGTA AAATTTCAGT GGGGACCGTC CAGAAAGTCA TTGGTGAAGT AAAGGAGAAT CCCAGAAAAG AGTTCATCAG TGTTGCTTCT TTGTTTTTCA GTGCAGGATA TGAAAAAATC AAATCTGAAT TGTCAGAGCC GTTAATTAAG CTCTTTATGA GAAAATACAT TGACGAGAGA ATATTCAACA GCAAGGGTGG ACCGGGAGCT TATATTGTAG TAAAGGAAGG AAAAGACCCG TTAGATGCTT TTAGCTTTAA CAACCGGATA TTTACCGACA ATAAGAGCAT AGATATAAGA GTAAAATATA AGATAAAGAC TTCGTTGCCT ATAAACATTC TTCCTGAAAT CAGCATCGAG CAACGGGCAA CTGTCAGAGG ATGGATGGAC GGAGATAAAT CGGCACCGGT AAAAGAGGAA CCAAAAGAAG AATCTTTATG GGATAAAGCG CCTTTTGAGT ACGGTAAAGT GATTACCGAG AAAGAACTTG AAAAGTATCC GGACAAATAT CCGAATTCCG GGCATATATA TGAAGTCAGG AGTATCAATT TGGATTGCGA AACGTATAAA GATATTAAAA AAGCAAAGAG CTCTTTAAAG AGCAGTATTA ATAAATTTAG TTCGAAAACT AAAGATGTCG CTGAAATTAC TTCAAGGACA TTTATTATAG TGATACCGGA AGGGACATTG ACAGACGAAA TCAAGGCAAT GTTGGAAGAA CTGAAAAGTG AAGCGGCTTC CGGGACACCT TCGATAGAGG TGATTTATAA AGAAGGATAT GGAAGACAAA GTAATGTAAG TGACAGCAGC GAAGAAGAAA AGTAA
|
Protein sequence | MNILKELLKK KDGSLTVEAA ISLPLFMCVF LSIAFFMKVV YIHNNVQYAI NGAANEVATY SYLYSISGLQ KVNDAITETT DEYGTTASEH TKEILEAFDA LGDISQQSLE SFKGLAAGDT TQIDKLKELY EEGKISVGTV QKVIGEVKEN PRKEFISVAS LFFSAGYEKI KSELSEPLIK LFMRKYIDER IFNSKGGPGA YIVVKEGKDP LDAFSFNNRI FTDNKSIDIR VKYKIKTSLP INILPEISIE QRATVRGWMD GDKSAPVKEE PKEESLWDKA PFEYGKVITE KELEKYPDKY PNSGHIYEVR SINLDCETYK DIKKAKSSLK SSINKFSSKT KDVAEITSRT FIIVIPEGTL TDEIKAMLEE LKSEAASGTP SIEVIYKEGY GRQSNVSDSS EEEK
|
| |