Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0895 |
Symbol | |
ID | 4810516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1071207 |
End bp | 1072283 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106314 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_001037322 |
Protein GI | 125973412 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000160053 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACCCA ATAGCGATAA AAAAGCAATT TTGCTTGAGC TTATTGAAAA AGGCAAACAA AAAGGAATGC TGACATACCA GGAAATTATG GATGCTTTTG AAGAAGTTGA TATCGATCCG GAACAGATCG AAAAAATTTA TGAGACATTA GAAAACATGG GAATAGATGT AGTAGGAGAT ATCGAAGCGG AAATGGAGGA TATCCAGCTT ACGGAAGATA ATCTGGATCT TTCCATTCCC GAAGGTATAA GTATAGATGA TCCTGTCAGA ATGTATTTAA AAGAGATCGG CAAAGTACCT CTTTTGACTG CAGAAGAAGA GATAGAGCTG GCTCACAGGA TTGAGCAGGG TGATGCCGAA GCCAAAAGAA GACTGGCTGA GGCGAACCTG AGGCTGGTTG TAAGTATAGC CAAGAGGTAT GTCGGAAGGG GCATGCTTTT TCTTGATTTG ATTCAGGAAG GAAATCTCGG GCTTATAAAA GCGGTGGAAA AGTTTGATTA CAGAAAAGGT TTCAAATTCA GTACTTATGC CACATGGTGG ATTAGACAGG CAATTACAAG AGCGATTGCA GACCAGGCAA GAACCATTAG AATACCTGTT CACATGGTTG AAACCATCAA CAAGCTTATA AGAGTTTCCA GGCAGCTTCT TCAGGAGCTT GGAAGGGAAC CTCATCCTGA AGAGATTGCC AAGGAGATGA ATATGCCTGT TGAAAAGGTA AGGGAGATAA TGAAAATATC CCAGGAGCCT GTGTCGCTTG AAACACCTAT AGGTGAAGAA GAAGACAGCC ACCTTGGGGA CTTTATACCT GACGATGACG CTCCTGCACC GTCAGAGGCT GCTGCTTTTA CGCTTTTGAA AGAACAGCTT GTAGACGTTT TGGATACTTT GACTCCCAGA GAAGAGAAAG TTTTAAGGCT TCGATTCGGG CTGGATGACG GACGGGCCAG AACCCTTGAA GAAGTTGGAA AAGAGTTTAA TGTGACAAGG GAAAGAATTC GTCAGATCGA GGCAAAAGCG CTTAGGAAAC TTAGACATCC GAGCAGGAGC AAAAAACTGA AGGATTATTT GGATTGA
|
Protein sequence | MKPNSDKKAI LLELIEKGKQ KGMLTYQEIM DAFEEVDIDP EQIEKIYETL ENMGIDVVGD IEAEMEDIQL TEDNLDLSIP EGISIDDPVR MYLKEIGKVP LLTAEEEIEL AHRIEQGDAE AKRRLAEANL RLVVSIAKRY VGRGMLFLDL IQEGNLGLIK AVEKFDYRKG FKFSTYATWW IRQAITRAIA DQARTIRIPV HMVETINKLI RVSRQLLQEL GREPHPEEIA KEMNMPVEKV REIMKISQEP VSLETPIGEE EDSHLGDFIP DDDAPAPSEA AAFTLLKEQL VDVLDTLTPR EEKVLRLRFG LDDGRARTLE EVGKEFNVTR ERIRQIEAKA LRKLRHPSRS KKLKDYLD
|
| |