Gene Cthe_0895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0895 
Symbol 
ID4810516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1071207 
End bp1072283 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content43% 
IMG OID640106314 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001037322 
Protein GI125973412 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000160053 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACCCA ATAGCGATAA AAAAGCAATT TTGCTTGAGC TTATTGAAAA AGGCAAACAA 
AAAGGAATGC TGACATACCA GGAAATTATG GATGCTTTTG AAGAAGTTGA TATCGATCCG
GAACAGATCG AAAAAATTTA TGAGACATTA GAAAACATGG GAATAGATGT AGTAGGAGAT
ATCGAAGCGG AAATGGAGGA TATCCAGCTT ACGGAAGATA ATCTGGATCT TTCCATTCCC
GAAGGTATAA GTATAGATGA TCCTGTCAGA ATGTATTTAA AAGAGATCGG CAAAGTACCT
CTTTTGACTG CAGAAGAAGA GATAGAGCTG GCTCACAGGA TTGAGCAGGG TGATGCCGAA
GCCAAAAGAA GACTGGCTGA GGCGAACCTG AGGCTGGTTG TAAGTATAGC CAAGAGGTAT
GTCGGAAGGG GCATGCTTTT TCTTGATTTG ATTCAGGAAG GAAATCTCGG GCTTATAAAA
GCGGTGGAAA AGTTTGATTA CAGAAAAGGT TTCAAATTCA GTACTTATGC CACATGGTGG
ATTAGACAGG CAATTACAAG AGCGATTGCA GACCAGGCAA GAACCATTAG AATACCTGTT
CACATGGTTG AAACCATCAA CAAGCTTATA AGAGTTTCCA GGCAGCTTCT TCAGGAGCTT
GGAAGGGAAC CTCATCCTGA AGAGATTGCC AAGGAGATGA ATATGCCTGT TGAAAAGGTA
AGGGAGATAA TGAAAATATC CCAGGAGCCT GTGTCGCTTG AAACACCTAT AGGTGAAGAA
GAAGACAGCC ACCTTGGGGA CTTTATACCT GACGATGACG CTCCTGCACC GTCAGAGGCT
GCTGCTTTTA CGCTTTTGAA AGAACAGCTT GTAGACGTTT TGGATACTTT GACTCCCAGA
GAAGAGAAAG TTTTAAGGCT TCGATTCGGG CTGGATGACG GACGGGCCAG AACCCTTGAA
GAAGTTGGAA AAGAGTTTAA TGTGACAAGG GAAAGAATTC GTCAGATCGA GGCAAAAGCG
CTTAGGAAAC TTAGACATCC GAGCAGGAGC AAAAAACTGA AGGATTATTT GGATTGA
 
Protein sequence
MKPNSDKKAI LLELIEKGKQ KGMLTYQEIM DAFEEVDIDP EQIEKIYETL ENMGIDVVGD 
IEAEMEDIQL TEDNLDLSIP EGISIDDPVR MYLKEIGKVP LLTAEEEIEL AHRIEQGDAE
AKRRLAEANL RLVVSIAKRY VGRGMLFLDL IQEGNLGLIK AVEKFDYRKG FKFSTYATWW
IRQAITRAIA DQARTIRIPV HMVETINKLI RVSRQLLQEL GREPHPEEIA KEMNMPVEKV
REIMKISQEP VSLETPIGEE EDSHLGDFIP DDDAPAPSEA AAFTLLKEQL VDVLDTLTPR
EEKVLRLRFG LDDGRARTLE EVGKEFNVTR ERIRQIEAKA LRKLRHPSRS KKLKDYLD