Gene Cthe_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2120 
Symbol 
ID4810980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2518811 
End bp2519572 
Gene Length762 bp 
Protein Length253 aa 
Translation table11 
GC content36% 
IMG OID640107527 
Productputative RNA polymerase sigma factor SigI 
Protein accessionYP_001038520 
Protein GI125974610 
COG category[K] Transcription 
COG ID[COG1191] DNA-directed RNA polymerase specialized sigma subunit 
TIGRFAM ID[TIGR02895] RNA polymerase sigma-I factor
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATTGGC ATTTTCAAGG TACGAACGAC GACAGGGAAC ATACAAAAAG GATTATTATA 
GAGTATCTGA ACAGAATAAA AGCCGGGGAT GATTCTGCAA GGGAAGAGTT TATCCTGAGG
TTTAGGCCTT TTATATTAAA ATTGGTGTAT AAGGCGACTG ACAGGCATGT TGAGCCGGAA
AACAGTGAAG AATACAGCGT TGCATTATTG GCTTTCAATG AAGCCATCAA TGCTTATGAT
GAAGAGAAGC ATTCTAACTT CCTTGTTTTC TCAGAACAGG TTATTAATAG AAGACTTATT
GACTATAAAA GAAAAAATCA TAAGAATAAA ATGGTTTATC CTTTTTCTTA CTTTGAAAAC
GAAGATATCA AACTTGAAAG AACTCTTTCG GATGCTGACG GCAACAATGC AATTGAAAGA
TTGGAATTTA CGGACGAGAT TAGACTTTTC AAATCTGAGC TGGCCTCCTT TGATATAACT
TTTAAGGATT TGCTCTCCTG TACTCCAAAG CACAGAGATT CGAGAGAGCT TTTGATAAAT
ATTGCAAAAA AAATTGCAAG TAATGACGGG CTTTATGAAA AGCTAAAAAA AACCAAAAAG
TTGCCCACAT TGGAACTGTT GAAACTGGCA AAAGTTAGCA GAAGGACTAT AGAAAGAAAT
AAAAAATATA TAATTGCAGT AAGCTTGATA TTAAGGAGCA ACCTCGAAAT CTTCAAGGAG
TATGCTGCAG GTATCCAGGA AAAGGAGGTG GATTTGCGGT GA
 
Protein sequence
MDWHFQGTND DREHTKRIII EYLNRIKAGD DSAREEFILR FRPFILKLVY KATDRHVEPE 
NSEEYSVALL AFNEAINAYD EEKHSNFLVF SEQVINRRLI DYKRKNHKNK MVYPFSYFEN
EDIKLERTLS DADGNNAIER LEFTDEIRLF KSELASFDIT FKDLLSCTPK HRDSRELLIN
IAKKIASNDG LYEKLKKTKK LPTLELLKLA KVSRRTIERN KKYIIAVSLI LRSNLEIFKE
YAAGIQEKEV DLR