Gene Cthe_1910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1910 
Symbol 
ID4810768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2271729 
End bp2272976 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content32% 
IMG OID640107327 
Producthypothetical protein 
Protein accessionYP_001038322 
Protein GI125974412 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000707151 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATGA AAAAAAGCGA CTTGAGTCTT AGAGAAGTTC TGGAAGTAAT ATTTAGTGTT 
GCCAATAAGA GGGCCGGTTT TTTTGCTGAT AGTTATCTAT TTAAAGACCC ATCTATCATT
TCCAAATGGA AGATCGGAAG AGCATTGCCC AGTAATGATG ACATTGAAAA GATTGTTACC
TTTACAGTAA ATGAGATGAC AGGAAACCAG CATAAGATAC TTAGAAGCGC TCTCGAAGAT
TTAATTCGAA ATTCCATTAT TGACAAAGAT ATCACTATTG ACAATGAAAT TAAAGAATCA
TTGTTAAGCA TAGAAGATTT TAAGAATTTT CTGTCGGAAG TATTGAGGAT TGCCAAAACT
GTTAACCGCA GCAACAAGCA AATCAATGAC AAATTTAGTG GCAATTTATG TGGTATCAGT
GGGACTGATG AAAAAGAAGG GTCTGACAGT AGTGGTAAAA ATATAGTTCT TGACAACGCT
GTTATTGTAT CCTCAGAAGA TATGGAAGGA ACATATTCCG GAATAGTTGA GTTTAATATG
CGGCTTTTAA AAAAGAAGGA TAGAAGCCTT AAGAATACGG AAAGTCCGGA CATACATATC
AACAGGAATG AGAATTATAT TGCAACAGAC AAAGCTGGTA AAGTGAAAGG ACGTATTACT
GCAAAAAGCT TGATTGGTAC TGTTATTGTA GGGATTATTT CAAGTTTGTG TGTTATTCAA
ATGGTAAACA GCTTGAAATT AACAGATAAA GTGCCGGAAG TATATGCCGT GGAGAGTTTG
GCAAAAGAAT TTTCATCTTT GGACAACTTA AATCCGAAAT TGGAAGTAAT AAAACAAAAT
AATTATGATA ACTCAAATCA GAAAGAGAGT ACTGTCAAAG AAAATAATGA ATCAGATGTT
TGTTTTAGCA ATGATAATGT CATAGAGGAA AACAAAGGAG AAGATGAAGG AAAAAATAAA
AATATAGAAG AAAGCAAAAG AGAAAATGAA AGTGAGAGCA AAGAGAAAAA AGAAGAAAGT
GAAAAGAAAA ATAAAGACAA AAACAAAGAG GAAAATAAAG AAGAAAAGAA AGAGGCTGTC
AGTGAAGCAA ACCAGTATAT AAAGGACAAT ACAATAGACA ATTCGACAGT TGTTGATGTC
AATAATGGTT TAATAAACAG TTCAATAGTT ATTAATGGCG ATAACAACAA CATAATTAAT
GGACATAATA TATTTTTCAA TTATGAAAAT AAGAGCGATT CAAATTGA
 
Protein sequence
MKMKKSDLSL REVLEVIFSV ANKRAGFFAD SYLFKDPSII SKWKIGRALP SNDDIEKIVT 
FTVNEMTGNQ HKILRSALED LIRNSIIDKD ITIDNEIKES LLSIEDFKNF LSEVLRIAKT
VNRSNKQIND KFSGNLCGIS GTDEKEGSDS SGKNIVLDNA VIVSSEDMEG TYSGIVEFNM
RLLKKKDRSL KNTESPDIHI NRNENYIATD KAGKVKGRIT AKSLIGTVIV GIISSLCVIQ
MVNSLKLTDK VPEVYAVESL AKEFSSLDNL NPKLEVIKQN NYDNSNQKES TVKENNESDV
CFSNDNVIEE NKGEDEGKNK NIEESKRENE SESKEKKEES EKKNKDKNKE ENKEEKKEAV
SEANQYIKDN TIDNSTVVDV NNGLINSSIV INGDNNNIIN GHNIFFNYEN KSDSN