Gene Cthe_0155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0155 
Symbol 
ID4808643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp193793 
End bp194854 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content42% 
IMG OID640105566 
Producthypothetical protein 
Protein accessionYP_001036589 
Protein GI125972679 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.22307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATCACA ACTTTTCAAT TAAAAAGGCG GTATTATTGT TGGGAATATT GGCGCTTGTA 
ATGTTCTCTG TTTCCTGCGG GCGGAGCGAC AATACGGAAA ATAACCCGGA TGCAAGCCCG
GAACCCACGG AGGAAAATGT GAATTTTGAT TATGTGGAGC AGCCAAGTCC GGAACCCGAA
CAAGAGGTAA ACTTTGTTTT TCCCGAAAAA GGAGTAAGAC CTTATGCCGT AATGATTGAC
AACCAGGGAG AAAAATGTCT TCCGCAGGGA GGATTGAGCC AGGCCCAAGT TATTTATGAG
GTAATTGTTG AGGGTGGCAT AACAAGGTTC ATGCCTGTGT TCTGGGGTCA GAAAACGGAA
CTCATAGGGC CTGTGAGAAG TGCGCGCCAC TATTTCCTTG ACTATGCATT GGAACATGAC
GCAATTTATG TACATATCGG ATGGAGTCCT ATGGCAATGG CGGATATACC CAAGCTTGGG
GTAAACAATA TAAACGGTGC TTACGGAGTT TTCTGGGACA TTACAAACGA CAAATCAAAC
TGGCAGGATA CTTACACGTC AATGGAAAAA CTGGAGGAAT ACGCAAAAAA GGTTAATTAC
AGAACTACTA CGGACAAAGA AATGGTGTTT AAATATCATA ACAGGGACCA GGAGCTTGAA
GGAGGCAAAA AGGCTGAGAA GATAAACCTC TCATATTCTG GCGAGTACAA ATCCTATTAT
GAATATGATG CCGACAAAAA GCTCTATCTG AGATTCAGAA ACGGAAAGCC CCATATTGAA
AGGCAGACTG GAGAGCAGCT CACCACAAAG AACATTATTA TACAGAAAGT CAGAAATTAT
GACATAAAAG GTGACCAGTA CGGCAGGCAA AATCTTGATA CAGTTGGCAG CGGAGAAGGC
TATTATATCA CAAACGGAAA GTGCATTGAA ATTAAGTGGT CGAAAGCTTC CAGAACGGAA
AAGACAAAAT ATTTGGACGG CGACGGCAAA GAAATAGTTT TAAATCCCGG TCAGACATGG
GTTCAGATAT TCCCTGTATC GGGTAAAATT GAAATAGAAT AA
 
Protein sequence
MYHNFSIKKA VLLLGILALV MFSVSCGRSD NTENNPDASP EPTEENVNFD YVEQPSPEPE 
QEVNFVFPEK GVRPYAVMID NQGEKCLPQG GLSQAQVIYE VIVEGGITRF MPVFWGQKTE
LIGPVRSARH YFLDYALEHD AIYVHIGWSP MAMADIPKLG VNNINGAYGV FWDITNDKSN
WQDTYTSMEK LEEYAKKVNY RTTTDKEMVF KYHNRDQELE GGKKAEKINL SYSGEYKSYY
EYDADKKLYL RFRNGKPHIE RQTGEQLTTK NIIIQKVRNY DIKGDQYGRQ NLDTVGSGEG
YYITNGKCIE IKWSKASRTE KTKYLDGDGK EIVLNPGQTW VQIFPVSGKI EIE