Gene Cthe_0369 details


Gene Information

Locus tag: Cthe_0369
Symbol:
ID: 4808446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 463483
End bp: 464706
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 40%
IMG OID: 640105783
Product: hypothetical protein
Protein accession: YP_001036800
Protein GI: 125972890
COG category: [S] Function unknown
COG ID: [COG1641] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00299] conserved hypothetical protein TIGR00299
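
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the gene information above.
start_bp = 463483
end_bp = 464706

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1224 bp
print(protein_length, "aa")  # 407 aa
```

Under these assumptions the computed values agree with the listed gene length (1224 bp) and protein length (407 aa).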


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.299825
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATAC TGTATTTTGA CTGTTTTGCG GGTGCCAGCG GTGATATGAT TCTGGGTGCC 
CTGCTGGATT TGGGAATTGA TGTCGGAATT TTTAAAAGGG AGCTTGCAGG ATTAAATTTG
GACGGTTTTG ATATTGCTGT TGAAAAAAAA GTAATAAACT CAATAGCCGT AACCGATGTG
AATGTTATTG TAAAAGAGGA ATGTAATCAT CATACCGGAC ACCATCATCA TTGTGAGCGC
AATTTGGCGG ATATTGAGAA AATAATTGAC GAAAGCAGCC TGAAAGACAA TGTAAAAAGG
CTTAGCAAAA AGATATTTTC AGAAATTGCC CGGGCTGAGG CAAAGGTCCA CAACAAATCC
ATTGAAGATG TGCACTTTCA TGAAGTCGGT GCCATTGATT CCATTGTTGA TATTGTAGGT
ACTGCAATTT GTCTGGACCT TTTGAAAGTT GACAAAATAT ACTCGTCACC GATGCATGAC
GGCACGGGCT TTATAGAGTG TCAGCACGGA AAACTGCCGG TCCCGGTTCC TGCGGTTTTG
GAAATGCTTA AGGAAAGCAA TATACCTTAC ATAACCGAGG ATGTGAACAC GGAATTGTTA
ACTCCGACGG GCCTCGGAAT TATAAAATGT GTGGCTTCAA AGTTTGGCCC CATGCCCCCG
ATGACTATTG AAAAAGTCGG ATATGGGGCA GGCAAAAGAC AGACGGGGCG TTTTAATGCC
TTAAGGTGTA TTTTGGGAAA TGCTAAAGAA AAAGAAAAAA TTGATGATGA AATTTGTATG
CTTGAGACAA ATATTGACGA TATGAATCCG GAGATTCTTG GCTATGTTAT GAACAGGCTT
TTTGAGAACG GTGCACTGGA TGTATTCTAT ACGCCCGTTT ACATGAAAAA GAACAGACCG
GGAGTTTTAT TGACGGTGCT TACGGACAAG GAGCATGAAG AGAAGCTTGT GGATATTATT
CTGACAGAAA CGACCACTTT GGGAATCAGA AAGACCACCG CCCAAAGATA TGTTCTTGAA
AGGGAAATAA AACATGTGAA TACTGAGTTT GGGAAAATAA GAGTGAAAGA GTCGTCCTTT
GGCGATTACA AGAAATATTC GCCGGAATTT GAAGACTGTA AAAAAGTGGC CCAGGAATTG
AAAATACCGC TGTCAAAGGT ATATGATGCC GTAAACAAAG CTATTTTAGT ATTTGAAGAA
AGGAATGAAA ATGCTTTACA ATAA
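
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tooling, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, as displayed in the record.
first_line = "ATGAGAATAC TGTATTTTGA CTGTTTTGCG GGTGCCAGCG GTGATATGAT TCTGGGTGCC"
seq = clean_sequence(first_line)
print(len(seq))                    # number of bases in this line
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Passing the full sequence block to the same two functions would give a value to compare with the listed gene length and GC content.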
 
Protein sequence
MRILYFDCFA GASGDMILGA LLDLGIDVGI FKRELAGLNL DGFDIAVEKK VINSIAVTDV 
NVIVKEECNH HTGHHHHCER NLADIEKIID ESSLKDNVKR LSKKIFSEIA RAEAKVHNKS
IEDVHFHEVG AIDSIVDIVG TAICLDLLKV DKIYSSPMHD GTGFIECQHG KLPVPVPAVL
EMLKESNIPY ITEDVNTELL TPTGLGIIKC VASKFGPMPP MTIEKVGYGA GKRQTGRFNA
LRCILGNAKE KEKIDDEICM LETNIDDMNP EILGYVMNRL FENGALDVFY TPVYMKKNRP
GVLLTVLTDK EHEEKLVDII LTETTTLGIR KTTAQRYVLE REIKHVNTEF GKIRVKESSF
GDYKKYSPEF EDCKKVAQEL KIPLSKVYDA VNKAILVFEE RNENALQ
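
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; translation table 11 uses the same amino acid assignments as the standard genetic code and differs in which start codons are permitted, which this sketch does not model. The `translate` function is a hypothetical helper written for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same assignments as table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence shown above.
print(translate("ATGAGAATAC TGTATTTTGA CTGTTTTGCG"))  # MRILYFDCFA
print(translate("AGGAATGAAA ATGCTTTACA ATAA"))        # RNENALQ
```

The two fragments reproduce the first ten and last seven residues of the listed protein sequence.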