Gene Cthe_1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1301 
Symbol 
ID4809553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1578033 
End bp1579280 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content41% 
IMG OID640106724 
Producthypothetical protein 
Protein accessionYP_001037726 
Protein GI125973816 
COG category[R] General function prediction only 
COG ID[COG1323] Predicted nucleotidyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTC TGGGGTTGAT TGTGGAATAT AATCCTTTTC ACAACGGTCA TCTTTACCAT 
CTTGAGGAGT CAAAAAAAAT AAGCGGGGCT GACTTTGTCG TATGCGTCAT GAGCGGCAAT
TTCATTCAGC GCGGAGAGCC GGCAATTGTA AACAAGTGGG CAAGAACAAA AATGGCCTTG
TCAGCAGGAG CAGACCTTGT AATTGAGCTT CCCCTTTCCT GTGCCATGGC CAGTGCCGAA
TACTTTGCCT CCGGCGCCGT AAGAATTTTA AATGATATAG GAATAGTTGA CTATATTTGT
TTTGGAAGTG AACACGGCGA TGTCAAGACT CTCGATTATA TAGCCCAAAT TCTTGTTGAA
GAGCCTGAAA GTTACAAATC TTTCCTGAAA GAAGAACTGG ACAATGGCCT GTCATATCCT
GCCGCCCGCG AATCAGCCCT GAAGAAATAC ACCGCACATA GCATTAATAT CCCGCAAATA
ATCTCCTCAT CAAACAACAT ACTGGGTATA GAATATTTAA AGGCGTTAAG ACGCATAAAA
AGCAGCATAA TACCTCTTAC AATAAAGCGC ATTAACAATG ATTACAACAC GGAAAATATC
ACCGGAAGCA TTTCCAGCGC ATCATCCATA AGAAAATATA TTTCAACCTC AAATTCAACC
TCTTTTGATG ACGTTCTTGC CATGACAATG CCCAAAACAA GCGTCGATAT ACTTTTTGAA
GAATTCAGTG CCGGAAGGGG GCCGGTTTTT AAAGAGGATT TTTATCCTGT TGTAACTTCC
CTCATACGAA AAATGACGCC GGAACAAATC AGAAATTTTG CTTATGTTTC GGAAGGCCTT
GAAAACAGGA TAAAAAGTGC CGCCGATACC GCAGGTACAT ATGAAGAGCT GGTGGAAAGC
ATATGCACCC GAAGATACAC CAAAACCAGA GTGCAAAGAA TCCTGATGGG CATACTTATG
GGAGTAACCT CGAAGGATTT GGACATGCTA AGCCGTTTTG ACAGTCCTCA ATATGCAAGG
ATTCTAGGCT TTAATTCAAA AGGAAAACAG CTTCTTTCCC AAATAAAGAA AAAATCATCA
ATACCTCTGG TGTTAAAGTT GTCTGATTTC ATAAAATCCT GTGATCCGGT GCTGAAAAGA
AAGCTTGAAT TGGAGATACT TGCCACCGAC CTTTATGTGA TGTGCTATAA AAATCCTGCC
TTTAGAAAAG CCGGCCAGGA GTTTACTCAA AATATCATCA TTATGTAA
 
Protein sequence
MKVLGLIVEY NPFHNGHLYH LEESKKISGA DFVVCVMSGN FIQRGEPAIV NKWARTKMAL 
SAGADLVIEL PLSCAMASAE YFASGAVRIL NDIGIVDYIC FGSEHGDVKT LDYIAQILVE
EPESYKSFLK EELDNGLSYP AARESALKKY TAHSINIPQI ISSSNNILGI EYLKALRRIK
SSIIPLTIKR INNDYNTENI TGSISSASSI RKYISTSNST SFDDVLAMTM PKTSVDILFE
EFSAGRGPVF KEDFYPVVTS LIRKMTPEQI RNFAYVSEGL ENRIKSAADT AGTYEELVES
ICTRRYTKTR VQRILMGILM GVTSKDLDML SRFDSPQYAR ILGFNSKGKQ LLSQIKKKSS
IPLVLKLSDF IKSCDPVLKR KLELEILATD LYVMCYKNPA FRKAGQEFTQ NIIIM