Gene Cthe_1647 details


Gene Information

Locus tag: Cthe_1647
Symbol:
ID: 4809342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1971496
End bp: 1974051
Gene Length: 2556 bp
Protein Length: 851 aa
Translation table: 11
GC content: 45%
IMG OID: 640107062
Product: hypothetical protein
Protein accession: YP_001038063
Protein GI: 125974153
COG category: [S] Function unknown
COG ID: [COG4983] Uncharacterized conserved protein
TIGRFAM ID:
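
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 1971496
end_bp = 1974051

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, minus one terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 2556 bp
print(protein_length, "aa")  # 851 aa
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.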


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCATCT CATTCCCTAA AGAATTGGCA AACCGGAAGC AATGGATCTG CTGGCGTCTG 
GAACCAAACA CAAAGGACGG AAGAGACAGT AAAATCCCTT ACAATCCCTT AACCGGCAGA
AAAGCCTCAA GCACTAACCC AAACGACTGG TCGACCCTTG ACGATGCGAT TGCGGCAAAA
GAACAATACC TCTATACCGG ATTAGATTTT GTATTCGCAA AAAGCGGAGG TTTAGTAGGG
ATAGACATAG ATCACTGCCG CGACAAAAAC ACTGGGGAAT TAAGCGATAC CGCCAAGGAT
ATCCTTGAGC GGTTTCCGTC CTATACGGAA ATCAGCCCTT CAGGAACTGG GCTTCATATT
TTCTATAAAG GGGAGATGCC TGCCAAGGGC AATAAAAACA CTAAAACCGG CGTTGAAATG
TATGCCCACA GCAGGTACTT CACAATGACC GGCGACCCGT TGCCCGGGAC TCCTGATAGC
ATTGCCGAAG ATAACGGAGC ACTGGCCTGG ATACATGAGA ACTATATCAA AAGCAAGAAG
CGGAGCGGGA AAAGCAAGAA AAACCGTAAG AATTTTAAGC TAGAGCCGCT TACAGATGAA
GAAATTCTGG AGAAAGCCCA GACAGCCGAA AACCATAAGG AATTTGAACT GCTATGGGAA
GGAAAATGGC AGGAAGCAGG GTATCCGAGC CAGTCCGAAG CCGACCTTGC TCTTTGCTGT
ATGCTGGCTT TCTGGTCAGG CAAAAACAAA GAGCAGATGG ACAGGCTGTT TAGAAATTCC
GGGTTATTCC GGGAAAAGTG GGATACGGTA CATCATGCAA GTGGAGCAAC ATATGGGCAG
GAGACACTGG ATAAGGCCAT TGAAGTCACA GAGAATGTAT ACAGCCGCGA AAGCGAGTCA
GTTATCTTTG AACATGAGGG CAGGTATTAC CGCACCAGAG GCGAAAGTGT GTATCCTATA
ACAAATTTTA TCATTCAACC GGTGGAGATG ATTGTATCGG AAGATGAAAC GCAGATGACT
GCTGACCTTA TTACAATCCG TGATGAAATA TACCGCCAGA CATTTATGAC TACCGACTTC
AATAACATCC AAAAATTTAA AAATATCTTG AACCGCCGGA CAATATCCTT AGGCTATTTT
GGCTCAGAAG GAGATTTGGA ACTGCTGAAA GGTTATATAT CTGAAATGGA GTGGGTAAGG
AAAACAGGGG TCAAGGCTCT TGGGATTTAT GAGCATGGCG GGCGGATGGT ATATGTTTCA
ACGGATGGTG CCATTGAAGC AGGAGGCAAC ATTGTTGAAG ATATCGTGCA GCTTGATAAG
TATAAAAGCA TAACAACCGA TATCCTAACC TTTGAGCCAT TGACAAAGAA ACAGCTTATT
ATGCTTGGTG AATGGCTCCT CAGCTATAAC GAGCCCATAA AAACGGTATC AGTAATGGCC
TGGGTGGCCG GATGCTTCAT TAAACCGCAT CTTAAAAAAT CAGGCATCAA GTTTCCTCAT
TTATTGCTTG TCGGAGAACA AGGCAGCGGA AAAAGTAATA CATTGGAGCG GGTTATTCTG
CCGGTATTTT CGTGCAGCAA AATCCGCGCG TCTACGCAGG TTACTGCATT TACACTGATG
AAGGAATCTG CATCATCGAA TCTTATACCG CAGTTGATGG ATGAGTTCAA GCCTTCAAAG
ATAGATAAGT TAAGGCTAAA TGCCTTATAC AACCATCTTC GAGATGCATA TGACGGCCAT
GAAGGTGTCC GCGGTAGGGC GGATCAAAGT GCTGTTACTT ATGAACTGTT GGCACCTATT
ATTGTAGCTG GTGAGGAATC GCCGGATGAA GCGGCCATCA GAGAACGGAG CATAGAATTG
CTATTCAGCA AGAAGGACTT AAAACCAGCC AGCCATAGAC AAGCATTTTA TAAGCTGTGT
GCAAAAGCGG ATCTGCTTGG CAGCTTCGGT CGGAGCCTGC TGGATATAGC ACTCAGAGTA
TCGGTTGCTG AGGCAGAGAA GTGGTATGAG GAAGCAAAGT CAGAGATATC TGATGAGTTT
CCATCTCGTA TCGTCAATAA TCTCGCCTGT TGCTATGCCG GATTGAGCCT AGTAAACAAA
CTGTGTGAAT TCCTTAATGT AACGTGGGCT GAAGTATTTC CCATTAACAA AGGGGCATGT
ATTCGATATC TTCAAAACGG TGTGCAGGAG TACTTGCTGG ATGGCGGCAG CAATAACAAG
ACCATTGTAG AACAGACCCT GGAAATCATG GCCCGGATGA AACTGGCTCC GAATCAAGAC
TACACTTTTG ATAAAGATGG CAAGGTTATC GGGATTCGTT TCTGTGATGT ATATGACCGC
TATACCAAGT ACAGACGCGA TTATGCAATC ACAGGTGAAT GTCTTCCATA TAACCAGTTT
CTGAAGCAAT TGAGGCAAAG TGACTTTTTT ATAGAGAGCA ATAAAACGAT GCGTTTCGGG
AATGAAACAA AAAAAGCGTG GGCTCTTGAT TTCTCGATAC TGAAAGAGCG ATGCGATGTG
AGCGGCTTTG AAATTACAGA TATTGAGCCT CTTTAG
 
Protein sequence
MSISFPKELA NRKQWICWRL EPNTKDGRDS KIPYNPLTGR KASSTNPNDW STLDDAIAAK 
EQYLYTGLDF VFAKSGGLVG IDIDHCRDKN TGELSDTAKD ILERFPSYTE ISPSGTGLHI
FYKGEMPAKG NKNTKTGVEM YAHSRYFTMT GDPLPGTPDS IAEDNGALAW IHENYIKSKK
RSGKSKKNRK NFKLEPLTDE EILEKAQTAE NHKEFELLWE GKWQEAGYPS QSEADLALCC
MLAFWSGKNK EQMDRLFRNS GLFREKWDTV HHASGATYGQ ETLDKAIEVT ENVYSRESES
VIFEHEGRYY RTRGESVYPI TNFIIQPVEM IVSEDETQMT ADLITIRDEI YRQTFMTTDF
NNIQKFKNIL NRRTISLGYF GSEGDLELLK GYISEMEWVR KTGVKALGIY EHGGRMVYVS
TDGAIEAGGN IVEDIVQLDK YKSITTDILT FEPLTKKQLI MLGEWLLSYN EPIKTVSVMA
WVAGCFIKPH LKKSGIKFPH LLLVGEQGSG KSNTLERVIL PVFSCSKIRA STQVTAFTLM
KESASSNLIP QLMDEFKPSK IDKLRLNALY NHLRDAYDGH EGVRGRADQS AVTYELLAPI
IVAGEESPDE AAIRERSIEL LFSKKDLKPA SHRQAFYKLC AKADLLGSFG RSLLDIALRV
SVAEAEKWYE EAKSEISDEF PSRIVNNLAC CYAGLSLVNK LCEFLNVTWA EVFPINKGAC
IRYLQNGVQE YLLDGGSNNK TIVEQTLEIM ARMKLAPNQD YTFDKDGKVI GIRFCDVYDR
YTKYRRDYAI TGECLPYNQF LKQLRQSDFF IESNKTMRFG NETKKAWALD FSILKERCDV
SGFEITDIEP L
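
The protein sequence can be related back to the gene sequence through translation table 11, the bacterial, archaeal and plant plastid code. The following is a minimal sketch, not the method used to produce this record. It builds the codon table from the standard NCBI amino-acid string, and it assumes that an alternative start codon such as GTG in the first position is read as methionine. The function and constant names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 shares these
# amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # assumption: any initiator codon is read as Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("GTGAGCATCT CATTCCCTAA AGAATTGGCA"))  # MSISFPKELA
```

The output matches the first ten residues of the protein sequence, which begins with M even though the gene starts with GTG rather than ATG.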