Gene Cthe_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1888 
Symbol 
ID4809219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2241344 
End bp2242927 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content35% 
IMG OID640107307 
Producthypothetical protein 
Protein accessionYP_001038302 
Protein GI125974392 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACAAA TGGACAATGA AGCACAGTTT AAAGTGGAAT TTCATACCAC TAGCAACAAG 
TTGATAACAA AGATTTTATA TACCAGTGAT TTTTCTTCTA CCAGTAAGTT TAAGAGTGCC
CTTAACCGCA ACAGCATAGA CCTTATATAC CAGGGAAATG ACCACGACCT TGAGATTATT
AAGGATATCG TCTCCAGGAA GCCTTATGCT GTTAAGACAG GTGTAAGTTA TATAGGTATT
GTGAAAAAAG AGTCACAGTA CATTTTTGTA TCAGACACAA AAGCCATTGA TAAAAATGAC
AATATTGTTG ATGATATTGT AATCATGGAA AACTCAAAGT GTATCGGCAC TGGACTTGCT
GAAGTACAGC CTATAGATAA TATTGGATTA GCGCAATTAT CAAAACTTCT TTTTAATTTT
AATATTCTTC CTATTACAGC GACTATTATC GGTTTTTGTG CAAGTTGTTT TTTGAAAGAA
AAATTATGGC AAAGCGGGAG AATAAAGCAT AACTACTTAA TTATTACAGG CGAATCAGGT
TCGGGGAAAA GTGAAACAGT TGAAAATGTT ATTATGCCTA TGTTTGCTGC AACAAATTGC
ACTATAAGCA GCGGGCAGAT AACAAGATTT GTAAATGCAA AAATGTCAGC AAGCAGCAAT
CTTATCCCTA TGATTATAAC TGAATATAAG CCAACCAAGT TAAATAAGAA CAGAATGGAT
GAAATTTCTG ACCTGTTAAG AAATACCTAT GACAGAACAC CTGCCTATAG AGGCAGGCCT
GATTTGATAC TTAACGAATA TCTACCTCTT GCCCCAATAA TTCTAGTAGG GGAAATGGGC
TTCGATGAAA CTGCTATAAA GGAACGCAGT CTGGAAGTCT TGTACTCTAA AGCGAATATT
AGGGATGAGG TTATTCAAGG ACGATTTAAA ATGCTAAAAC GGCGTTCAAG AGAACTAAGA
ATGCTTGGAA GAAGCCTATT AAACCAAGCA TTAAAAATAG AGCCTGATGA ATTGATAAGA
AGGCATAGAG CAATTGAAGA AAGGATAAAT TTAAGTTTAC CATCCAGAGT TAAGAATTCT
ATTGCAAATT GTATGCAGGG ATTGCTCCTT CTCAAGGATG TATATGATTC TCTGAATATG
AACTTTGAGA AACAGGTAGG CTATAGCATT CAAGAACTCT TCAATTCTGT TTTAAGTGGA
GTTCATGACT ACCTGCTTGA TGGGCAAAAT GAGGCTAAAG GAAGCATAGA GAAAATTCTA
GAGGTTATCT GTCGTATGGA AGAATCAGGG GTACTTATAA GGGGTAATGA TTATCAGGTT
ATAAATCAGG GTACTGAGTT GGCACTGAAT ATTAGTCCTT TATATGATAA ATTTACAAAG
TATGTAAGAG AACATAACAT ATTAGATGTG GAAGTTTTAT CCCTTCCTCA ATTCAGAAAG
CAATTAAGAA CAAAAGAGTA TTTCTCTGAC TACAAAACAG TAAGGTTCAA TACAGCCAAC
TCTGCTGATA AGCCTGTAAA AGCATATACA CTTGATATTG CTAAGTTAAG AGAGATTCTT
GATATTTCTG CACTGGTAGA ATAA
 
Protein sequence
MVQMDNEAQF KVEFHTTSNK LITKILYTSD FSSTSKFKSA LNRNSIDLIY QGNDHDLEII 
KDIVSRKPYA VKTGVSYIGI VKKESQYIFV SDTKAIDKND NIVDDIVIME NSKCIGTGLA
EVQPIDNIGL AQLSKLLFNF NILPITATII GFCASCFLKE KLWQSGRIKH NYLIITGESG
SGKSETVENV IMPMFAATNC TISSGQITRF VNAKMSASSN LIPMIITEYK PTKLNKNRMD
EISDLLRNTY DRTPAYRGRP DLILNEYLPL APIILVGEMG FDETAIKERS LEVLYSKANI
RDEVIQGRFK MLKRRSRELR MLGRSLLNQA LKIEPDELIR RHRAIEERIN LSLPSRVKNS
IANCMQGLLL LKDVYDSLNM NFEKQVGYSI QELFNSVLSG VHDYLLDGQN EAKGSIEKIL
EVICRMEESG VLIRGNDYQV INQGTELALN ISPLYDKFTK YVREHNILDV EVLSLPQFRK
QLRTKEYFSD YKTVRFNTAN SADKPVKAYT LDIAKLREIL DISALVE