Gene Cthe_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2032 
Symbol 
ID4811002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2409329 
End bp2415343 
Gene Length6015 bp 
Protein Length2004 aa 
Translation table11 
GC content39% 
IMG OID640107441 
Producthypothetical protein 
Protein accessionYP_001038436 
Protein GI125974526 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGGAA AAATTAAGCA ATTTATTTCT TTGCTGGTGA CGGTAGTTCT GATGGCCGGA 
ATGTTCCCGA TATCCATGAT GTTTGCTTCA GATGCGGTGT TTGAACCTGT TTTTACTGTT
ATAATTGACG AATCTGTAAT AGGAGATATT TCCGTTACAC TCACAAATTT TCAAAACAAG
GAAGAGACAA GAACGGAACC TGTGATAAAC GGAGTTGCGA CATTTGAGAA TTTTGTTAAC
CTGGCTAATT CCTACGATAT AAAAATAACA GGAATGTTCG GGTATAAAGA CTATTATCGT
TCAAACATAT ATTTTAGCGG ACCGAGTATT TCGTTCAGTG CTTCCGATTT TGTACCGGCG
GAGGTTATTA AAGTATCGGG TAAAATATAT GATGAAAACG GTAATCTTTA TAAAGGAGGC
GGAACGGTTT CTTATTCTGT ATATAATGGG TATAACAGTT ATACCGTTAA TTTCGGACAC
GACGGCAGTT TTTCTGTTGA AATCTACAAG AATACTCCTT ATAACTTTAC TTTTACAACG
TATGAACTTA AATATGATTC TGTTTCTCTG GGCCCCATAA GTAGCGATAA AGATATTGAC
AATCTGGAAA TTCGATTTTC TCTCAGGACT TTTTCAATCA CAACGAACGC CAGTGGGAAC
GGAACAATTT CACCTTCCGA ACATTCTGTT CCATATGGCT CAGACTACAC CATATGTGCC
AAGGCCGATT CCGGGTATGA GATTGAGGAA TTTATTGTTG ACGGAGAGCC GAAATGGGAT
GCTGTCGGCA AAGATGAGTA TACGTATTCT TTCTATCACA TTACCGGCAA TCATACCGTT
AGTGTTACTT TTACCGCAAA AACTTATGAG CTGGAATTTT GGTTTAATTC TGATGGCAAA
ATAGTGGATG GGTCTTTCAA AAATATAAAC AATGGTGGTA AAATTTCGGT TAAAAGTGGG
GAAAGCCCCA GTTTTACTGC GATTGCAAAC CCCAATTATC ACATCAGCCA GGTTCTTATT
GATGGTGTTG AGCAGAAAGA TGGTACGTTT GACAATGAAC AAACCACTTA CACGTATACT
TTTAACAATA TAAGCTCAAA CCATCATATT CAGGTAACAT TCTCCATCAA TAGATATACC
ATAAATATTT CTACTGTCGG GAAAGGACGG GTTTATGCTG AAGGAGCTCC ATATGGACCG
AAGGTTGATG TGGAACATGG ACAGTCTGTT AAGCTGCTTT TGATACCTGA CTGGGATTAT
GATGTGAAAG AAGTCCTGTT GGACGGTGTG GAAGTTAATA ATTATAACTT GTCAGGCAAC
GGTATTTCTT ACATTTATGA ACTGCCATCC ATAACGTCAG ACCATGATAT AACGGTTACT
TTTGGTGAGA TGGGGACTGT TGAAGGCGAT GAAAGCGATT TTTACACTAT CGTTACTAAA
AGCCTTTTGG ATGGATATCC GATAGTGTCT GACGGTGTAC GAATTTACAA TTTTAAAAAC
AAAGATGCTT CGTTAACCTT TAAACTCAAT AATAAATATA GTAGTATTTG TATAAATGGA
ACGGTGTACT TTTCTCAGGC GACAATTACA GAGTCGACTT TTATAGAAGA GATCAAAGTG
TACAGGCATT TAGCAGGCTG GAAAAAAATA AAAATGCCGG AGAAAATTCA AATAATTATT
GACAAAAAAG CTCCGCTTAT TTCCGATATT CCGCCGATGG AATGGACAAA TCAGGATTAT
ACCGTTGAGG CGAAGGTATT TGATGAAGAT GCGAAAGATT TCCCGTCATC CGGCTTGTCA
AGGGTTGTTT GGAGCAAGAA ACCGTTAACA AAGGAAGAGG TTTTGGCGGA AGAAACAAAT
ATAATTCCAA TTGCGGACGG GACATTTTCG TATACAATTA CAACAGAGCA AAATAACGAG
AAATTTTATT TCTACGCAAT TGATAATGCT GACAACATGT CAGAACCCAA AACAATAGAT
GTAAAAATTG ACAAAACAAA ACCTGAAATA ACAGAATTCA CATTCCGGAA AAAACAGGGG
TCTGCCTCGT CACAAGTTAT TAATTTTCTG ACTTTTGGAA CCTTTTCAAG CGATGAAATT
GAAGTGGTTG TAACTGCCCA GGACCCCGGT ATTTCCTCAG GTTTGAAAGA AATTACTTTG
TACAGTGACG GTGTCGCTGT GGAAACTAAG ACAGTTACAG AAAATTTTGC GGTATTCAAT
TTGACCCTTG AGAATTTCAG CGGGAATGAA ATTTCAGCGT CTGTAAAGGA CGTGGCAGGT
AATGATTCGG CAAGGGACGG ACTTACAAAG CCTACAGATG TGAAGACAAA TGCATTCAGC
AATTTTGTAG GGCTTAGAAA TGAAAAACCG ACTGTTGTTA TTACACCTAT GAACAGGTCG
GTGTATAGAG AAGGCGAAAA AGAATGGCAC AACGACAGTG TGGCATTTTC AGTATATGCC
GCCACTGACA GTGCCGGAAT TTATTCGGTT GAGATCAAAG TCAACGGAAA GACCGTTGCA
ACTGACAAAA CCGGAAAAGC CGTGAATGCC GACTTTTTCG AATCTCAGAC GTTGAAAGAA
ACATTTACAG TTGATACGGA CTTCAATGCC ATGGATGGGG AGAACAACAT AGAAGCGATT
GTTACCAACA ACTATGGAAA CATGGAAACT GCCAGGGTAA AGGTGTTTAT TGACAAGACA
AATCCCAGAA TTATAGGATT TAATATTACT GCAGAAAATA ACGGAATTTT AAGTAAAATA
TTAAATTTCC TGTCATTTGG AAATTTCTTT AACGAAAGAG TAAGAGTAAC GGTTATTGCT
GATGACAGAT ATGGAGCTAC TTCGGGTATC AATACAATTA TTTTGTATAT GAACGGTAAT
CCTGTAAACG GTTCGCCCAA AACTGCAACC TTGCTTAGTG ACGGAACATA TAAAGCAGAG
TTTGTTTTAC CGGAACGACT GCTTTCCAAT GACGTTTCTC TCAATGCAGT TTTGTCAGCC
GTAGCTGTTG ACAATGTCGG AAATATCACG GGCAAAGACA AAGAACATCC CAATGGTGTT
CCGGTAACGC CTGATAAAGT AAATTCCGAT TTTAAGAGCA GCAAACTGAT TATTGAAACG
GTTAATCCGG CAGTAAGCAT TTTTTGCCCG GAACCTGATT TTACAGCAAA TGACGGTAAG
AAATGGTATT TGGACGATGT TGTGTTCAAA GTATCCGTTA AAGATTTGGA TGCAGGTATT
CGTTCTGTCA GGATTAAAAT TAATGGTACT GATATTACAA AAGATATAGA GGGGAAAATT
ATAGATGCTG AATTTTATAA CAAAGAAACA CATGAAGAAG TTTTTCTGGT AAGTACCGGG
CAGGCAGTCA GGGCGGATGA CGGCTCTTAT TTGATTGAAG TTACTGCAGT GGATAATGCA
GGAAACAGTT ACTTTTTAAG CGATGTTGTA TATAAGGATA CAGATATCCC TGTTGTTACA
GACTTCAGTT TTGTTCCGTC AGCATCTGAC GGTGTTAAAA GTACATCTCA ATTCATCGAT
TTTCTGGAAT ATGGCTTCTA TTTCAAAACT GAATTCAACG TTGTTGTCAG TGTGTCGGAT
AACAAACCTT CTTCAGGGCT TGACAAGATA AACTACCGTC TTGTATCGTA CGATAACGGG
AAAAAATCTG AAGAGAAAAA AGGAACTCAA ATTATTGCGG ACGGTACAGC GGTTATATCT
GTTCCAAAAG GTTTTAAAGG ACAGATATTT GTAGAGGCAT TTGACAATGC CGGCAACAAA
TCACGGGAAG TGACTCCGGG AGGATTTGTA AGTGATGATG TTGCTCCGGA GATCAGCATT
ACAAACAATA CGACTACAAA CTACAGTGAT GCCGAGGGGA ACAAATTGTA TGTCACCGAC
ATGAGCTTTA CTGTCACAAT CTCGGATTAT GACTCGGGTA TAAAGGAAAT AGGATTTTCA
CTGAGTTCGG AAAAAGAATC CTTTGAGAGA AAGAGCACTT TGATTGCCAA CAAAGGCTAT
AAAGTAGGTG ATGTTCTTGA AAACGGATGG ATAGTTTCGG AAATGGATCA GAACCTTGTT
AGAAAGGTTA CAAAAGTATT TACTTTCAGT TCAGACAATA ATGATATTTT CCTGACTTTT
GATGCAAGCG ACCGTTCAGG GAACAAAAAA GAAAATATCC GAACCGAAAA AGTTACAATT
GATAAAACGG CACCGGTGAT TAATGTAGAG TTCCGTCCTG ATGAAAGCAA AAACAATTAT
TACTACAGCG GAAACAGGGC TGCGCAGATA ACGGTTGTCG AGAGGAATTT TGATGCCGGT
TTGATAAAAA CTTCTATTGA AAATAAAATT GGAAGAGTGC CGACTGTTTC CTTTTCTCAA
AAGTCAAATA CTGAGCATGT TGCGGTGATT GAGTTTGACG AGGGCGACTA TGTATTTGAT
ATCAGCGGTA CTGATTTGGG CAATCATACT GCGGTTGTGA ATTTCAGCGG AGAAAATGAG
AAACTGTTCT ATGTGGATAA AACTAAACCC AGTATTGAGG AGAATTTTGC AACATTTACG
AACGAGGCTA CAAATAATAG TTTTAATACA GATAAAACTG CTGTTATAAA AATAACTGAG
CATAATTTTG ACCCTCAACT GGCTGGTTTG AAAGTTTTCA GAAAGGATCC GGGAGAAGAA
CATAACGAAG AAGGTTTTGT TGACATAACA TCTGAAATAT TGAGTACATC CCGTTGGGTG
TCGGAAGGTG ATGTTCATAC AATTTCCTTT ACGTTTAGCA GAGATGCTGT CTACAAGATT
GAAATTAATC CCGAAGATTT GGCCGGAAAC AGTGCTGAGC CAAGACGTAC GGTTGTCTTT
GAAATTGACA AAACACCACC GGTTGTTAAG GCAAGGAACG GTGTTTTGGC CGATGAAAAT
GATACTGCGT TTGTAGACGT ATATCCTTAT TCAAGAAAAG ACGATCCAGC TCCTACTGTT
GAGTTTAGTG ATTTAAATAT TGATCATATC AGATATAACC TGACGGTTTA TATTCCGGAC
CATACGAGCA AAGAAGCTGA AACCGTCATT AAACCGGTAA GGGTGTATCT GGATGAGGAT
ACGGACAAAT CCGGCAGAAT CAAGGGAAGT ATTTTCACTT TGCCCGATTT CACACGGGAC
GGTGTTTATG CATTGGAATT GACTGCTGTG GATGTGGCCG GAAATGAGAG CCTGCTTAAC
CTGAATACAT ATGCGAGAAT GGTTGAGCAG GATGTGCTGG CGTATATCAT GGACAGCAAC
CCGGAAGCAA AAACGGGGTT GTATTCTTTC CAATACGAAA ACGGCGAGCC TATCAGCAAA
AGGCCTGATA ACTTCAGCGA TATAAAAATA TGTGCTTTTG CAAAAGACGA TACTGATGTT
GAGGTTGTTT TAAGAGACAA TAACGGTGAT GAAATAAAAA CTGATGCTAC GGGAACGATA
GATGACAGCA TATATGGGTT TAATATTCAC AACCTTGTTT TGGAATCCCG CTTTTTCAAG
GAGACTTTCC AGGATGATAC AGATATTGAA CTGCATTTGT CAGTAAAGAA TGACGGCAGG
AGAGTGGACT TGGGTACAAT GCATATTGAT AATATAGCTC CGACGTGCGA ATTGCCTGAA
GAATTCAGGT CGTGGCATTG GTACTTTGGA GAAGAGGAAC GTACAATTAC TGTTTCGAAT
ATCAACGAAT TGGTTGACGA AAACCGGTGC AAGGTATATG ATAACGGCAG GGAAATAGAT
TTCAAATATT ACAGTGATAA CAATACATTA ACTTTTACCC TCGGAAAGGG TTGGCACAAC
GTTGGAATTG TCCTTGTGGA TATGGCGGGA AATGTGAACA ATATTCAAGA GATAAGGAAT
ATACATGTTG GATTTTTCTG GCTGTGGGTT ATTGCAGCGG GATCTGCAGT ATTAATTGCG
GCAATAGTTG CTGCTGTGAT TCATAATATA AGAAAAAGGA GAAAAGAAGA GGAGGAAAAC
GAGCTGGCGG CTTGA
 
Protein sequence
MKGKIKQFIS LLVTVVLMAG MFPISMMFAS DAVFEPVFTV IIDESVIGDI SVTLTNFQNK 
EETRTEPVIN GVATFENFVN LANSYDIKIT GMFGYKDYYR SNIYFSGPSI SFSASDFVPA
EVIKVSGKIY DENGNLYKGG GTVSYSVYNG YNSYTVNFGH DGSFSVEIYK NTPYNFTFTT
YELKYDSVSL GPISSDKDID NLEIRFSLRT FSITTNASGN GTISPSEHSV PYGSDYTICA
KADSGYEIEE FIVDGEPKWD AVGKDEYTYS FYHITGNHTV SVTFTAKTYE LEFWFNSDGK
IVDGSFKNIN NGGKISVKSG ESPSFTAIAN PNYHISQVLI DGVEQKDGTF DNEQTTYTYT
FNNISSNHHI QVTFSINRYT INISTVGKGR VYAEGAPYGP KVDVEHGQSV KLLLIPDWDY
DVKEVLLDGV EVNNYNLSGN GISYIYELPS ITSDHDITVT FGEMGTVEGD ESDFYTIVTK
SLLDGYPIVS DGVRIYNFKN KDASLTFKLN NKYSSICING TVYFSQATIT ESTFIEEIKV
YRHLAGWKKI KMPEKIQIII DKKAPLISDI PPMEWTNQDY TVEAKVFDED AKDFPSSGLS
RVVWSKKPLT KEEVLAEETN IIPIADGTFS YTITTEQNNE KFYFYAIDNA DNMSEPKTID
VKIDKTKPEI TEFTFRKKQG SASSQVINFL TFGTFSSDEI EVVVTAQDPG ISSGLKEITL
YSDGVAVETK TVTENFAVFN LTLENFSGNE ISASVKDVAG NDSARDGLTK PTDVKTNAFS
NFVGLRNEKP TVVITPMNRS VYREGEKEWH NDSVAFSVYA ATDSAGIYSV EIKVNGKTVA
TDKTGKAVNA DFFESQTLKE TFTVDTDFNA MDGENNIEAI VTNNYGNMET ARVKVFIDKT
NPRIIGFNIT AENNGILSKI LNFLSFGNFF NERVRVTVIA DDRYGATSGI NTIILYMNGN
PVNGSPKTAT LLSDGTYKAE FVLPERLLSN DVSLNAVLSA VAVDNVGNIT GKDKEHPNGV
PVTPDKVNSD FKSSKLIIET VNPAVSIFCP EPDFTANDGK KWYLDDVVFK VSVKDLDAGI
RSVRIKINGT DITKDIEGKI IDAEFYNKET HEEVFLVSTG QAVRADDGSY LIEVTAVDNA
GNSYFLSDVV YKDTDIPVVT DFSFVPSASD GVKSTSQFID FLEYGFYFKT EFNVVVSVSD
NKPSSGLDKI NYRLVSYDNG KKSEEKKGTQ IIADGTAVIS VPKGFKGQIF VEAFDNAGNK
SREVTPGGFV SDDVAPEISI TNNTTTNYSD AEGNKLYVTD MSFTVTISDY DSGIKEIGFS
LSSEKESFER KSTLIANKGY KVGDVLENGW IVSEMDQNLV RKVTKVFTFS SDNNDIFLTF
DASDRSGNKK ENIRTEKVTI DKTAPVINVE FRPDESKNNY YYSGNRAAQI TVVERNFDAG
LIKTSIENKI GRVPTVSFSQ KSNTEHVAVI EFDEGDYVFD ISGTDLGNHT AVVNFSGENE
KLFYVDKTKP SIEENFATFT NEATNNSFNT DKTAVIKITE HNFDPQLAGL KVFRKDPGEE
HNEEGFVDIT SEILSTSRWV SEGDVHTISF TFSRDAVYKI EINPEDLAGN SAEPRRTVVF
EIDKTPPVVK ARNGVLADEN DTAFVDVYPY SRKDDPAPTV EFSDLNIDHI RYNLTVYIPD
HTSKEAETVI KPVRVYLDED TDKSGRIKGS IFTLPDFTRD GVYALELTAV DVAGNESLLN
LNTYARMVEQ DVLAYIMDSN PEAKTGLYSF QYENGEPISK RPDNFSDIKI CAFAKDDTDV
EVVLRDNNGD EIKTDATGTI DDSIYGFNIH NLVLESRFFK ETFQDDTDIE LHLSVKNDGR
RVDLGTMHID NIAPTCELPE EFRSWHWYFG EEERTITVSN INELVDENRC KVYDNGREID
FKYYSDNNTL TFTLGKGWHN VGIVLVDMAG NVNNIQEIRN IHVGFFWLWV IAAGSAVLIA
AIVAAVIHNI RKRRKEEEEN ELAA