Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2032 |
Symbol | |
ID | 4811002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2409329 |
End bp | 2415343 |
Gene Length | 6015 bp |
Protein Length | 2004 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107441 |
Product | hypothetical protein |
Protein accession | YP_001038436 |
Protein GI | 125974526 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGAA AAATTAAGCA ATTTATTTCT TTGCTGGTGA CGGTAGTTCT GATGGCCGGA ATGTTCCCGA TATCCATGAT GTTTGCTTCA GATGCGGTGT TTGAACCTGT TTTTACTGTT ATAATTGACG AATCTGTAAT AGGAGATATT TCCGTTACAC TCACAAATTT TCAAAACAAG GAAGAGACAA GAACGGAACC TGTGATAAAC GGAGTTGCGA CATTTGAGAA TTTTGTTAAC CTGGCTAATT CCTACGATAT AAAAATAACA GGAATGTTCG GGTATAAAGA CTATTATCGT TCAAACATAT ATTTTAGCGG ACCGAGTATT TCGTTCAGTG CTTCCGATTT TGTACCGGCG GAGGTTATTA AAGTATCGGG TAAAATATAT GATGAAAACG GTAATCTTTA TAAAGGAGGC GGAACGGTTT CTTATTCTGT ATATAATGGG TATAACAGTT ATACCGTTAA TTTCGGACAC GACGGCAGTT TTTCTGTTGA AATCTACAAG AATACTCCTT ATAACTTTAC TTTTACAACG TATGAACTTA AATATGATTC TGTTTCTCTG GGCCCCATAA GTAGCGATAA AGATATTGAC AATCTGGAAA TTCGATTTTC TCTCAGGACT TTTTCAATCA CAACGAACGC CAGTGGGAAC GGAACAATTT CACCTTCCGA ACATTCTGTT CCATATGGCT CAGACTACAC CATATGTGCC AAGGCCGATT CCGGGTATGA GATTGAGGAA TTTATTGTTG ACGGAGAGCC GAAATGGGAT GCTGTCGGCA AAGATGAGTA TACGTATTCT TTCTATCACA TTACCGGCAA TCATACCGTT AGTGTTACTT TTACCGCAAA AACTTATGAG CTGGAATTTT GGTTTAATTC TGATGGCAAA ATAGTGGATG GGTCTTTCAA AAATATAAAC AATGGTGGTA AAATTTCGGT TAAAAGTGGG GAAAGCCCCA GTTTTACTGC GATTGCAAAC CCCAATTATC ACATCAGCCA GGTTCTTATT GATGGTGTTG AGCAGAAAGA TGGTACGTTT GACAATGAAC AAACCACTTA CACGTATACT TTTAACAATA TAAGCTCAAA CCATCATATT CAGGTAACAT TCTCCATCAA TAGATATACC ATAAATATTT CTACTGTCGG GAAAGGACGG GTTTATGCTG AAGGAGCTCC ATATGGACCG AAGGTTGATG TGGAACATGG ACAGTCTGTT AAGCTGCTTT TGATACCTGA CTGGGATTAT GATGTGAAAG AAGTCCTGTT GGACGGTGTG GAAGTTAATA ATTATAACTT GTCAGGCAAC GGTATTTCTT ACATTTATGA ACTGCCATCC ATAACGTCAG ACCATGATAT AACGGTTACT TTTGGTGAGA TGGGGACTGT TGAAGGCGAT GAAAGCGATT TTTACACTAT CGTTACTAAA AGCCTTTTGG ATGGATATCC GATAGTGTCT GACGGTGTAC GAATTTACAA TTTTAAAAAC AAAGATGCTT CGTTAACCTT TAAACTCAAT AATAAATATA GTAGTATTTG TATAAATGGA ACGGTGTACT TTTCTCAGGC GACAATTACA GAGTCGACTT TTATAGAAGA GATCAAAGTG TACAGGCATT TAGCAGGCTG GAAAAAAATA AAAATGCCGG AGAAAATTCA AATAATTATT GACAAAAAAG CTCCGCTTAT TTCCGATATT CCGCCGATGG AATGGACAAA TCAGGATTAT ACCGTTGAGG CGAAGGTATT TGATGAAGAT GCGAAAGATT TCCCGTCATC CGGCTTGTCA AGGGTTGTTT GGAGCAAGAA ACCGTTAACA AAGGAAGAGG TTTTGGCGGA AGAAACAAAT ATAATTCCAA TTGCGGACGG GACATTTTCG TATACAATTA CAACAGAGCA AAATAACGAG AAATTTTATT TCTACGCAAT TGATAATGCT GACAACATGT CAGAACCCAA AACAATAGAT GTAAAAATTG ACAAAACAAA ACCTGAAATA ACAGAATTCA CATTCCGGAA AAAACAGGGG TCTGCCTCGT CACAAGTTAT TAATTTTCTG ACTTTTGGAA CCTTTTCAAG CGATGAAATT GAAGTGGTTG TAACTGCCCA GGACCCCGGT ATTTCCTCAG GTTTGAAAGA AATTACTTTG TACAGTGACG GTGTCGCTGT GGAAACTAAG ACAGTTACAG AAAATTTTGC GGTATTCAAT TTGACCCTTG AGAATTTCAG CGGGAATGAA ATTTCAGCGT CTGTAAAGGA CGTGGCAGGT AATGATTCGG CAAGGGACGG ACTTACAAAG CCTACAGATG TGAAGACAAA TGCATTCAGC AATTTTGTAG GGCTTAGAAA TGAAAAACCG ACTGTTGTTA TTACACCTAT GAACAGGTCG GTGTATAGAG AAGGCGAAAA AGAATGGCAC AACGACAGTG TGGCATTTTC AGTATATGCC GCCACTGACA GTGCCGGAAT TTATTCGGTT GAGATCAAAG TCAACGGAAA GACCGTTGCA ACTGACAAAA CCGGAAAAGC CGTGAATGCC GACTTTTTCG AATCTCAGAC GTTGAAAGAA ACATTTACAG TTGATACGGA CTTCAATGCC ATGGATGGGG AGAACAACAT AGAAGCGATT GTTACCAACA ACTATGGAAA CATGGAAACT GCCAGGGTAA AGGTGTTTAT TGACAAGACA AATCCCAGAA TTATAGGATT TAATATTACT GCAGAAAATA ACGGAATTTT AAGTAAAATA TTAAATTTCC TGTCATTTGG AAATTTCTTT AACGAAAGAG TAAGAGTAAC GGTTATTGCT GATGACAGAT ATGGAGCTAC TTCGGGTATC AATACAATTA TTTTGTATAT GAACGGTAAT CCTGTAAACG GTTCGCCCAA AACTGCAACC TTGCTTAGTG ACGGAACATA TAAAGCAGAG TTTGTTTTAC CGGAACGACT GCTTTCCAAT GACGTTTCTC TCAATGCAGT TTTGTCAGCC GTAGCTGTTG ACAATGTCGG AAATATCACG GGCAAAGACA AAGAACATCC CAATGGTGTT CCGGTAACGC CTGATAAAGT AAATTCCGAT TTTAAGAGCA GCAAACTGAT TATTGAAACG GTTAATCCGG CAGTAAGCAT TTTTTGCCCG GAACCTGATT TTACAGCAAA TGACGGTAAG AAATGGTATT TGGACGATGT TGTGTTCAAA GTATCCGTTA AAGATTTGGA TGCAGGTATT CGTTCTGTCA GGATTAAAAT TAATGGTACT GATATTACAA AAGATATAGA GGGGAAAATT ATAGATGCTG AATTTTATAA CAAAGAAACA CATGAAGAAG TTTTTCTGGT AAGTACCGGG CAGGCAGTCA GGGCGGATGA CGGCTCTTAT TTGATTGAAG TTACTGCAGT GGATAATGCA GGAAACAGTT ACTTTTTAAG CGATGTTGTA TATAAGGATA CAGATATCCC TGTTGTTACA GACTTCAGTT TTGTTCCGTC AGCATCTGAC GGTGTTAAAA GTACATCTCA ATTCATCGAT TTTCTGGAAT ATGGCTTCTA TTTCAAAACT GAATTCAACG TTGTTGTCAG TGTGTCGGAT AACAAACCTT CTTCAGGGCT TGACAAGATA AACTACCGTC TTGTATCGTA CGATAACGGG AAAAAATCTG AAGAGAAAAA AGGAACTCAA ATTATTGCGG ACGGTACAGC GGTTATATCT GTTCCAAAAG GTTTTAAAGG ACAGATATTT GTAGAGGCAT TTGACAATGC CGGCAACAAA TCACGGGAAG TGACTCCGGG AGGATTTGTA AGTGATGATG TTGCTCCGGA GATCAGCATT ACAAACAATA CGACTACAAA CTACAGTGAT GCCGAGGGGA ACAAATTGTA TGTCACCGAC ATGAGCTTTA CTGTCACAAT CTCGGATTAT GACTCGGGTA TAAAGGAAAT AGGATTTTCA CTGAGTTCGG AAAAAGAATC CTTTGAGAGA AAGAGCACTT TGATTGCCAA CAAAGGCTAT AAAGTAGGTG ATGTTCTTGA AAACGGATGG ATAGTTTCGG AAATGGATCA GAACCTTGTT AGAAAGGTTA CAAAAGTATT TACTTTCAGT TCAGACAATA ATGATATTTT CCTGACTTTT GATGCAAGCG ACCGTTCAGG GAACAAAAAA GAAAATATCC GAACCGAAAA AGTTACAATT GATAAAACGG CACCGGTGAT TAATGTAGAG TTCCGTCCTG ATGAAAGCAA AAACAATTAT TACTACAGCG GAAACAGGGC TGCGCAGATA ACGGTTGTCG AGAGGAATTT TGATGCCGGT TTGATAAAAA CTTCTATTGA AAATAAAATT GGAAGAGTGC CGACTGTTTC CTTTTCTCAA AAGTCAAATA CTGAGCATGT TGCGGTGATT GAGTTTGACG AGGGCGACTA TGTATTTGAT ATCAGCGGTA CTGATTTGGG CAATCATACT GCGGTTGTGA ATTTCAGCGG AGAAAATGAG AAACTGTTCT ATGTGGATAA AACTAAACCC AGTATTGAGG AGAATTTTGC AACATTTACG AACGAGGCTA CAAATAATAG TTTTAATACA GATAAAACTG CTGTTATAAA AATAACTGAG CATAATTTTG ACCCTCAACT GGCTGGTTTG AAAGTTTTCA GAAAGGATCC GGGAGAAGAA CATAACGAAG AAGGTTTTGT TGACATAACA TCTGAAATAT TGAGTACATC CCGTTGGGTG TCGGAAGGTG ATGTTCATAC AATTTCCTTT ACGTTTAGCA GAGATGCTGT CTACAAGATT GAAATTAATC CCGAAGATTT GGCCGGAAAC AGTGCTGAGC CAAGACGTAC GGTTGTCTTT GAAATTGACA AAACACCACC GGTTGTTAAG GCAAGGAACG GTGTTTTGGC CGATGAAAAT GATACTGCGT TTGTAGACGT ATATCCTTAT TCAAGAAAAG ACGATCCAGC TCCTACTGTT GAGTTTAGTG ATTTAAATAT TGATCATATC AGATATAACC TGACGGTTTA TATTCCGGAC CATACGAGCA AAGAAGCTGA AACCGTCATT AAACCGGTAA GGGTGTATCT GGATGAGGAT ACGGACAAAT CCGGCAGAAT CAAGGGAAGT ATTTTCACTT TGCCCGATTT CACACGGGAC GGTGTTTATG CATTGGAATT GACTGCTGTG GATGTGGCCG GAAATGAGAG CCTGCTTAAC CTGAATACAT ATGCGAGAAT GGTTGAGCAG GATGTGCTGG CGTATATCAT GGACAGCAAC CCGGAAGCAA AAACGGGGTT GTATTCTTTC CAATACGAAA ACGGCGAGCC TATCAGCAAA AGGCCTGATA ACTTCAGCGA TATAAAAATA TGTGCTTTTG CAAAAGACGA TACTGATGTT GAGGTTGTTT TAAGAGACAA TAACGGTGAT GAAATAAAAA CTGATGCTAC GGGAACGATA GATGACAGCA TATATGGGTT TAATATTCAC AACCTTGTTT TGGAATCCCG CTTTTTCAAG GAGACTTTCC AGGATGATAC AGATATTGAA CTGCATTTGT CAGTAAAGAA TGACGGCAGG AGAGTGGACT TGGGTACAAT GCATATTGAT AATATAGCTC CGACGTGCGA ATTGCCTGAA GAATTCAGGT CGTGGCATTG GTACTTTGGA GAAGAGGAAC GTACAATTAC TGTTTCGAAT ATCAACGAAT TGGTTGACGA AAACCGGTGC AAGGTATATG ATAACGGCAG GGAAATAGAT TTCAAATATT ACAGTGATAA CAATACATTA ACTTTTACCC TCGGAAAGGG TTGGCACAAC GTTGGAATTG TCCTTGTGGA TATGGCGGGA AATGTGAACA ATATTCAAGA GATAAGGAAT ATACATGTTG GATTTTTCTG GCTGTGGGTT ATTGCAGCGG GATCTGCAGT ATTAATTGCG GCAATAGTTG CTGCTGTGAT TCATAATATA AGAAAAAGGA GAAAAGAAGA GGAGGAAAAC GAGCTGGCGG CTTGA
|
Protein sequence | MKGKIKQFIS LLVTVVLMAG MFPISMMFAS DAVFEPVFTV IIDESVIGDI SVTLTNFQNK EETRTEPVIN GVATFENFVN LANSYDIKIT GMFGYKDYYR SNIYFSGPSI SFSASDFVPA EVIKVSGKIY DENGNLYKGG GTVSYSVYNG YNSYTVNFGH DGSFSVEIYK NTPYNFTFTT YELKYDSVSL GPISSDKDID NLEIRFSLRT FSITTNASGN GTISPSEHSV PYGSDYTICA KADSGYEIEE FIVDGEPKWD AVGKDEYTYS FYHITGNHTV SVTFTAKTYE LEFWFNSDGK IVDGSFKNIN NGGKISVKSG ESPSFTAIAN PNYHISQVLI DGVEQKDGTF DNEQTTYTYT FNNISSNHHI QVTFSINRYT INISTVGKGR VYAEGAPYGP KVDVEHGQSV KLLLIPDWDY DVKEVLLDGV EVNNYNLSGN GISYIYELPS ITSDHDITVT FGEMGTVEGD ESDFYTIVTK SLLDGYPIVS DGVRIYNFKN KDASLTFKLN NKYSSICING TVYFSQATIT ESTFIEEIKV YRHLAGWKKI KMPEKIQIII DKKAPLISDI PPMEWTNQDY TVEAKVFDED AKDFPSSGLS RVVWSKKPLT KEEVLAEETN IIPIADGTFS YTITTEQNNE KFYFYAIDNA DNMSEPKTID VKIDKTKPEI TEFTFRKKQG SASSQVINFL TFGTFSSDEI EVVVTAQDPG ISSGLKEITL YSDGVAVETK TVTENFAVFN LTLENFSGNE ISASVKDVAG NDSARDGLTK PTDVKTNAFS NFVGLRNEKP TVVITPMNRS VYREGEKEWH NDSVAFSVYA ATDSAGIYSV EIKVNGKTVA TDKTGKAVNA DFFESQTLKE TFTVDTDFNA MDGENNIEAI VTNNYGNMET ARVKVFIDKT NPRIIGFNIT AENNGILSKI LNFLSFGNFF NERVRVTVIA DDRYGATSGI NTIILYMNGN PVNGSPKTAT LLSDGTYKAE FVLPERLLSN DVSLNAVLSA VAVDNVGNIT GKDKEHPNGV PVTPDKVNSD FKSSKLIIET VNPAVSIFCP EPDFTANDGK KWYLDDVVFK VSVKDLDAGI RSVRIKINGT DITKDIEGKI IDAEFYNKET HEEVFLVSTG QAVRADDGSY LIEVTAVDNA GNSYFLSDVV YKDTDIPVVT DFSFVPSASD GVKSTSQFID FLEYGFYFKT EFNVVVSVSD NKPSSGLDKI NYRLVSYDNG KKSEEKKGTQ IIADGTAVIS VPKGFKGQIF VEAFDNAGNK SREVTPGGFV SDDVAPEISI TNNTTTNYSD AEGNKLYVTD MSFTVTISDY DSGIKEIGFS LSSEKESFER KSTLIANKGY KVGDVLENGW IVSEMDQNLV RKVTKVFTFS SDNNDIFLTF DASDRSGNKK ENIRTEKVTI DKTAPVINVE FRPDESKNNY YYSGNRAAQI TVVERNFDAG LIKTSIENKI GRVPTVSFSQ KSNTEHVAVI EFDEGDYVFD ISGTDLGNHT AVVNFSGENE KLFYVDKTKP SIEENFATFT NEATNNSFNT DKTAVIKITE HNFDPQLAGL KVFRKDPGEE HNEEGFVDIT SEILSTSRWV SEGDVHTISF TFSRDAVYKI EINPEDLAGN SAEPRRTVVF EIDKTPPVVK ARNGVLADEN DTAFVDVYPY SRKDDPAPTV EFSDLNIDHI RYNLTVYIPD HTSKEAETVI KPVRVYLDED TDKSGRIKGS IFTLPDFTRD GVYALELTAV DVAGNESLLN LNTYARMVEQ DVLAYIMDSN PEAKTGLYSF QYENGEPISK RPDNFSDIKI CAFAKDDTDV EVVLRDNNGD EIKTDATGTI DDSIYGFNIH NLVLESRFFK ETFQDDTDIE LHLSVKNDGR RVDLGTMHID NIAPTCELPE EFRSWHWYFG EEERTITVSN INELVDENRC KVYDNGREID FKYYSDNNTL TFTLGKGWHN VGIVLVDMAG NVNNIQEIRN IHVGFFWLWV IAAGSAVLIA AIVAAVIHNI RKRRKEEEEN ELAA
|
| |