Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2055 |
Symbol | |
ID | 4810651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2444007 |
End bp | 2445926 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107460 |
Product | hypothetical protein |
Protein accession | YP_001038455 |
Protein GI | 125974545 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAGTG TGCTTTCGGT TTTAAGTCCG GAAAACATTC AAAAGAATGA ATATGAAGAA ATAATAAGAG ACATTCGCAG TCAATTGCAA TCTCTTTCTG ACAGCCGAAG ATTCAGACAG TTGCTGGACA AATACGCGGA CAAGCCTTAT TATTTTAAAA GAAACGGAAA TTATCTGGAA AATAAAAGAT GTACTCCTAC GATAAGAGAT GCGGCTTGTT TTTATGTATT AATCAAATAT CTTGAATATT ATAAATTGAA AAACAATCCC GACGATTTGG AGAAAATTTT CTCAAACAGC AGTTTTGATG CCGGAAAGTA TTACATGGAA TTCTTTGGCG ATGACTGGGA CGAAGAAAAG CAGTTCATTT ATAGTTTCAT ATGCAATGAA CCCGATATGA ATTCAAATTT GTTGTCTGAT ATAGAAATTG ATATAGTTTC GGGTTCTGTT TACAAGATTA AGAAGTATTT TCTTGAAAAT TCCAGAATAA AAGATATCCG GGGAGGAAGT GCATTAATTA AATATGTAAA TGAAGATGTC ACGTTTGATT ATCTGAGAGA CAATTATACG GAGGAATGCG CTGTATACTG CGGAGGAGGA AATGTGCTGA TAATAGCACC CGGCGGTGCC GGAGAAAAGA TATGCTCGGC TCTTGAAGAA AAATATACAA GAATTACTCT TACGGCACAG AATGCCTTTG AATACGTCAG TACAAACCTT AATACTTTCA TTAAGCATTA TAAAAACATA ATGGGAGATT TAAACCAGAA GCTTGATGCC CGGAAAAAAC TGAAGATTTA CAGTATAAAT CCTGACGGCA GGCTTGAAAC GATAGAAATG GGAAAAGAGA AAATAAGTTT TGACGATGTT GAGGAAATAA AACAAAGAGG AACAGTATGT TACCTTTGTG GTGTTCGGGA CGGAAGATAC AAAATTAAAA TGCTTGATGT GGAAACAGCG ATGGTTTGCC TCTCATGTTT AAAAAAGCAC AAGGTTGGGA AAGACAAGAC GGTGTTTTAC GATGAGTATG AGGAATTTAC CGGTTTCGCT GTAAAAAAGA AGATCGACAG TATTATTGAG CTCGAAGATG AAAACGGGCA TATAGCTGTT ATATATGCAG ATGGAAATAA CATGGGAAAT GTGGTTAAAA ATATTGAAAC TCCTTTTCAG CATATGTACT TCAGCCGTGC TTTGGACAGA ATTACAAAAA GATGTGTTTA TCAATCTATA AATGAAGTGA TGGGAAATGA TGCAATGTTT GAGGCAATAG CTCTGGGCGG TGATGATATA TTTATTATTG TCCCGGGCAA CAGAAGTTTG GAAATCACAA ACAAAATTAT TGAAAAGTTC GACGGCGCTT TTGAAAATAA AATGACCATG TCTGCCGGAA TATGTATTGC CAAATCCAGC ACTCCGATAA GGACTTTGTT TGAAATTGCA CAGTATATGC TTAAAAGTGC AAAAAGGTAT TCAAGGAAAA ACAACAGTTC TGAAGGAACA GTGGATGTTC AGTTTATTCG CAGCAATGTC GGTGTCGATT TGCTGGAATC CGAAAGCAGT TTGTTCCCTG CTGCAAATTC TGAACTTTCA GCCTACCTGG ATATTATCAG AAGGCTTAAA AACGACGTAA ATATAAAAAC TGCCCAATTA TACAAATTCA GTAATGCATG GCGCATATTA AAAAATCCAA TGGAATTCCA GTTATTTTAT CTTTACCAGA CAGGCAGGCT ATCTTGCAAA TATAATGACT ATGCCATGGA ATTCCTTGGC AATATGAAAA ATGTTGACAA GGATGCTTAT TGTTATTGCG GACTTGTAAA GAAAAAGCCG GGTTATGCGG GTTATGATTC TGTAAAAGGA AACGATTATG TATCTTTGTG GGATGACGTC ATCCTTTTAA TGGATGCGGT AGGGAGGTGA
|
Protein sequence | MDSVLSVLSP ENIQKNEYEE IIRDIRSQLQ SLSDSRRFRQ LLDKYADKPY YFKRNGNYLE NKRCTPTIRD AACFYVLIKY LEYYKLKNNP DDLEKIFSNS SFDAGKYYME FFGDDWDEEK QFIYSFICNE PDMNSNLLSD IEIDIVSGSV YKIKKYFLEN SRIKDIRGGS ALIKYVNEDV TFDYLRDNYT EECAVYCGGG NVLIIAPGGA GEKICSALEE KYTRITLTAQ NAFEYVSTNL NTFIKHYKNI MGDLNQKLDA RKKLKIYSIN PDGRLETIEM GKEKISFDDV EEIKQRGTVC YLCGVRDGRY KIKMLDVETA MVCLSCLKKH KVGKDKTVFY DEYEEFTGFA VKKKIDSIIE LEDENGHIAV IYADGNNMGN VVKNIETPFQ HMYFSRALDR ITKRCVYQSI NEVMGNDAMF EAIALGGDDI FIIVPGNRSL EITNKIIEKF DGAFENKMTM SAGICIAKSS TPIRTLFEIA QYMLKSAKRY SRKNNSSEGT VDVQFIRSNV GVDLLESESS LFPAANSELS AYLDIIRRLK NDVNIKTAQL YKFSNAWRIL KNPMEFQLFY LYQTGRLSCK YNDYAMEFLG NMKNVDKDAY CYCGLVKKKP GYAGYDSVKG NDYVSLWDDV ILLMDAVGR
|
| |