Gene Information |
Locus tag | Cthe_0892 |
Symbol | |
ID | 4810511 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1068243 |
End bp | 1069349 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106309 |
Product | hypothetical protein |
Protein accession | YP_001037319 |
Protein GI | 125973409 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
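
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1068243, 1069349)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the Gene Length (1107 bp) and Protein Length (368 aa) fields.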
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0194369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTA AAGCAAAGGA CATAATAAAA TACATGGAAG AACTGGCCCC TGTCAGTCTT GCGGAGGATT ATGACAATGT CGGATTGCTG ATTGGAAGCC GGGAGAGTAC TGTTGAAAGA ATTTTTGTGT GTCTCGATGT GACTTCGAAA ACTGTGGATG AGGCTGTGGC AAAAAAAGCT GATTTGATTG TTTCCCATCA TCCTGTTATA TTTAAAGGCC TTAAAAGAAT AAATGAGGAC GATCCAAAGG GGAATATCAT TTACAAATTG ATAAGAAACA ATATCGGGGT GTACAGTGCC CATACCAATC TGGATGTTGC CCATGGAGGA GTTAACAATT ATCTTTCCTC GATTTTGGGG CTTAAGGATA TAATTAGTCT CAAAGATTAC AAGGCTGAAA AACTTTACAA GGTTGTTGTA TTTGTACCTC ATGAGAGCGT GGATGCGGTC AGAGATGCAA TGAGCCGGGC CGGAGCCGGA TGGATAGGCA ATTACAGCGA CTGCTCCTTT ATGACGGCGG GAACAGGAAC ATTCAGGCCT TTGGAGGGGA CAAACCCTTA TATAGGCACA ACAGGCAACT TGGAGAAGGT TGATGAATAC AGGATTGAAA CGGTGGTAAG TCAAAGGAAC CTTAAAAAGG TTATAGAGGC CATGATAAAG GTTCATCCTT ATGAGGAAGT GGCTTATGAC GTTTATCCTC TTGAAATAAA AGGCAGGCAG TATGGCATGG GAAATGTGGG AGTGCTTGAC AAACCTAAGA GCCTTGATGA GTTTATAGCA GTTGTAAAAG AAAAGCTTGG CGTAAAAAAC GTAAGAGTAA TTGGCGAAAC TAACAAAGAA ATTGAAAAAG TGGCCGTATT CTGCGGAAGT TTCGACAGGG ATGTAATGGA AGCTGCAAAA TCAAAAGCGG ATGTATTGGT CACCGGAGAC GTAAAATATC ACGATGCCGT AGATATGTTG GAAATAGGAA TGTGTGTTAT AGATGCCGGA CATTTTAATA CGGAAAGGAT TATTGCCGAC AGGCTTGCAC AACTGATAAA AGAGAATTTT CCAGAAGTTG AGGTTATAAA AAGCAATATG GAAGAAGACC CATTTAAATT TTATTGA
|
Protein sequence | MSIKAKDIIK YMEELAPVSL AEDYDNVGLL IGSRESTVER IFVCLDVTSK TVDEAVAKKA DLIVSHHPVI FKGLKRINED DPKGNIIYKL IRNNIGVYSA HTNLDVAHGG VNNYLSSILG LKDIISLKDY KAEKLYKVVV FVPHESVDAV RDAMSRAGAG WIGNYSDCSF MTAGTGTFRP LEGTNPYIGT TGNLEKVDEY RIETVVSQRN LKKVIEAMIK VHPYEEVAYD VYPLEIKGRQ YGMGNVGVLD KPKSLDEFIA VVKEKLGVKN VRVIGETNKE IEKVAVFCGS FDRDVMEAAK SKADVLVTGD VKYHDAVDML EIGMCVIDAG HFNTERIIAD RLAQLIKENF PEVEVIKSNM EEDPFKFY
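
The relationship between the two sequences can be sketched as follows. The gene sequence as displayed begins with ATG and ends with TGA, so this example assumes it is already written in the coding orientation even though the gene lies on the minus strand. The codon assignments of translation table 11 match the standard code; the sketch ignores the alternative start codons of table 11, which does not matter here because the gene starts with ATG. All helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-residue assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAGTATTA AAGCAAAGGA CATAATAAAA"
print(translate(first_blocks))  # MSIKAKDIIK
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the Protein sequence and GC content fields to be checked against the nucleotide data.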