Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1168 |
Symbol | |
ID | 4810120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1392636 |
End bp | 1394024 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106590 |
Product | hypothetical protein |
Protein accession | YP_001037593 |
Protein GI | 125973683 |
COG category | [S] Function unknown |
COG ID | [COG2078] Uncharacterized conserved protein [COG3885] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00296] uncharacterized protein, PH0010 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.773328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAGAA TAATAAGTTC TTATATTTTT CCTCATCCTC CTTTGATTGT ACCTGAGATT GGCAAGGGAG ATGAGAAGGG CGCAATTAAA ACTATAGAGG CATGTGAAAA AGCGGCTGAA CAAATAAGAA AAGAGAAGCC TTCAACCATT ATTCTTACGA CTTCCCACGC GCCTTTGTTT GAGGATTATA TTTTCATTAA TGACCATAAA ACGCTGAAAG GCAACTTTTC AAGATTTGGA GCCCGTAAGG TGGAGCTTGG TTTTGAGAAT AATTTAAAAA TGGTGGAGTC AATTATTGAG TTTGCGAAAA AAGAAGGCTT TGATGCCGGA GGAATCAGCG AAGGTATAGG CAGAAGGTAC GGGATTTCCG GGGAACTGGA CCATGGAGCG CTGGTGCCTC TTTATTATAT AAGCCGGGTG TATTCGGATT TTAAACTTGT TCATGTTGCA ATGTCCACAC TTACTTTGGA GGAACATTAC AAGTTTGGTA TGTGCATAGG CGAAGCCGTC AGAAATTCAG ATGAAGACGT GGTATTTGTC GCAAGTGGAG ATTTGGCACA CCGCCTTACC AGTGACGGAC CCTATGGCTA CAACAAGCAT GCCCCGGAAT TTGATGAGCT TCTGGTTAAA AGCATCGAAA AGGACGATAT TGACAGGATT CTTGATATAG ATGACAAGCT TCGGGATGAA GCCGCAGAGT GCGGATTAAG ATCCTTTGTA ATAATGCTGG GAGCTTTGGA CGGATACAGT GTGGTTCCTG AAGTTTACTC TTATGAAGGT CCTTTTGGAG TGGGATATAT GGTGGCAAGA ATCGGAGTCG GAGCTATGGA TTCTTCCCGA AGGATAATTG AAAACAGGAG AAACAAAAGA AAAAAGAGTA CCGATCCGTA TGTTTCTCTT GCCAAAAGAG CCCTGGAGGC TTATGTAACG GAAGGCAGGG TTTTGGATGA TTACAGCGGT CTTCCGGAGG AGATGCTGAA TAGTAGAGCC GGAACTTTTG TTTCAATAAA GAAAAAGGGT GAACTTAGGG GCTGTATCGG TACTATCGGG CCGACAAGGA AAAATATAGC AAGTGAGATA GTTCATAATG CAATAAGCGC GGGTACTTCC GATCCCCGGT TCTATCCTGT GAAGCCCTAT GAGCTGGATG AGCTTGAATA TTCCGTTGAT GTTTTAATGG AGCCCGAAGA GATTAATTCC ATGGATGAAC TGGATGTAGT AAAATATGGG GTGATTGTAA GAGCCGGAAG AAGGACGGGC CTTTTGCTTC CAAACCTTGA AAACGTTAAT ACTGTAGAGC AGCAGGTATC AATTGCGCTT CAAAAGGCAG GCATAAGTCC AAACGAAAAA TACACAATGG AAAGGTTTGA GGTTATAAGG CACAAATGA
|
Protein sequence | MGRIISSYIF PHPPLIVPEI GKGDEKGAIK TIEACEKAAE QIRKEKPSTI ILTTSHAPLF EDYIFINDHK TLKGNFSRFG ARKVELGFEN NLKMVESIIE FAKKEGFDAG GISEGIGRRY GISGELDHGA LVPLYYISRV YSDFKLVHVA MSTLTLEEHY KFGMCIGEAV RNSDEDVVFV ASGDLAHRLT SDGPYGYNKH APEFDELLVK SIEKDDIDRI LDIDDKLRDE AAECGLRSFV IMLGALDGYS VVPEVYSYEG PFGVGYMVAR IGVGAMDSSR RIIENRRNKR KKSTDPYVSL AKRALEAYVT EGRVLDDYSG LPEEMLNSRA GTFVSIKKKG ELRGCIGTIG PTRKNIASEI VHNAISAGTS DPRFYPVKPY ELDELEYSVD VLMEPEEINS MDELDVVKYG VIVRAGRRTG LLLPNLENVN TVEQQVSIAL QKAGISPNEK YTMERFEVIR HK
|
| |