Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3199 |
Symbol | |
ID | 4809501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3780749 |
End bp | 3781798 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108633 |
Product | hypothetical protein |
Protein accession | YP_001039587 |
Protein GI | 125975677 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0020334 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTCGG CAAGAAAAAT ATTTACCTGT ATAATACTGT CTCTTTTGTT AGTTTCAATT CTTTATTTCG TGGTTGTCCA TAGAGATAAA ATAATAAAGA TAATTTTCCC GGTTTTGCTG GGGGCTGTCA TTGCATACAT ACTTCGGCCC ATAGTTTTAA AACTTGAATC CAAAAACATA TCCAGAACAA AAAGCATAAT TATGATTTAT CTGGTCTTTG GGATTTTACT GACAACGGCC GTAATATTTA TTGCTCCGAT TTTTGTGGAC AACACCCGGG AGTTTATAAA CACCGTTCCG GAAATAACCA CAGAATACGG GGAAAAATTT AATAAAATAT TGAAAATGAT AGATACCAGC GGCTGGTCGG CGGAAGTGAA AAATGCAATT TACAATGAGA TAAACAACGG AGTGAATATA GCGGAAAACA TGCTGATGGA CGCCCTGAGG AAAACCCTCG TGTGGCTTTT TAAATCGTTA ACGGGCATAT CAAACATTAT ACTGGGAATG ATTATTGCCT ACTACATTAT GAAGGATGCC GAGTTTTTTA AAAAGGGAGC CCTTTCTTTG GTTCCGCGAA GGTGGAGAAA TGAAATTATC GGCACATGCC GGGAGATAAA TGAAATACTG TCCTGCTTTA TACAGGGACA GCTCCTGACG GCTCTCATTA TCGGTATCAT GGAGACCGTA GCCCTTGCAA TTATAGGGGT AAAATACTCG CCGATTTTGG GGTTTATAGG TGGAATTTCA AATATAATAC CCTACTTTGG GCCTTTTATA GGGGCAATCC CTTCTGTGGC CGTGGCCCTT ATAGATTCGC CGGTGAAAGC TTTCTGGACG GTGGTTGCCT TTTTGGTTGT CCAGCAGATA GACAACGCTT TTATTTCGCC TAAGATTATT GAAGGAAGGC TTGGGCTTCA TCCCATTACA ACAATTCTTG CGGTTTTGGC CGGCGGTGAG TTTTTCGGCA TAATAGGCAT GTTGGTGGCC GTTCCGGTGA CGGCTGTTTT AAAAGTGATA CTAAAGAGGC TCATTGAGGC TATTGTGTAG
|
Protein sequence | MVSARKIFTC IILSLLLVSI LYFVVVHRDK IIKIIFPVLL GAVIAYILRP IVLKLESKNI SRTKSIIMIY LVFGILLTTA VIFIAPIFVD NTREFINTVP EITTEYGEKF NKILKMIDTS GWSAEVKNAI YNEINNGVNI AENMLMDALR KTLVWLFKSL TGISNIILGM IIAYYIMKDA EFFKKGALSL VPRRWRNEII GTCREINEIL SCFIQGQLLT ALIIGIMETV ALAIIGVKYS PILGFIGGIS NIIPYFGPFI GAIPSVAVAL IDSPVKAFWT VVAFLVVQQI DNAFISPKII EGRLGLHPIT TILAVLAGGE FFGIIGMLVA VPVTAVLKVI LKRLIEAIV
|
| |