Gene Information |
Locus tag | Cthe_0296 |
Symbol | |
ID | 4808514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 370827 |
End bp | 372302 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105707 |
Product | hypothetical protein |
Protein accession | YP_001036727 |
Protein GI | 125972817 |
COG category | |
COG ID | |
TIGRFAM ID | |
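The length fields in this record can be cross-checked against the coordinates. The sketch below is only a consistency check on the values listed above. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 370827
end_bp = 372302

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1476 491
```

Both results agree with the Gene Length (1476 bp) and Protein Length (491 aa) fields.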
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000491065 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAATA CTTTTAGATG GACATTGAAC AAACTGGTTG TTACAGGAGC ATGTGCGTAT GTTCTTTTAA TTCTGCTGTT TATGTTAAAT TCCGGTTTGG TTCGGGTGTC CGGGAATGTT TTGCTGAATA AGTCTTTGGA TGTGTTGTTT TTCACGGTGG CGGCAGTTGT AGGTTACGGA GTGCTACTGC TCTTTGAAAA AGTTTTAAAT TTGCGTATGT TGAATAAGGA GTATGCAAGG ATACTGATTG TGGTTTTAAT GACCCTGATT ACGAGGCTTG TTTGGATACA TATTGTAGAT ATTACTCCCA AAAGTGATTT TGAGCTGTAC AATACTTTGG CGGAGGCATT CTCGCGGGGA GAGGCCGCCG GAGGCAAGTA TGTGGCTCTT TTTCCCCATA CATTTGGTTA TCCCTTTATA CTGGCAACGG TATATCGAAT TTTTTCTCCT GACAAGTATT TTGCTTTGCT TTTAAATATT TTGTTTGAAG CAGGCACGGG TGTTGTTTTA TATTATCTTG GGAAGATGGT CTCAAACTGG AAAACAGGTT TCTTTGCAGG CATTATATGT GCTTTATGGC CTTCCCATGT GTTTTATTCT TCAATTGTTT GCACCGAGCC GCTGTATACA TTATTAATGG CGCTTTTGAT ATTTGTATAT TTTAAGGTGT CAGTCAAAAA TAAAAGCTTA TTGCATTCCT GCGTTTTGTA TCTTTTGCTC GGGTTTTTAT GTGCGGCGGC CAATGCGATC CGTCCCATGG GTACTCTGCT TGTTGCAGTT CTGGGAATAA CTGAGGTTGT GAGAATAATT AAGAAAAAGG AGGGGTTAAA ACAAAGTTTT GCAGGATTTG TGCCCTTTGC CGTATTCTTA ATAGCATATT TCTCTTTTGT TAATTTGACA GGCATGTATG TTTCCTATAA AATTGGCTAC AATACGGCTA AAAATCCCAT AGGTTTTAAT ACTTATGTCG GCGCCAATAT TAATTCCAGC GGGATGTGGA ACCAGAGTGA TGCCAATGTG CTAATGGATT TTATGAAGCA GGAGCCTTTT GATGCCCAAA AAATACACGA ACAGTTGCTC AATCTGGCAA TTCAAAGGGT GAAAAGCCAG GGAACGGGTA ATTTGAAGCT TGTAATAAAA AAGAATATGA TCATGTGGGG CAGGGACGAT GAAGTTGTAA CCTATATGAT TGCCGGAAGC GGTGATAAAA CCTCATCGTT ATTGGAAGTA AAGAACTCTG AAGGTCTTTT GAGGTATATT TGCAACTTCT ATTACTACAT GATAGTTATA TTGGCATTTG GTGGCCTTTT GAAGCAGTGT GCTAAAGAGG ATAATCCAAT CCTTATGGCT TTACTGCTGC TGTTTCTTGG AATTGGTGCT ATACATACCG TTGTTGAGGT ACATGGCAGG TATCATTATT CATCTATGGC TGTTTTTGCC ATACTTGCCG GAATAAACAA TTTTAAAATT AAATAG
Protein sequence | MKNTFRWTLN KLVVTGACAY VLLILLFMLN SGLVRVSGNV LLNKSLDVLF FTVAAVVGYG VLLLFEKVLN LRMLNKEYAR ILIVVLMTLI TRLVWIHIVD ITPKSDFELY NTLAEAFSRG EAAGGKYVAL FPHTFGYPFI LATVYRIFSP DKYFALLLNI LFEAGTGVVL YYLGKMVSNW KTGFFAGIIC ALWPSHVFYS SIVCTEPLYT LLMALLIFVY FKVSVKNKSL LHSCVLYLLL GFLCAAANAI RPMGTLLVAV LGITEVVRII KKKEGLKQSF AGFVPFAVFL IAYFSFVNLT GMYVSYKIGY NTAKNPIGFN TYVGANINSS GMWNQSDANV LMDFMKQEPF DAQKIHEQLL NLAIQRVKSQ GTGNLKLVIK KNMIMWGRDD EVVTYMIAGS GDKTSSLLEV KNSEGLLRYI CNFYYYMIVI LAFGGLLKQC AKEDNPILMA LLLLFLGIGA IHTVVEVHGR YHYSSMAVFA ILAGINNFKI K
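The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG can serve as a start codon. The sketch below illustrates this on the first and last few codons of the gene sequence. It is a minimal, dependency-free illustration and not the method used to produce this record: the helper name `translate` and the set `START_CODONS` are hypothetical, and the start-codon set is assumed from the NCBI definition of table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna):
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("TTGAAAAATA CTTTTAGATG GACATTGAAC"))  # MKNTFRWTLN
print(translate("AAAATTAAATAG"))  # KIK
```

The two outputs match the first ten and last three residues of the protein sequence above. Passing the full gene sequence to the same function would be a way to check the remaining residues.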