Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0114 |
Symbol | |
ID | 4808740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 145686 |
End bp | 147005 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105525 |
Product | hypothetical protein |
Protein accession | YP_001036548 |
Protein GI | 125972638 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATG ATAATTGGTT TAGAAAAAAA ATTGCAGGAT ACCGATGGTG TCTTTTGATC GTTTTCGGGA TTATACTGAT TAGTGCGGGC ATGCTTTTGG CATACAGACA TGACAATACT TTTGACATCT TTTGTTCTGT ACTTCTTTTT GCGTCAGGAT GTATATTTAT TGTTGTTTCG ATACGGCTTA TTGCAATTGA CACGGTGTCA AAATACGCCC AAAACGGTAT AAATGTAGTA TGCTGCAAAG ACAGGGACGA TAATTTGTAT GAAAAAGAAT TTCTTGACAA AGGGCCGAAA ATAGTTGCAA TAGGTGGAGG AACCGGACTT TCAACCATGC TGAGAGGGCT TAAGGAATGC AGCTCGAATA TAACGGCCGT GGTTACGGTT GCCGATGATG GGGGAGGCTC CGGGATTCTA AGACAGGACC TTGGGATACT TCCTCCCGGG GATATCAGAA ACTGTATTTT GGCCCTTGCC AATACCGAGC CTATTATGGA AAAACTGCTT CAGTACAGAT TCCAGGACGG AATGCTGAAA GGACAGAGTT TTGGAAATCT GTTTCTTGCA GCAATGGATG GTATTTCTTC GAGTTTTGAA CAGGCTGTCC AAAGAATGAG TGATGTACTT GCAGTAAAAG GGAGAGTTCT TCCGGTTACG CTTGAGGACA TTCAGCTGTG TGCGGAGCTG GAAGACGGAT ATGTTATCAC CGGAGAGTCA CAAATTGGCA ATCATAACAG CTTTCACCGT TGCGCAATCA AGAGGGTGTA TTTGGAACCC GGAAAGGTAA AACCCCTGGA TGAGGTGATA GAAGCAATCG GAGAAGCGGA TGTAATTGTG TTGGGGCCGG GAAGTCTCTT TACAAGTATA ATTCCCAACC TTTTGGTTGA CGGTGTGTGT GATGCGATAA AAAAATCAAA GGCTTTGAAA ATATATGTGT GCAATGTCAT GACCCAGCCG GGGGAAACCG ACGGATATAG CGTTTCGGAT CATATAAAGG CACTTGAAAG GCACTCTTTT GAGGGAATTG TTGATTACTG CATTTTTAAC ACTGCTGATA TACCTGAACT ATTGAAAAAG AAGTATAGTG AGGACGGAGC ACAAATTGTC AGAGTTGACT ATGATGAGTT GGATAAATTA GGCATAAAAT TGCTGGGAGG GGACTTTGTC TGCATAACTA ACGGATATAT AAGACATGAT ACAAAGAAAT TGGCTCAGGC CATCATGAAC CTTGTTATTG AGAATGTATT TGGAAAAGAC GACAGAAAAT CATCCGGTTA TGTAAATACA ATGAAGCAAT TTAAAAATAT AGTCGGATAA
|
Protein sequence | MKNDNWFRKK IAGYRWCLLI VFGIILISAG MLLAYRHDNT FDIFCSVLLF ASGCIFIVVS IRLIAIDTVS KYAQNGINVV CCKDRDDNLY EKEFLDKGPK IVAIGGGTGL STMLRGLKEC SSNITAVVTV ADDGGGSGIL RQDLGILPPG DIRNCILALA NTEPIMEKLL QYRFQDGMLK GQSFGNLFLA AMDGISSSFE QAVQRMSDVL AVKGRVLPVT LEDIQLCAEL EDGYVITGES QIGNHNSFHR CAIKRVYLEP GKVKPLDEVI EAIGEADVIV LGPGSLFTSI IPNLLVDGVC DAIKKSKALK IYVCNVMTQP GETDGYSVSD HIKALERHSF EGIVDYCIFN TADIPELLKK KYSEDGAQIV RVDYDELDKL GIKLLGGDFV CITNGYIRHD TKKLAQAIMN LVIENVFGKD DRKSSGYVNT MKQFKNIVG
|
| |