Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0310 |
Symbol | |
ID | 4808528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 388027 |
End bp | 389073 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105721 |
Product | hypothetical protein |
Protein accession | YP_001036741 |
Protein GI | 125972831 |
COG category | [S] Function unknown |
COG ID | [COG3584] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAATA CAATAGGTAA TATTTTTAAC TGCCCGGATC TCAACAGAGT TCATAGCCGC AGCCGGGAAT ACCTGTCTGA GTATTTCCAG GAAGAGGAAA TCAACAGTCT GAATTATCCC GAAATTCTCG AAGAAGTTCT CATGAATGCA GCACAACTGA AATTAAACTA TCCGGAAATA CTTGAAGAAA TAATAAGGAA AGTTGATTCA GCCGTTTTAT CCTCCAAATC ACCTTCCATT TTGGAGCGGG TCAAATGCTG CAGCCAATCA ATTCTTAGTA AAAAGGATTT GGATTTGAAA GTCGTGGCTG CAGGATCATT GATGACCTTC CTGCTTACCT GCGCCGGTAA CAACATTCTC AATTCAATCA CAGAAAAAAC AGCCAGCCAT GTGCACAATT TCGCAAGTTC CACCTGGGTA TCCCCAGACA GTGCAAAATT GTCAAGCCTC GTCAAAGAAA GTATGCCGGC AAAAACCGAA GGAGCCCTGG CTTCTGTTTC AGCTGTGAAT ACCGCGAACA TTTCCTTTGC TGAAATTAAA AAAAGCATAG AAAGCAAAAA TACTTCAAAA ACCCAAAGCA CACCTGTTTC GAGGTCGAAA CTGACCGCCA AATCAAATTC TGCAACTGAG GCAAAGTCTG CCCAAACCCA AGTCAAACGC ACAAAGGTGG AAATTGCGGA CATCATGCCC CAGTGCAAAA GGATAATCAA TATGACGGCT ACGGCATATG ACCTCTCATA TAAAAGCTGC GGCAAAACCC GTGAACATCC TGCGTATGGA ATTACAGCCT TCGGAACACG CGCCACAGTC GGAAGAACGG TTGCCGTGGA TCCGTCCGTC ATTCCTCTCG GAACCAGGGT GTACATATCT TTTCCCGTGG CATACAGCCA TCTTGACGGA ATATACATAG CCGAAGATAC CGGAAGCCTA ATCAAGGGCA ATAAAATAGA TATTTTCTTT GGCGAAGACA AACCGGGCGA AACGGTAATA TACAACAAAG CCATGAAATT TGGATTGCAG GAAGTAGTGG TATACGTGCT GGACTAA
|
Protein sequence | MGNTIGNIFN CPDLNRVHSR SREYLSEYFQ EEEINSLNYP EILEEVLMNA AQLKLNYPEI LEEIIRKVDS AVLSSKSPSI LERVKCCSQS ILSKKDLDLK VVAAGSLMTF LLTCAGNNIL NSITEKTASH VHNFASSTWV SPDSAKLSSL VKESMPAKTE GALASVSAVN TANISFAEIK KSIESKNTSK TQSTPVSRSK LTAKSNSATE AKSAQTQVKR TKVEIADIMP QCKRIINMTA TAYDLSYKSC GKTREHPAYG ITAFGTRATV GRTVAVDPSV IPLGTRVYIS FPVAYSHLDG IYIAEDTGSL IKGNKIDIFF GEDKPGETVI YNKAMKFGLQ EVVVYVLD
|
| |