Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2504 |
Symbol | |
ID | 4809443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2970199 |
End bp | 2971488 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107920 |
Product | hypothetical protein |
Protein accession | YP_001038899 |
Protein GI | 125974989 |
COG category | [S] Function unknown |
COG ID | [COG3584] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAATCA ACATTAATTC AGTGTTTGAA GAATTAAAAA ATCGGGAAAT TCTTACCCAA AAAAATGTAT CCGACATAAA GAACTATATT TCAGTCCGTT ATCCAAATCA TTCCCCGCAA AAAAAAGCCG CAATTTTTGC AGATGCGGTA AACAAGATAA TTAACAAAAA CATATCCTCA ATAAGCATAA CTTACAGAGA CGGAATAAGA AAGACCCTCC TTAGGGAAAC AGTAAAAAAA ACCCCTTTTG CAATTAATGC AAATGACGTT TATCATGCAT GTGTGAACAG TGGCTTTAAA GAAGACAACT TCATAAATGA AGTTTCCCAA TGGTGCGGGA ATATATTGGA AAACCCTCAC ATAAAAGATG CTGTCAAAGA TTACACATTA AAATTAATCA GTCTCGCAGA AACTCCCTCT ACAATAGATG ATACAGTAAA TAATTCAATA AACGATACGA AATGTGAACC CTTAAGTCAT GACAGGGACA CAGAACTTTT GCCGGAAAAT GTATCCGGGC AAGATGTTAA AATGCCAGAT GCTGAAGTAA TCGAAGAAAT TTCCGTTTCA GATGACACTC CTGTTGAGTC TGAGTATGTA AGCAGCATCG AACTTGATGC TGCAAGCAAT ATACCTGATA CAGGCAGTAC ACTCGGTACA CATGAAACCG GCAGTATAAA AAGCGTCCCT GGTATAAAAA AAATTGTGAT TGCTGCCTCG GCATTAGTTT TGATTATTAC ATTTTTCTCC GTTGCTTTGT CCGGCAAACT GAAAAAAACG GCCGGCACCG TACAGCCAAG TGCCCCGGAA GCAACTCCTT CCCCTTCATA TGTCAGCATT TTCAATAGCC TGAACACCCC GGACGACTTA TATCTGAAAA AAGCCATCTT ATCCAACCTG GGCTACTCCA TGGGAATTCA CAGAAATGTT CCCAAACATA AAAAAGTAAT GTATCTTACG GCCACAGCGT ACGACCTTTC TTATGAAAGC TGCGGCAAAA CCCGGGACCA CCCGGAGTAC GGAATTACAT ACACCGGTAC AAGGGCCAAA TTGGGAAGAA CGGTTGCCGT TGACCCTTCG GTAATTCCCC TGGGAAGTGA AATGTACATA ATATTTCCTG AAGAATACAG CCACTTAAAC GGAGTTTATA TTGCGGAAGA CACCGGTTCT CTTATCAAAG GCAATAAAAT TGATATATTT TTCGGAGAAG ACAAACCTGG CGAATCCATA GTAAATGAAT CCGCCATGAA ATTTGGAGTA CGAAAAGTGT ATGTGTACAT ATTGAATTGA
|
Protein sequence | MEININSVFE ELKNREILTQ KNVSDIKNYI SVRYPNHSPQ KKAAIFADAV NKIINKNISS ISITYRDGIR KTLLRETVKK TPFAINANDV YHACVNSGFK EDNFINEVSQ WCGNILENPH IKDAVKDYTL KLISLAETPS TIDDTVNNSI NDTKCEPLSH DRDTELLPEN VSGQDVKMPD AEVIEEISVS DDTPVESEYV SSIELDAASN IPDTGSTLGT HETGSIKSVP GIKKIVIAAS ALVLIITFFS VALSGKLKKT AGTVQPSAPE ATPSPSYVSI FNSLNTPDDL YLKKAILSNL GYSMGIHRNV PKHKKVMYLT ATAYDLSYES CGKTRDHPEY GITYTGTRAK LGRTVAVDPS VIPLGSEMYI IFPEEYSHLN GVYIAEDTGS LIKGNKIDIF FGEDKPGESI VNESAMKFGV RKVYVYILN
|
| |