Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0561 |
Symbol | |
ID | 4808236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 688931 |
End bp | 689977 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105975 |
Product | ApbE-like lipoprotein |
Protein accession | YP_001036990 |
Protein GI | 125973080 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAA GATTTACGGT AATTATTTTA AGTATAGTTT TGTGCACAAG TGTTTTGTTT GTATCATGCG GCTATAATTC TTCCGATTTG TATGAAACTC AGGAATTTTT AATGGGGACT GTTGTTTTAC AGAAAATATA TCATGAAAAT GCCGCTGAAA TTGCAAAAGA GGTAAATGAC AGAATAGCCG AAATTGAATC GACCATGACA ATAAACAAGC CCGGTGGGGA AATAAATCTT TTAAACGACG CAGCGGGAAA AGAATATGTA AAACTTGGCG AGGATACTCT GTATGTGCTT GACAAAGCAA AACAATATGC AGAGATTAGC AATGGAGCCT TTGACGTTAC TATAGGTCCT TTGGTAAAAG CGTGGGGTGT TTTTACAGAC AATCCGAGGG TTCCATCGAA AAATGAAATT GATGAGCTTT TAAAACTGGT AAATTATAAA GATATAAATA TTGACTTTGA AAATTCAACG GCTATGCTGG CAAAAGAAGG ACAAATTGTG GATCTTGGCG GAATTGCAAA GGGATTTGCC GCGGATGAAG CGGTTGAAAT ATACAAAGAA CACGGTGTAA AATCTGCGTT GATAAGCCTT GGAGGCAACA TTTTTACATT GAGCGGCAAA CCTGACGGAA GTCCCTGGAT GGTGGGCATA AGAAATCCCA GAGGTAACGA TGGTTCGTAT ATCGGGATTG TTAGGGTGAA AGACAAAGCG GTAGTCAGTT CCGGTGACTA TGAGAGGTTT TTTGAAAAAG ACGGTGTGAG ATATCACCAT ATTTTGGACC CCAAGACCGG CTATCCTGCT GATACGGGAC TTATTGGGAC CACTATTATT TCGGACTTTT CAATTGATGC CGATGCTCTT TCGACAGCGG TTTTTGTGCT GGGTCTTGAG GAAGGCATGA AACTTGTTGA AAGCCTTGAT GGGGTGGATG CGGTGTTTAT TACCGCGGAT AAGAAAATAT ATGTAACGGA CGGATTGAAG GATACATTCA TATTTAAGGA TGAAAGCAAG GAATTTGAAT ATGTTGAAAA AAGGTGA
|
Protein sequence | MTKRFTVIIL SIVLCTSVLF VSCGYNSSDL YETQEFLMGT VVLQKIYHEN AAEIAKEVND RIAEIESTMT INKPGGEINL LNDAAGKEYV KLGEDTLYVL DKAKQYAEIS NGAFDVTIGP LVKAWGVFTD NPRVPSKNEI DELLKLVNYK DINIDFENST AMLAKEGQIV DLGGIAKGFA ADEAVEIYKE HGVKSALISL GGNIFTLSGK PDGSPWMVGI RNPRGNDGSY IGIVRVKDKA VVSSGDYERF FEKDGVRYHH ILDPKTGYPA DTGLIGTTII SDFSIDADAL STAVFVLGLE EGMKLVESLD GVDAVFITAD KKIYVTDGLK DTFIFKDESK EFEYVEKR
|
| |