Gene Information |
Locus tag | Cthe_2273 |
Symbol | |
ID | 4809862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2702860 |
End bp | 2703981 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107679 |
Product | hypothetical protein |
Protein accession | YP_001038668 |
Protein GI | 125974758 |
COG category | [S] Function unknown |
COG ID | [COG3581] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
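The start, end, gene length, and protein length fields above are linked by simple arithmetic. The sketch below checks them under two assumptions that the reported values are consistent with but that the record does not state: the coordinates are 1-based and inclusive, and the CDS length includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2702860, 2703981)
print(length)                     # 1122
print(protein_length_aa(length))  # 373
```

Both results match the Gene Length (1122 bp) and Protein Length (373 aa) rows of the record.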
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTA CATTTCCACA TATGGGAGAC ACATATATAC CTGTAAAAGT ACTGCTGGAA ACCGCGGGGA TTGATTATGT CATGCCACCG GTTTCCGACA GAAGTCTGTT AGAACAGGGG ATACTGCACT CGCCTGAATT TGCCTGTCTT CCTTTCAAAA CAATAATGGG TGATTTTATT TACGGAATTG AACATGGAGC GGACTGGATT CTTTTCGGCG GTGGCTGTGG CCAGTGCAGA TTCGGCTATT TTGGAAAGCT TCAGGCCGAA ATATTAAAAA GCATTGGGTA TGATGTAAAT TTTATATATA TTGATCTTAG CAATATTTCC GTGAAAGAAG TGCTGGAGAA AATAAGGCCT CTTACAGAAG GAAAGAGTAT TTTTGAGCTT TTAAAGGCAA TATTTTATGC CGTTAAAACC GTTTTTGCCG TTGACAGGAT AAACGAACTG GCAAGATTCA CAAGGTGCCG GGAGATAAAC AAGGGAGAAA CGGACAGAAT AATGACTGAA TTTCACAATG AAATCCAAAA AGCCAGGGGG TATAAAAGCA TAAACAAAAT AATTCATTCC ACCGCCAAAA AACTGCGGAA GATGCCTTTG GACAAAAAAT ACAGGCCAAT CAGGGTTTCC ATTGTGGGTG AAATATATAT TGCCGCCTAT CCCGGCATTA ATTTTGAGAT AGAAAGAAAG CTTGGCAACA TGGGTGTGGA AGTGCATAAC ACCATGAGCA TGAGCTTTTG GATAAAAGAA CATTTTATAA AGAAGCTTCT CCCCTTCAAA ATAAAAAACA AAAACCATGA AGCCGGAAAG GAATTTATGA ATACTGACGA TATCGGCGGT CATGGCCTCA GCTCCATAGG TGCCTCCATA AGAAGTGCCA AAAAGGGATT TGACGGCGTT GTCCATATAT ATCCCTTCAC CTGCATGCCT GAAATAATTG CTCAAAGCAC CTTTAGCGAA GTGCAAAAGA AATACGGTAT ACCCATTATT ACACTGATAA TTGATGAAAT GACCGGTGAA GCAGGTTATA TGACAAGGCT TGAGGCATTT GTGGATATGA TTAAAATGAG AAGGAAGCCA TCTTACTTCC CTATGCCCAG ATTTTTTTCG CAAAAAATTT AA
|
Protein sequence | MKITFPHMGD TYIPVKVLLE TAGIDYVMPP VSDRSLLEQG ILHSPEFACL PFKTIMGDFI YGIEHGADWI LFGGGCGQCR FGYFGKLQAE ILKSIGYDVN FIYIDLSNIS VKEVLEKIRP LTEGKSIFEL LKAIFYAVKT VFAVDRINEL ARFTRCREIN KGETDRIMTE FHNEIQKARG YKSINKIIHS TAKKLRKMPL DKKYRPIRVS IVGEIYIAAY PGINFEIERK LGNMGVEVHN TMSMSFWIKE HFIKKLLPFK IKNKNHEAGK EFMNTDDIGG HGLSSIGASI RSAKKGFDGV VHIYPFTCMP EIIAQSTFSE VQKKYGIPII TLIIDEMTGE AGYMTRLEAF VDMIKMRRKP SYFPMPRFFS QKI
|
| |
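The gene and protein sequences above can be related with a short translation routine. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI amino-acid string (translation table 11 assigns the same amino acids to elongation codons as the standard code and differs in its permitted start codons), strips the spaces that group the sequence into blocks of 10, and stops at the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the input."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAAATTA CATTTCCACA TATGGGAGAC"
print(translate(first_blocks))  # MKITFPHMGD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the 373 aa length and the reported 40% GC content to be checked in the same way.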