Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2235 |
Symbol | |
ID | 4809973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2662160 |
End bp | 2664106 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107641 |
Product | hypothetical protein |
Protein accession | YP_001038630 |
Protein GI | 125974720 |
COG category | [S] Function unknown |
COG ID | [COG2604] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCGGCCT TTTTTAAAAC ATTTGACAGG CAGGAGGATT TTATGGTTGA CGTATTCAGA CTTAATATTG ATGCTTTAAA GGAAAATTAT CTTCCTTTGG CGCGGTTTTT CGAAAATTTG AATGAAAATA CCGGAAATGA AAGTAGTGTT ATCCTTGAGA CCTCAAAAAG CGGGATGCCG AATTTCAGGG TAAAAAAAGG AGAACATACT TTTTTTGTTC ATAGCCCGTA TGATCCCAGA ACTGAAGCGA TAAGATGGGC AGAGAAAATT GATTTAAAAG GTTTTGATAC AATTGCGGTT TTGGGTATTG GATGTGGATA TCATATAGAA GAGCTTGAAA AAAAGTATCC TGATAAAAAC AAAATTGTGA TTGAACCGGA CAGAAATGTG TTTTTAAAAC TTCTTAACAC AAGGGACATT ACACATTTAA TATCCAGTAA AAACATATTA TTTATAATCA GTGACAATAC CGAGGAAATC GCAAAGGTTT TCTTGTTGCT CAGAGAAGAA GGAGAAATAG ACAGTGTGGA GTTTAATGAA TTGTTAAGTT ACAGAAAAGT TTATGAGGAT TGGTGGTTGG AATTAAAAAA GGAATATATA AAGTTTGCAA GACTTCATCA GATAAATACC AATACGAGTG TTTTTTTTGC GGAAGCATGG TTAACTAATT TATTTGAAGG AATGTGGCAG TTGACAAAGA GTGTGCATAT CAAGGAATAC AAATCTGCTT TTGCCAATAT TCCGGCAATT GTTGTTTCTG CAGGCCCGGC ATTGAATAAA AATGTACACC TTTTAAAAGA ACTGTACAAT AAGGCGGTTA TTATTTCTGC CGGTTCCGCC CTTAACATTT TGGAAAGCAG GGGCATTACA CCCCATATTA TGGTTGGGGT GGACGGCGGA GAAGCGGAAA GCCGAATATT TAACAATGTT AAATCCAATG AAATATATTT TGCCTATTCT CTTTCGGTTC ATTATGACGG ATTGAAAAAT TACAGCGGTC CCAAAATATA TTTCAAGACC AATGTGCTGG GATATGGTGA CTGGATTGAC GAAAAGTTGG GCATTGAAGG TGCGGAAAAC CTTCGCTCAG GTTCTTCCGT GTCAAATCTG TCTTTGGACA TTGCGAGGTA TATGGGATGC AATCCGATTA TTGTCATAGG CCAGAATTTG TCCTTTCCAA ACCTGGAATC TTATGCTGAC GGTGCTGTGT TAAAACAGGA ACAGGACCGG CATATCCAAC AGTGCGTGGA AAATTCAAAT AAGTATTATG TGCTTGAAAA AGATATTGAT GGTAATGATG TATATACTAC TCACAGCATG CTTTCCATAA GATTTTATTT TGAAGAGTAT ATCAAGAACC ACCCGGACAG GTTGTATTTG AACGGCTCTG AAGAAGGACT GCCTATAAAA GGGATGAAAA ACATGCCCTT GAAGGAGATT GTAGAAAAGT ACTGCACAAA GGAATATGAC ATTAAAGGAA TTTTGGATAA AAAATTCAAA GAAGAATTTG AGGCAGAAAA CGTAAAAGCA AAAGAAATTA AAATTAGAAG AATTTTGGAA GATATTCACA AAGAAAGCAC TGAAATAAGA CAAAAAGCTA TAAAGAGAAT AGATTTAATT CTCGATATAT TAAGTAATAT CAGAGGCAGC CATAATGACA AGTGGGAGGA AATTGACAGG TTGACAGATG AAATTGAAAG CAGTGATTTG TACAAATATT TCGTTGAACC TTTGAGCAAA TACTTTATTC AGGCTGTTAA GAATGAAAGG GAAAGAAAGA TGGAAAGCAT TCCTGATATA CAAGAGAGAT TGAAATATCT GTATGAGGGA TTGCTTATAC AGTATGTTGA GGTAAAGGAC AAAATTGTCC TGATAGACGA TTTGTCCCAA AAAATAATAG AGAATATTGA TAAAAAGGAG GCAACAAAAT GTCTAAGTAT GGTTTGA
|
Protein sequence | MPAFFKTFDR QEDFMVDVFR LNIDALKENY LPLARFFENL NENTGNESSV ILETSKSGMP NFRVKKGEHT FFVHSPYDPR TEAIRWAEKI DLKGFDTIAV LGIGCGYHIE ELEKKYPDKN KIVIEPDRNV FLKLLNTRDI THLISSKNIL FIISDNTEEI AKVFLLLREE GEIDSVEFNE LLSYRKVYED WWLELKKEYI KFARLHQINT NTSVFFAEAW LTNLFEGMWQ LTKSVHIKEY KSAFANIPAI VVSAGPALNK NVHLLKELYN KAVIISAGSA LNILESRGIT PHIMVGVDGG EAESRIFNNV KSNEIYFAYS LSVHYDGLKN YSGPKIYFKT NVLGYGDWID EKLGIEGAEN LRSGSSVSNL SLDIARYMGC NPIIVIGQNL SFPNLESYAD GAVLKQEQDR HIQQCVENSN KYYVLEKDID GNDVYTTHSM LSIRFYFEEY IKNHPDRLYL NGSEEGLPIK GMKNMPLKEI VEKYCTKEYD IKGILDKKFK EEFEAENVKA KEIKIRRILE DIHKESTEIR QKAIKRIDLI LDILSNIRGS HNDKWEEIDR LTDEIESSDL YKYFVEPLSK YFIQAVKNER ERKMESIPDI QERLKYLYEG LLIQYVEVKD KIVLIDDLSQ KIIENIDKKE ATKCLSMV
|
| |