Gene Information |
Locus tag | Cthe_0744 |
Symbol | |
ID | 4810362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 906953 |
End bp | 909805 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640106161 |
Product | copper amine oxidase-like protein |
Protein accession | YP_001037172 |
Protein GI | 125973262 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
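The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with the listed values. The function names are hypothetical, chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no residue.
    return length_bp // 3 - 1


length = gene_length_bp(906953, 909805)
print(length)                     # 2853
print(protein_length_aa(length))  # 950
```

Under these assumptions the results match the Gene Length (2853 bp) and Protein Length (950 aa) fields.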
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0284695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCAAACAAAT GAAAGTGGCG TTATTTTTGT TTTCTTTTTT AAGTGTAATT GTTTTTATGT CTGTAAGTTC TTTGGCTTTG CCTGATGTAC CGAAAAAAAG TTCAAACACT TCGGGGAATA TGGCCAATAA CGGTCGTGTT TTAAAATACG ACGGCTGGGT TTATTATTCT TTCGATGAAA GCGGACTGTA TAGAATGAAA GAGGATGGAA GCCAGAAGAA AAAGATATGT GACGGAATGT ATGACAACCT GACCGCATAT GACGGCTATA TTTACGGCTA TTGCAGATAT ACAAAAACAA ATAACCCTGA GGAAACAGGA TTGTTCAGGT TGAAGCCTGA TGGAACGGAA AGAGTGAAGA TTAGCGATAA ATCCATGCTT TTTGTTACGA TATATGATGG ATGGATATAT TATACATCGT TTGATGATAA TTTCAAACCT TATAAAATGA AACTTGACGC TACCGATGAC CAAAAACTGA GTGATTATTC TGCTAGTTAC ATAAATGTTG ATAATCAATG GGTATATTTC CAGAATGATG CGAATGGTGG GCGTATATAT AAAGTGAAAC ATGACGGCAG TCAAATTACT GAAGTTAGCG ATCATGGTAA TATGTATACG TATTTGAACA TTGACGGTGA ATGGATTTAC TTTTCCGGTC ATTATTTCCT TTATAAAATG AAGGTTGATG GAACTGAATT AACTCCGCTG TTTGAGGAAC TTATAAATAA TGTAAATTCT AAGGATAGTT GGATTTACTT TTCTGTTTTT GAAGAAGGGA TATACAGAAT TAAAACGGAC GGAACGCAAT TACAGAAACT GCGTGATGTG GAGGATTTTG TTTCAGGTAT AAGTCTTACG GATGAATGGC TGTATTATGA GGTATATGAT GTTAAGGATT CCAGTACTCG TGTTTACAGA ATGAGGTTGG ACGGTTCTAG TCACCAGAAG TTTAAAATTA CCGAGGATCA TGTTCCCGAT GAAGTGGAAA ATGTAAAAAT CAGAATTGAC GGCAAGTTCG GTGAATACAG CAATGTACCG TTAAATCTCT ATGGAAGGAT TTTGCTTCCT TTCAGGGAGA TTCTGAAAAA TCTTGGAGTT CCCGACGATG ATAAACATAT TATCTGGGAC GGAAAAAACA GGACTGTTAC CGTAAAAAAG GACAACATTA CGATTCTTCT GACAATAGGG AAAAACACGG CATTGGTAAA CGGAAAAGAA TATGTACTGG ATGTTGCACC CATTATATAC AATGACCGTA CGTATATTCC GACAAGGTTT ATCGCAGAAA GCTTGAACAA GAAGGTTTTA TGGGACGGGG AAAAGCAAAT TGTGTCTATT TGTGAGCCTG CAGAGTTTGA AAAGGTAAAA GATATACTTG CAAAAACCAA TGATGCAATG GTGAACAGTG TTGAAAGATA TAGTGTTGAC CAGAAAAATA GTATGAAATA TAACAATGAC TATTTGGATT ATGAGATTCA GATGACAGCG AAACTTGAAA TTGATTCGAA AAAGCGGTTG TGCAATATTG TTGGCGAAAG AAAAATATTG GAAACGGGTA ATATTACAAA TTTTAATTCT GAAGAAACAT CAAATATATA TTGCTGGTGT GAAAATGAGA AGATTTATTT GAAAGATGAG AATTATGATT TGTGGTATGA GTCGGAATTA ACAGAAGATG AATGGGAAAA TTTTAAACGC AGGGTTTTAA GCAACTATAG CTATATAAAC GCTGATGATT TAATCTGTTC AGGTTTTACT GTGGACGAAA CCGACGAACA TTATATATTA 
AAGGGTGAGC ATATTTTTGA TGAGTTGATA TCCGGCGCAT TGTATGAACT GGATATAGTA AATGAAATAA TCAGTGGTAC AAACACCGAA TTGTTAATCA ATAAGAAAAC TTATTATTTG GAAAAAATAA ATACTACAGT AACCGGTAAA GAGAAAAGTT CTTATGGAGG AAATTTCAGC ATTGCTGTTA GCCTGGAAAA TACGAATTTT AACAGCGAAG CACGGGTTAC AGCTCCAGAA GGATTCGATC CTGATAAACT TATAAATAAA AAAGCTGTTG AATTGGTGAA TTTTATATCC CAGGCATACC TTTATTTTAA TGAAATAGAT AACTTTGAAG AAAAGAGTAA AGAATTTATA ACAAAGAAGG ACTTTAATTT TGATGATGTA AAACAGTATA TTGAAGCTAT TAAAGCAGAT GATGATGTTT TTACCGTATG CGTGGATGAA AATAGTCTTG AATATGGATA TAAAGAAGAA CAGATTGAGA CAAAAGACTT GGGGAAAGAC GCTGTCTATA TAAAAATAAA AAGCTTTACG GAGGATGTCG GGGATAAATT TATTGAGGAA GCGGACAAGA TTGAGAATTC CGAGGATAAA ACTCTTGTAA TTGATTTAAG AGATAACGGA GGGGGATTTA TAATATCCGC AAATGATATT CTTGACTATT TGCTGCCAAG GTGCCTGATG AATTATTATA TCAGCCGGAG CGGAGATATG TTGCCTGTTT ATTCTAATGA TGATTATAAA GAGTTTAAAC AAATACTCAT TCTTGTTAAC GAGAATACTG CAAGCAGTGC GGAACTTCTT GCATTGGGAC TTAAAAAGCA CTTGAAAAAT ACCACTGTTA TAGGACGGAC AACCCTTGGA AAAGGTGTCG GACAACTTGT ATACAGAGAT GATGATAAAA AGTTTTCGGT ATACCTTGTA AGTTTTTACT GGAATGTAAA AGAACAGAAC GTAATGAAAA GCGGCATTAC ACCGGATATA GTTGTTAACA GTGATTCTGA TTATTTGAAG GAAGTGGAAA AATTATTGAA GTCAAAACGT TGA
|
Protein sequence | MKKIKQMKVA LFLFSFLSVI VFMSVSSLAL PDVPKKSSNT SGNMANNGRV LKYDGWVYYS FDESGLYRMK EDGSQKKKIC DGMYDNLTAY DGYIYGYCRY TKTNNPEETG LFRLKPDGTE RVKISDKSML FVTIYDGWIY YTSFDDNFKP YKMKLDATDD QKLSDYSASY INVDNQWVYF QNDANGGRIY KVKHDGSQIT EVSDHGNMYT YLNIDGEWIY FSGHYFLYKM KVDGTELTPL FEELINNVNS KDSWIYFSVF EEGIYRIKTD GTQLQKLRDV EDFVSGISLT DEWLYYEVYD VKDSSTRVYR MRLDGSSHQK FKITEDHVPD EVENVKIRID GKFGEYSNVP LNLYGRILLP FREILKNLGV PDDDKHIIWD GKNRTVTVKK DNITILLTIG KNTALVNGKE YVLDVAPIIY NDRTYIPTRF IAESLNKKVL WDGEKQIVSI CEPAEFEKVK DILAKTNDAM VNSVERYSVD QKNSMKYNND YLDYEIQMTA KLEIDSKKRL CNIVGERKIL ETGNITNFNS EETSNIYCWC ENEKIYLKDE NYDLWYESEL TEDEWENFKR RVLSNYSYIN ADDLICSGFT VDETDEHYIL KGEHIFDELI SGALYELDIV NEIISGTNTE LLINKKTYYL EKINTTVTGK EKSSYGGNFS IAVSLENTNF NSEARVTAPE GFDPDKLINK KAVELVNFIS QAYLYFNEID NFEEKSKEFI TKKDFNFDDV KQYIEAIKAD DDVFTVCVDE NSLEYGYKEE QIETKDLGKD AVYIKIKSFT EDVGDKFIEE ADKIENSEDK TLVIDLRDNG GGFIISANDI LDYLLPRCLM NYYISRSGDM LPVYSNDDYK EFKQILILVN ENTASSAELL ALGLKKHLKN TTVIGRTTLG KGVGQLVYRD DDKKFSVYLV SFYWNVKEQN VMKSGITPDI VVNSDSDYLK EVEKLLKSKR
|
| |
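The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch of helpers for stripping that layout, computing a GC fraction, and reverse-complementing a DNA string (relevant because the gene lies on the minus strand of the replicon). The helper names are hypothetical, and the record does not say how its GC content figure was computed or rounded, so the result may not match it exactly.

```python
# Complement map for unambiguous DNA bases; N is left as N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a cleaned DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# Usage with the first two blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAAAAAAA TCAAACAAAT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.15
```

Pasting the full Gene sequence field into `clean_sequence` and checking that the cleaned length equals the stated gene length is a quick way to confirm nothing was lost in copying.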