Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1912 |
Symbol | |
ID | 4810770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2277470 |
End bp | 2279077 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107329 |
Product | copper amine oxidase-like protein |
Protein accession | YP_001038324 |
Protein GI | 125974414 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000429846 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TTGCAAGAAA AATATCAATG TTACTGGTCG TTGCACTTTT GGCGGTATCG ATGGTGGCCT GTACCAGTGA TGAAATTGCA TTAATTGAAG CCATGTCCAA GACATCAGAA ATTTCATCCT ATGAAGGAAA TTCAAAAATT CAGTTAAGTT TTAAAGGCCA GGGATTTTCG GAAAAAACGC AGAAAGTTTT CGATTTTCTG GCTTCATATG TTGACGGATT CACTTTTGAG GCAAATCAGA AGTATTCGTC CAACGATGAA AAGACAAAAG CAACGCTTGC CATGGACGGA AATGTGGATA TGCAAGGTTT GAGCGTTAAA TATAAATATT GGACCGATAT GGACTTTACA ACCGAGAATC CTAGCTTAAT ACAGATTGTT GAACTTCCGC CGGCAATTAC CCAGCCGATG TTTACCTTTG CCAACACAGG AACAAAAAAA TATATTACTA TTGATTACGG TAGTGTATTG TCTGCGGAAA ATAACGGGGG CATATCTCTT AACCCGGAGA ATTTGGCAAA AAACAGTGTT GAGCTGCAGG AAATGTTGCT GGACTTTGTA AAAACAACAG CCAAGGACTT TGACCCCGGT ATGGTGGCAG TAACCAAAAA AGGTTCCGCT GTTACTGACA AGGGAGAAAA AGTTACGGAA TACGAATTGA AATTGGATGA TGCTGCAGCT AAAAAATTAT TGCATGCTTT TATAAATGAT GTCATATTGC AGGAAGATAC TATAGAGTTT GGCAAAAAGT ATATGGAAGC TGCCATAAAT ATGTATGATT TCCCGGAAGA GGAAAAACAA GAGGCATTGG ACGAAATCAA CAAAGGTCTG GATGAATTTG CATCCCAGCT TCCTGCATAC AGGGACAGTG TTACACAAGT TTTTGAGTCG ATAAAAGACG TCAAATTCTT TGGTGACAAA GGTTTGGTAG CAAAATACTA CATAAACAAT GACGGCTTCC TTGTAGGAGG AAAATCATCC ATTGACATTA AGATAAAAAT GGCAGATTTT GCGGCATTAT TGGGTGACAA TTTTGACGAA AAAGATAAAA ATGGAGTGCT TTATTTAACA ATAGACGCTG AAAGTTCTGT TTACAACATA AATAAAGAAG TATCCATAGA ACTTCCGGAA ATAACTGAAG AAAATTCCTT TGATGTTTTG AAAGGATTTA TGCCGATACT GTCTGGTATT GGTTCTGCTC CGGGTATCGG TGAAGAGGAT TATACATATG ATATTCCTGC TTTGTCAGAC GGAATTAATG TTGTTATGAA TGGAAAAGTA GTATATTTCC CTGATGTAAA GCCTGAAAAC GTCAATGGAA GAGTGTTGGT TCCAATAAGA ACCATATCCG AGGAAATGGG AGCGGAAGTA ACCTATAATG ATGCAACAAA GCAGGTTCTT ATTGCCAAGG ATGATACGGA AATTCTTCTT ACAATTGGTT CCCAGGAAGC TTACGTTAAC GGCGAGAAGA TAATGCTTGA TGTACCGGCA ATGATTATTG AAGGACGTAC AATGGTTCCG TTAAGATTCA TATCTGAGAG TATGAATGCA ACGGTTGAAT GGGACGGAGA AGCTCAGATA GTATACATAT TTTATTAA
|
Protein sequence | MKKIARKISM LLVVALLAVS MVACTSDEIA LIEAMSKTSE ISSYEGNSKI QLSFKGQGFS EKTQKVFDFL ASYVDGFTFE ANQKYSSNDE KTKATLAMDG NVDMQGLSVK YKYWTDMDFT TENPSLIQIV ELPPAITQPM FTFANTGTKK YITIDYGSVL SAENNGGISL NPENLAKNSV ELQEMLLDFV KTTAKDFDPG MVAVTKKGSA VTDKGEKVTE YELKLDDAAA KKLLHAFIND VILQEDTIEF GKKYMEAAIN MYDFPEEEKQ EALDEINKGL DEFASQLPAY RDSVTQVFES IKDVKFFGDK GLVAKYYINN DGFLVGGKSS IDIKIKMADF AALLGDNFDE KDKNGVLYLT IDAESSVYNI NKEVSIELPE ITEENSFDVL KGFMPILSGI GSAPGIGEED YTYDIPALSD GINVVMNGKV VYFPDVKPEN VNGRVLVPIR TISEEMGAEV TYNDATKQVL IAKDDTEILL TIGSQEAYVN GEKIMLDVPA MIIEGRTMVP LRFISESMNA TVEWDGEAQI VYIFY
|
| |