Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1778 |
Symbol | |
ID | 4810023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2100038 |
End bp | 2100997 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107192 |
Product | copper amine oxidase-like protein |
Protein accession | YP_001038192 |
Protein GI | 125974282 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000103341 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCAA AAGGTTTTGT TAGTCTGCTT GTGTCGATTT TATTAATTGC GGTAAGTGTC ATTGCACCGG TCGGTGTGTT TGCACAGAAT CAGGGAAAAA CGATTGTGCT CCAGGTAAAT AATACTGTTG CAACGGTTAA TAACGAAAGC GTAACTTTGG ATGCTGCTCC ATATATCGAC GAAAGCAGCG GCAGGACATT GGTTCCCATA AGGTTTATAT CAGAATCTAT GGGATATTCC GTTACATGGG ATGATGAAGA AAAAACAGTC AGGATTTTAA ACAAGGTTGA TATGAATACC ATAGATGAAT CTGAAGTTGA TGAAACCACA GGTACATCAG TGGAATATTT CAGGTCATGG AATACATACA AGTACATAAA ATTGAAAATA GGCAGCAATG TTGCCGAAAT ATGCGACAAT TACATAATCG GTGAGTATGT TGAAATGACT GAGGTACCTA TAGATCAAGC TCCGGTTATT AAAAATGGGA GAACAATGAT TCCTATAAGA TTTGTTGCGG AGCAGATGAA TTTAAAGGTG GACTGGGACG GTAAAACCAA AAAAATAACA ATTTCATCGA CAGGAGAAGA ATACGTTCCC GCTGCTATTG AAACTGCGCT TGAAAAAATT GCGGTATCCG GCACTTCGGA TACTGAAAAA ACAGAAGATG ATCCTGCGTA TATAAAATCC AAAGAACCTC AGAATTATTT TTTAAAAATT GAAAAACAAG GTTTTGAGTA TTCTGTGGAT TTGGCAGTTC AATCATCAGA AGGCACTGAA GCTGTTTTAA ACGGAACGGT AATTGGACTG AAAGAGGATA AGGCATGCTT TACTTATACC CTGAACAATC AGAAAAATTA CAAATATGAC GGAGTTATAC AGGCTACTGA TAATGGTATT ATTGTAAACT ATACAAATGA AAAGGGTGAA AAGTGTTCTG TTACTTTCGC GGCTAACTGA
|
Protein sequence | MNSKGFVSLL VSILLIAVSV IAPVGVFAQN QGKTIVLQVN NTVATVNNES VTLDAAPYID ESSGRTLVPI RFISESMGYS VTWDDEEKTV RILNKVDMNT IDESEVDETT GTSVEYFRSW NTYKYIKLKI GSNVAEICDN YIIGEYVEMT EVPIDQAPVI KNGRTMIPIR FVAEQMNLKV DWDGKTKKIT ISSTGEEYVP AAIETALEKI AVSGTSDTEK TEDDPAYIKS KEPQNYFLKI EKQGFEYSVD LAVQSSEGTE AVLNGTVIGL KEDKACFTYT LNNQKNYKYD GVIQATDNGI IVNYTNEKGE KCSVTFAAN
|
| |