Gene Information |
Locus tag | Cthe_0824 |
Symbol | |
ID | 4810442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1002382 |
End bp | 1004157 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106241 |
Product | copper amine oxidase-like protein |
Protein accession | YP_001037252 |
Protein GI | 125973342 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
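The coordinate and length fields in the record above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 1002382
end_bp = 1004157

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1776 0 591
```

Both results agree with the Gene Length (1776 bp) and Protein Length (591 aa) rows.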
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.195798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA AGCTGTTATT CATGGCGCTG TGCATCCTTG TTTTTTACTT TCCGTTGATT TCCCATGCCA GTGCAAACAT CACGGTTGTG GTAAACGGCG AAAAAGTTAA CTTTACTGAC CAACAACCTT TCATTGACAG CAACTCAAGA ACAATGGTAC CCATAAGATT CATTTCCGAA GCCCTCGACG CCAAAGTGGA TTGGATTGAA AAAGAGCGCA TGGTGGTAAT AAAAAAGCAG GGAACGGAAA TTTCACTGGT TGTAGGTATG AAAACCGCCA AAGTCAATGG TAAAGAAATC AAGCTTGATA CTTCATCGGT TATTGCAGGA GGCAGGACTT TTGTCCCTGT AAGATTTATC TCAGAAGCCT TTGGTGCTAC CGTAGAATGG GACGGTAAAA ACAAAATTGT CACAATCACA ACCAAGCCGC AAGCCGGAAA TGAAATCGGT AAAATGCCCG AATCAAGCTA TTACAACGAA CTTTACAACC TCTTCACCCT CGTGCCCAAA GGTATGAAAG GCTCCAATAT AGAGCAGTTC GCTTATTACA CCTTTAAAGA AGATGGTACT GTCCTCAACC TCAACTTTAC CGAAGAGGAA ATATTCAAAG GTCATTCATT CCCCGAGTTG TCAAAAGGCG GGGTTATCTT TTCCCCCAGC AACCTTGAAA CCGTTGAATC ATACACTGTT GAAATCTTCA AAAATGTGTC AAGGACAAAC AAACAGCTTG TCGGTCAATA CATATATTTT TCCGATTTAA AACCTGAAAA CCTTATGGTT ATTGAAAAAA ACGGGTTTAC TTTTGTCACC ACAAAGGGAT TTAAACCTTT TGAAATCAAA GAAACCGGCT ACAAGCTGAT GAGTAAAAAA CTCTTTATAT ACAACGACAA AAACCTGTAT GTTTTTTCAT TTATTTATGA CCCCTCGGAC AGAAAGTATC TTACGGAAAG TGTTATGGAC AGCATCATTA ATTCCATTGA AATAAACGGT ACCAAAATCA ATCTGAAAAA ATCAACAACA CCGGGCAAAA ACGGCGAAGT TGAAGATATT TTACCGGCAG AAGCCAATTA TACCAGGAAA ATGACTGTAA AAGAAAATGA GGAACGGCTT AAAAAGAAAT ATTTTTCTCA AGGATTTGAC CTTTCGGACA TGGTCAGAAT TGAAGGAGGC ACCTTTATCG ACGAAAACGG TGAAAAAGTA ACCGTAAAGC CCTTCTACGT CAGCAAAAAC TTAGTGACAA TCAGCGAATG GAACAGCATC TCAAAAAATA AAATCAACAT CAAAGACCTC AACGCAAAAT ACAATTTAAA CATAAAATCC GAAAATTATC CGGCCGTATT TGAAACCGAA GAAAAATACG GCACGGTCAA AATAAAGAAC AATATGGAGC TTTACGTGTT CTGCAATGAA AAAAGCAAAA GTTTCGGAAT CGAAGAATAT TATACCTTCC AAAAAACCAG TTACGGTACT TCTACCATCT TTGAACACAA CGGAGGATTC AGGTTGCTGA GTGAAGATGA ACTAAAATAC ATATTGAGAA AAACCAAATC CGATGCCTAT AAAAACAACG TAAAATCAAA TACCGTTTCC GAAGTCGGAC GTTCTTCCAA AAACGAGTTT GGAATTTTTG ACTACGACTC TAATGTTGCG GAAGTAACGG ATTTTGATTC GGTACTCAAA ATCAAAGACA ATTCTCAAAT GCTGTGCGGT TTCAGATATG CAAAAGATAT TGGAAATTCA CCACAAGATT TGATACTGGA TTTTTTCAGT AATTAA
|
Protein sequence | MAKKLLFMAL CILVFYFPLI SHASANITVV VNGEKVNFTD QQPFIDSNSR TMVPIRFISE ALDAKVDWIE KERMVVIKKQ GTEISLVVGM KTAKVNGKEI KLDTSSVIAG GRTFVPVRFI SEAFGATVEW DGKNKIVTIT TKPQAGNEIG KMPESSYYNE LYNLFTLVPK GMKGSNIEQF AYYTFKEDGT VLNLNFTEEE IFKGHSFPEL SKGGVIFSPS NLETVESYTV EIFKNVSRTN KQLVGQYIYF SDLKPENLMV IEKNGFTFVT TKGFKPFEIK ETGYKLMSKK LFIYNDKNLY VFSFIYDPSD RKYLTESVMD SIINSIEING TKINLKKSTT PGKNGEVEDI LPAEANYTRK MTVKENEERL KKKYFSQGFD LSDMVRIEGG TFIDENGEKV TVKPFYVSKN LVTISEWNSI SKNKINIKDL NAKYNLNIKS ENYPAVFETE EKYGTVKIKN NMELYVFCNE KSKSFGIEEY YTFQKTSYGT STIFEHNGGF RLLSEDELKY ILRKTKSDAY KNNVKSNTVS EVGRSSKNEF GIFDYDSNVA EVTDFDSVLK IKDNSQMLCG FRYAKDIGNS PQDLILDFFS N
|
| |
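The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with a simple GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGCAAAAA AGCTGTTATT")
print(len(prefix), gc_percent(prefix))  # 20 30.0
```

Applying the same two functions to the full gene sequence would give a value to compare against the GC content row in the record.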