Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2073 |
Symbol | |
ID | 3831104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2163660 |
End bp | 2165618 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637830001 |
Product | copper amine oxidase-like |
Protein accession | YP_430911 |
Protein GI | 83590902 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.195333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTAT CGCGTATAGT TGTCATTCTT TTATTCTTAT TGCTGCTCCC GCCCGCAGCC GCCTGGGCCG ACCCCTGGCA GGACTACCGG CGTGCCGGGG AATTGGAGGC CGGTGACCCG GTCACGGCCG AAAGCCTCTA CCGCAGCGCC GCCGCTGCTT TTGAGGCCAG CGGCGACCTG AAGAACGCCG GCCTGGCCTG GCAGAAGATC TTTCACCTCT GCGACCGCCA GGGGAAATTG GCCGAAGCCG GGGCCGCCTA CGTGAAGGAG GCCGCCCTCT TCCAGCAGGC TGGGGTACCC GACTGGGCCT GGGGGGATAC AGCCCGGGGC GAGAGCCTGA AATCGGTACT GCGTCTCTTT TTCCCGGTGC CGGACCCCGG GCGGATAACC CTGGCTAAAT TTGAGCCGGC TGCCGGCACC TACCTGGGCC TTTACGAAGA GCGGGGGCCT GCCGGGAACG ACTACAACCG GGTCAAAGAC CTTTACGGCC GGCAGCACGC CCTCTTTTTA GGCTACGGCC ACATCTACGG GCCGGGCAAC TTTGTTTTGC CCCGGGATGC CATGGAGAAG GCCCGGCAGG TCCCCGGCGC GGGCCTGGTC CTGGCCCTGG AGCCCAACGG CGGCCTGGAG GGCCTGCGAG AGGGGGATTT GGTGGCTATC GCCCGGGAGC TCGCCGCCTT TAAAATCCCG GTTTTTCTCC GTTTCGCCTC GGAAATGAAC ATGGAGGGCA CCAACGCCTG GCATGGCGAC CCCGCTACCT ACGTCACCTG GTTTCGCCGG GCGGCGACCA TCATGCGCCG GGAAGCCCCC AACGTGGCCC TGGTCTGGAA CCCTTTTGAT ATCGTCCAGC CGGAGGGGGT CAAAGCCACG GCCCTGGAAT ATTACCCCGG CGATGACTAT GTCGACTGGG TGGGCGTCAA CTTCTACAGC GACTACTACC TGAGCGGCCG CGCCGACCAG CCGGGGGCGG GTCTGGATCC CCTGCAACGC CTGGACTACT GGTACCGCAT TTTCGCCGGC CGCAAGCCCC TAATGGTGGG CGAGTTCGCC ATCGCCCACA CGGTGTTGAG CCCCGGCAGG GCCGATGTCA GCCGCTGGGC CGCCTTCAAT ATTCGTAAAT TCTACCGCAC CCTGCCCCTT CTCTACCCGC GGGTCAAGGC GGTGGTCTAC TTTGATCTCA ACGAAAGCGA CCCCCTCTAC ACCCAGGCGA AGGTGAGCGA TTATCGCCTG GATGACAACC AGGAGGTGTT GACCGCCTAC CGGGAGGCCA TCGCCGGCCC CCACTACCTG GAGCAGGTCG GCCAGGAGGC CCCAGGGACT TACCGGGAAC TTGAGGACGG TATGGCCATC CAGGGGGAAA TTACCCTGGG GGCCGCCGTC AAGCTCTACG ACCCCTTCAT CAGCAGGGTG GAGTATTACT TAAATGGTGA GCTTCTGAGT AGCCCTGCCA GCCCGCCCTA CCAGTTGACC GTTGACTTCA ACCGCCTGCC CGGAGAAGGC AGCCTCGTAG TTAAGGCCTA TGACAGCCAG GGCCGGGAAG CCATTAGCCG CACCTTTACC GTATTCGGCA GCGCCATTCC AGTGGCCACT TTTACCCCCG GGCAGAAGGG CTACACCATT AATGGCCAGG CGCAGGAGAT GGATGTCGCT CCCTTTATCG AAAACGGCCG CACCTACGTC CCCCTGCGCT ACCTGGCTCT GGGCCTGGGT GTTCCCGAGG AGGGGATTGG CTGGGATCCC TCCAGCCAAA CGGTGACTTT GAGCAGCGCC GGCCATATCC TGAAGTTTTA TCCCGGCCGG GCGGAAATGG AACGCGACGG CCGCCGGCAA GCCCTGGATG TAGCTCCCCT GAACCGCGCC AGCCGGGTTT TCCTTCCGGC CCGCTATGTC GCCGAAGCCC TGGGGTACCG GGTCACCTGG TCCCCGGCCC GGCAGCAGGT GCTGGTAATC CGGGAATAA
|
Protein sequence | MKVSRIVVIL LFLLLLPPAA AWADPWQDYR RAGELEAGDP VTAESLYRSA AAAFEASGDL KNAGLAWQKI FHLCDRQGKL AEAGAAYVKE AALFQQAGVP DWAWGDTARG ESLKSVLRLF FPVPDPGRIT LAKFEPAAGT YLGLYEERGP AGNDYNRVKD LYGRQHALFL GYGHIYGPGN FVLPRDAMEK ARQVPGAGLV LALEPNGGLE GLREGDLVAI ARELAAFKIP VFLRFASEMN MEGTNAWHGD PATYVTWFRR AATIMRREAP NVALVWNPFD IVQPEGVKAT ALEYYPGDDY VDWVGVNFYS DYYLSGRADQ PGAGLDPLQR LDYWYRIFAG RKPLMVGEFA IAHTVLSPGR ADVSRWAAFN IRKFYRTLPL LYPRVKAVVY FDLNESDPLY TQAKVSDYRL DDNQEVLTAY REAIAGPHYL EQVGQEAPGT YRELEDGMAI QGEITLGAAV KLYDPFISRV EYYLNGELLS SPASPPYQLT VDFNRLPGEG SLVVKAYDSQ GREAISRTFT VFGSAIPVAT FTPGQKGYTI NGQAQEMDVA PFIENGRTYV PLRYLALGLG VPEEGIGWDP SSQTVTLSSA GHILKFYPGR AEMERDGRRQ ALDVAPLNRA SRVFLPARYV AEALGYRVTW SPARQQVLVI RE
|
| |