Gene Moth_2073 details

Gene Information

Locus tag: Moth_2073
Symbol:
ID: 3831104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2163660
End bp: 2165618
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 64%
IMG OID: 637830001
Product: copper amine oxidase-like
Protein accession: YP_430911
Protein GI: 83590902
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4124] Beta-mannanase
TIGRFAM ID:
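
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2163660, 2165618)
print(length)                     # 1959
print(protein_length_aa(length))  # 652
```

Both values match the Gene Length and Protein Length fields of the record.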


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.195333
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTAT CGCGTATAGT TGTCATTCTT TTATTCTTAT TGCTGCTCCC GCCCGCAGCC 
GCCTGGGCCG ACCCCTGGCA GGACTACCGG CGTGCCGGGG AATTGGAGGC CGGTGACCCG
GTCACGGCCG AAAGCCTCTA CCGCAGCGCC GCCGCTGCTT TTGAGGCCAG CGGCGACCTG
AAGAACGCCG GCCTGGCCTG GCAGAAGATC TTTCACCTCT GCGACCGCCA GGGGAAATTG
GCCGAAGCCG GGGCCGCCTA CGTGAAGGAG GCCGCCCTCT TCCAGCAGGC TGGGGTACCC
GACTGGGCCT GGGGGGATAC AGCCCGGGGC GAGAGCCTGA AATCGGTACT GCGTCTCTTT
TTCCCGGTGC CGGACCCCGG GCGGATAACC CTGGCTAAAT TTGAGCCGGC TGCCGGCACC
TACCTGGGCC TTTACGAAGA GCGGGGGCCT GCCGGGAACG ACTACAACCG GGTCAAAGAC
CTTTACGGCC GGCAGCACGC CCTCTTTTTA GGCTACGGCC ACATCTACGG GCCGGGCAAC
TTTGTTTTGC CCCGGGATGC CATGGAGAAG GCCCGGCAGG TCCCCGGCGC GGGCCTGGTC
CTGGCCCTGG AGCCCAACGG CGGCCTGGAG GGCCTGCGAG AGGGGGATTT GGTGGCTATC
GCCCGGGAGC TCGCCGCCTT TAAAATCCCG GTTTTTCTCC GTTTCGCCTC GGAAATGAAC
ATGGAGGGCA CCAACGCCTG GCATGGCGAC CCCGCTACCT ACGTCACCTG GTTTCGCCGG
GCGGCGACCA TCATGCGCCG GGAAGCCCCC AACGTGGCCC TGGTCTGGAA CCCTTTTGAT
ATCGTCCAGC CGGAGGGGGT CAAAGCCACG GCCCTGGAAT ATTACCCCGG CGATGACTAT
GTCGACTGGG TGGGCGTCAA CTTCTACAGC GACTACTACC TGAGCGGCCG CGCCGACCAG
CCGGGGGCGG GTCTGGATCC CCTGCAACGC CTGGACTACT GGTACCGCAT TTTCGCCGGC
CGCAAGCCCC TAATGGTGGG CGAGTTCGCC ATCGCCCACA CGGTGTTGAG CCCCGGCAGG
GCCGATGTCA GCCGCTGGGC CGCCTTCAAT ATTCGTAAAT TCTACCGCAC CCTGCCCCTT
CTCTACCCGC GGGTCAAGGC GGTGGTCTAC TTTGATCTCA ACGAAAGCGA CCCCCTCTAC
ACCCAGGCGA AGGTGAGCGA TTATCGCCTG GATGACAACC AGGAGGTGTT GACCGCCTAC
CGGGAGGCCA TCGCCGGCCC CCACTACCTG GAGCAGGTCG GCCAGGAGGC CCCAGGGACT
TACCGGGAAC TTGAGGACGG TATGGCCATC CAGGGGGAAA TTACCCTGGG GGCCGCCGTC
AAGCTCTACG ACCCCTTCAT CAGCAGGGTG GAGTATTACT TAAATGGTGA GCTTCTGAGT
AGCCCTGCCA GCCCGCCCTA CCAGTTGACC GTTGACTTCA ACCGCCTGCC CGGAGAAGGC
AGCCTCGTAG TTAAGGCCTA TGACAGCCAG GGCCGGGAAG CCATTAGCCG CACCTTTACC
GTATTCGGCA GCGCCATTCC AGTGGCCACT TTTACCCCCG GGCAGAAGGG CTACACCATT
AATGGCCAGG CGCAGGAGAT GGATGTCGCT CCCTTTATCG AAAACGGCCG CACCTACGTC
CCCCTGCGCT ACCTGGCTCT GGGCCTGGGT GTTCCCGAGG AGGGGATTGG CTGGGATCCC
TCCAGCCAAA CGGTGACTTT GAGCAGCGCC GGCCATATCC TGAAGTTTTA TCCCGGCCGG
GCGGAAATGG AACGCGACGG CCGCCGGCAA GCCCTGGATG TAGCTCCCCT GAACCGCGCC
AGCCGGGTTT TCCTTCCGGC CCGCTATGTC GCCGAAGCCC TGGGGTACCG GGTCACCTGG
TCCCCGGCCC GGCAGCAGGT GCTGGTAATC CGGGAATAA
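
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one way to do that; the helper name `gc_content` is hypothetical, and it assumes the percentage is the plain G+C fraction over all bases, since the record does not say how its value was calculated or rounded.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, pasted with its block spacing.
first_line = "ATGAAGGTAT CGCGTATAGT TGTCATTCTT TTATTCTTAT TGCTGCTCCC GCCCGCAGCC"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of the first line gives the figure to compare against the reported GC content.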
 
Protein sequence
MKVSRIVVIL LFLLLLPPAA AWADPWQDYR RAGELEAGDP VTAESLYRSA AAAFEASGDL 
KNAGLAWQKI FHLCDRQGKL AEAGAAYVKE AALFQQAGVP DWAWGDTARG ESLKSVLRLF
FPVPDPGRIT LAKFEPAAGT YLGLYEERGP AGNDYNRVKD LYGRQHALFL GYGHIYGPGN
FVLPRDAMEK ARQVPGAGLV LALEPNGGLE GLREGDLVAI ARELAAFKIP VFLRFASEMN
MEGTNAWHGD PATYVTWFRR AATIMRREAP NVALVWNPFD IVQPEGVKAT ALEYYPGDDY
VDWVGVNFYS DYYLSGRADQ PGAGLDPLQR LDYWYRIFAG RKPLMVGEFA IAHTVLSPGR
ADVSRWAAFN IRKFYRTLPL LYPRVKAVVY FDLNESDPLY TQAKVSDYRL DDNQEVLTAY
REAIAGPHYL EQVGQEAPGT YRELEDGMAI QGEITLGAAV KLYDPFISRV EYYLNGELLS
SPASPPYQLT VDFNRLPGEG SLVVKAYDSQ GREAISRTFT VFGSAIPVAT FTPGQKGYTI
NGQAQEMDVA PFIENGRTYV PLRYLALGLG VPEEGIGWDP SSQTVTLSSA GHILKFYPGR
AEMERDGRRQ ALDVAPLNRA SRVFLPARYV AEALGYRVTW SPARQQVLVI RE
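
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which this sketch does not model. The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
print(translate("ATGAAGGTAT CGCGTATAGT TGTCATTCTT TTATTCTTAT TGCTGCTCCC GCCCGCAGCC"))
# MKVSRIVVILLFLLLLPPAA
```

The output matches the first 20 residues of the protein sequence; the final codons of the gene, `CGG GAA TAA`, translate to `RE` followed by a stop, matching its last two residues.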