Gene Moth_1790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1790 
Symbol 
ID3832456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1845558 
End bp1846844 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content57% 
IMG OID637829715 
Productcopper amine oxidase-like 
Protein accessionYP_430634 
Protein GI83590625 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGTCA TTAATCGCTG CATTATTAAG AAGGCTGCTA CGTTGCTCTT GTGTGCCAGT 
TCTCTGGCAG TAGTGCCGGT GCCTGCCGCT GCTGCCAGCG AAAGCGTTAA ACTGGTGATT
GACAACAAAC CACTGACAGT ACCGGCAGGG GACCAGGGAG CTTTTATAAT GGGGAACAGG
ACCTACGTTC CTTTAAGGAT CATCAGTGAA AAACTGGGAG CCAGGGTGGA CTGGCAACAG
GATGCCAATC GGGTTATTAT CACTACCAGA GAAACGCCAT CGGTTCCACC CCCGGCGGAT
GATCGAAAGA ATCAGGGAGA GGTACAGATT ATCATTGATG GTAAGATTTT ACAACTCCCT
CCTTCTCTGG GGCGCCCTTA CATAACCCCT GCAGGGCGGA CAGTAGTTCC CCTGCGGGCG
GTGGGCGAAG CCCTCGGTTG CGAAGTTAAC TGGGTGGCCT CCACCAGTAC AGTAGAAATT
AGATCCGCTA CTTATAAACT GCTGCTAGAG TTAGCCGGTT ACCGAAGCAA CCTGCGGTTG
CTGGACGGGA CAGTAATCAA CTCCGCCGAA CTCTTAAAGA TGGATCCCTC ATCTTTCGGC
CGGGAACAAC TGCAGCAGTT CCGGGAATTC CTGGGGTACC TCAAGAAGTA TGACCAGCAG
GTCAAGTTGC CCGACGGCAC GGTGTTAAAC GTCGCCGATA TCACCATCGA GGGGCAACCG
GTAGCCAGCG CCGCCCAGCT CCGGGCCTGG ATCGCCAGTG AAATCCCCCG CCTGCGGGTC
AAGATGCAGG AACAGTACCA CCGCGACCTG CTTCCCATCC CGGATCTGGC CGAACTGTAC
CTGCGGCTCG GCGCCGAGTA CGGCATCCGC GGGGACCTGG CCTTTGCCCA GGCGGCCAAG
GAGACCAACT TCTGGCAGTT TACCGGGAGC GTCAAGCCCG ACCAGAATAA TTACTGCGGC
CTGGGGGCCC TTAGCAGTCC CAATACGGGG AATGAGCCCC TTAATGGCGC CGATCCCACT
AAAGTCCGGT TCGCGCCCGG CGTCTACGGG GCCATCTTCG CTTCCCCGGA AATCGGGGTC
GAAGCCCATA TTCAACACCT TTATGCCTAC GCCACTAAAA AACCCTTGCC CCCGGGTAAG
GTGCTCTATG ACCCGCGCTT CAATTTGGTA CAGCGCGGCT CCGCCACCAC CTGGCAGGGA
CTCAACGCCC GTTGGGCGGT TCCGGGCATT ACTTACGGCC AGAGCATTAT TGAGGATTAC
TGGCTGAAAG CCCTGGCGGC GAAATAA
 
Protein sequence
MPVINRCIIK KAATLLLCAS SLAVVPVPAA AASESVKLVI DNKPLTVPAG DQGAFIMGNR 
TYVPLRIISE KLGARVDWQQ DANRVIITTR ETPSVPPPAD DRKNQGEVQI IIDGKILQLP
PSLGRPYITP AGRTVVPLRA VGEALGCEVN WVASTSTVEI RSATYKLLLE LAGYRSNLRL
LDGTVINSAE LLKMDPSSFG REQLQQFREF LGYLKKYDQQ VKLPDGTVLN VADITIEGQP
VASAAQLRAW IASEIPRLRV KMQEQYHRDL LPIPDLAELY LRLGAEYGIR GDLAFAQAAK
ETNFWQFTGS VKPDQNNYCG LGALSSPNTG NEPLNGADPT KVRFAPGVYG AIFASPEIGV
EAHIQHLYAY ATKKPLPPGK VLYDPRFNLV QRGSATTWQG LNARWAVPGI TYGQSIIEDY
WLKALAAK