Gene Moth_1004 details

Gene Information

Locus tag: Moth_1004
Symbol:
ID: 3833307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1032219
End bp: 1033484
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 52%
IMG OID: 637828933
Product: copper amine oxidase-like
Protein accession: YP_429862
Protein GI: 83589853
COG category:
COG ID:
TIGRFAM ID:
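
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1032219, 1033484)
print(length, protein_length_aa(length))  # prints "1266 421"
```

Both values match the Gene Length and Protein Length fields listed for Moth_1004.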


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0001699
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGGGGG AAGCGTTCAT GCGCCGGATA ACCATTATCT TGATTATAGC CCTTTTAACC 
GTCTTTACCC TTCCGGCCTA TGCTGCCGGG GAATTTAAAG CCTCCTTTAC CGTCGGGCAA
AACTTCTATA CAGTTAATAA CCAGAAAATA GTCATGGATG CGGTCCCATT TGTCCGGGAA
AACCGCACCT ACATACCTGT GCGTTACCTG GCCAATGCCC TGGGCATTGA CGCTAACCAT
ATCTCCTGGA ATGAGGTTAG CGGCACTGTT ACCTTAAAGG ACGCTACCGG TACCAGTTCG
GTAGAAATCA GCATGAAGGT CAACCACAGG GATTTAACAA CGGTAACCCA AAATGGAGCT
AACAACAGCC TGAGGGCGGT GGCCAGGACG GAAACTATGG ATGTGGCCCC GCTGTTAATC
AACGGGCGGG TTTATCTACC CGCCCGTTAC GTCGCTGAAG CCCTGGGTTA TACCGTTACC
TGGGACCCAG ATACCCAAAC GGTCTACATT ATGCCCGGCG GAGTCTCCAT CACCCCGGGG
GAGACCTTAC CGGCCCCTCC CGACCAGTAT CTAAACAAGG GGTATATATG GGTTTATGAC
GGCCTTCAAT ACAGTTGGCA GGTGGCCCAA CCCCGCTCCC TGAGTAATTA TAACCAGAGT
TTAGAGAAAA CCATAGCCGG CTGGAACGAT TTGAGTGTAT ATGAACAAGT CCTGGCCCGG
GAAAATATGC CGGCTGAGAT AGCCCAGTTT TTAGATGCCG TCTTGAATGC CCCTCCCGGT
GATTTGCGGC CCTGGATAAA AGAAAAATTA AACCTGGAGT ACACCGCCTC CCTGGCCCGG
GAACTGGAAG GCATGGCCGG GTCCGAGGGA TATGATCGCT TTCACCGGGC CGAATTCATA
TTGAGCTTTG TCCAATCGTT ACCGTATATA TATACGCCCC TGCCCCGCCT GGCAGGAGAA
ACCCTGATCA AGGGCGGCGA CTGTAAAAGC AAGTCAATTT TATTGGCTTC TTTGCTCCAT
AACCTCGGCT ACCAGACAAT CCTCCTGGAA TTCCCGCCGG AAGAGTTTGC CGATAATATT
GGCCATGAAG CTGTGGCCAT TGCCTTTAAC AGGGATGAAC TACCTCCCGG CAGGAATCTC
TTTGCCTTTG ATTACAACGG TCGCAGCTAT TATTATGCTG AAACGACCGC CACCGGCTGG
GGCCTGGGGG AAATGCCGGA GCTCCTGCAA AACAAAAAGG CTGCTATCTT CCCCCTGGAT
TTCTAG
 
Protein sequence
MQGEAFMRRI TIILIIALLT VFTLPAYAAG EFKASFTVGQ NFYTVNNQKI VMDAVPFVRE 
NRTYIPVRYL ANALGIDANH ISWNEVSGTV TLKDATGTSS VEISMKVNHR DLTTVTQNGA
NNSLRAVART ETMDVAPLLI NGRVYLPARY VAEALGYTVT WDPDTQTVYI MPGGVSITPG
ETLPAPPDQY LNKGYIWVYD GLQYSWQVAQ PRSLSNYNQS LEKTIAGWND LSVYEQVLAR
ENMPAEIAQF LDAVLNAPPG DLRPWIKEKL NLEYTASLAR ELEGMAGSEG YDRFHRAEFI
LSFVQSLPYI YTPLPRLAGE TLIKGGDCKS KSILLASLLH NLGYQTILLE FPPEEFADNI
GHEAVAIAFN RDELPPGRNL FAFDYNGRSY YYAETTATGW GLGEMPELLQ NKKAAIFPLD
F
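
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step together with a GC percentage calculation; the helper names are hypothetical, and whether the GC content field in this record was computed in exactly this way is an assumption.

```python
def normalize_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# Only the first two blocks of the gene sequence are used here as a demonstration.
first_blocks = "ATGCAGGGGG AAGCGTTCAT"
sequence = normalize_sequence(first_blocks)
print(len(sequence), gc_percent(sequence))
```

Passing the full gene sequence through the same two functions would give a length and GC percentage to compare against the Gene Length and GC content fields.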