Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1004 |
Symbol | |
ID | 3833307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1032219 |
End bp | 1033484 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637828933 |
Product | copper amine oxidase-like |
Protein accession | YP_429862 |
Protein GI | 83589853 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0001699 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGGGG AAGCGTTCAT GCGCCGGATA ACCATTATCT TGATTATAGC CCTTTTAACC GTCTTTACCC TTCCGGCCTA TGCTGCCGGG GAATTTAAAG CCTCCTTTAC CGTCGGGCAA AACTTCTATA CAGTTAATAA CCAGAAAATA GTCATGGATG CGGTCCCATT TGTCCGGGAA AACCGCACCT ACATACCTGT GCGTTACCTG GCCAATGCCC TGGGCATTGA CGCTAACCAT ATCTCCTGGA ATGAGGTTAG CGGCACTGTT ACCTTAAAGG ACGCTACCGG TACCAGTTCG GTAGAAATCA GCATGAAGGT CAACCACAGG GATTTAACAA CGGTAACCCA AAATGGAGCT AACAACAGCC TGAGGGCGGT GGCCAGGACG GAAACTATGG ATGTGGCCCC GCTGTTAATC AACGGGCGGG TTTATCTACC CGCCCGTTAC GTCGCTGAAG CCCTGGGTTA TACCGTTACC TGGGACCCAG ATACCCAAAC GGTCTACATT ATGCCCGGCG GAGTCTCCAT CACCCCGGGG GAGACCTTAC CGGCCCCTCC CGACCAGTAT CTAAACAAGG GGTATATATG GGTTTATGAC GGCCTTCAAT ACAGTTGGCA GGTGGCCCAA CCCCGCTCCC TGAGTAATTA TAACCAGAGT TTAGAGAAAA CCATAGCCGG CTGGAACGAT TTGAGTGTAT ATGAACAAGT CCTGGCCCGG GAAAATATGC CGGCTGAGAT AGCCCAGTTT TTAGATGCCG TCTTGAATGC CCCTCCCGGT GATTTGCGGC CCTGGATAAA AGAAAAATTA AACCTGGAGT ACACCGCCTC CCTGGCCCGG GAACTGGAAG GCATGGCCGG GTCCGAGGGA TATGATCGCT TTCACCGGGC CGAATTCATA TTGAGCTTTG TCCAATCGTT ACCGTATATA TATACGCCCC TGCCCCGCCT GGCAGGAGAA ACCCTGATCA AGGGCGGCGA CTGTAAAAGC AAGTCAATTT TATTGGCTTC TTTGCTCCAT AACCTCGGCT ACCAGACAAT CCTCCTGGAA TTCCCGCCGG AAGAGTTTGC CGATAATATT GGCCATGAAG CTGTGGCCAT TGCCTTTAAC AGGGATGAAC TACCTCCCGG CAGGAATCTC TTTGCCTTTG ATTACAACGG TCGCAGCTAT TATTATGCTG AAACGACCGC CACCGGCTGG GGCCTGGGGG AAATGCCGGA GCTCCTGCAA AACAAAAAGG CTGCTATCTT CCCCCTGGAT TTCTAG
|
Protein sequence | MQGEAFMRRI TIILIIALLT VFTLPAYAAG EFKASFTVGQ NFYTVNNQKI VMDAVPFVRE NRTYIPVRYL ANALGIDANH ISWNEVSGTV TLKDATGTSS VEISMKVNHR DLTTVTQNGA NNSLRAVART ETMDVAPLLI NGRVYLPARY VAEALGYTVT WDPDTQTVYI MPGGVSITPG ETLPAPPDQY LNKGYIWVYD GLQYSWQVAQ PRSLSNYNQS LEKTIAGWND LSVYEQVLAR ENMPAEIAQF LDAVLNAPPG DLRPWIKEKL NLEYTASLAR ELEGMAGSEG YDRFHRAEFI LSFVQSLPYI YTPLPRLAGE TLIKGGDCKS KSILLASLLH NLGYQTILLE FPPEEFADNI GHEAVAIAFN RDELPPGRNL FAFDYNGRSY YYAETTATGW GLGEMPELLQ NKKAAIFPLD F
|
| |