Gene Information |
Locus tag | Moth_1224 |
Symbol | |
ID | 3832859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1261375 |
End bp | 1262700 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829159 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_430081 |
Protein GI | 83590072 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
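The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the listed values are consistent with that) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1261375, 1262700)
print(length, protein_length_aa(length))  # prints: 1326 441
```

Both results agree with the Gene Length and Protein Length fields in this record.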
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATAT TAATCAAAAA CGGCACCCTG GTCACCATGA ACCCCCAGCG GGAGGTGTAT CAGGGAAACA TCTATATTGA AGACGATCGC ATTGCCGCCA TCGGCCAGAC GCCGGCCACC GCCGACCGGA TCATCGAGGC TAAGGGCCAG TTAGTCATCC CCGGCCTCGT CCAGCCCCAT ATCCACCTCT GTCAGACCCT CTTCCGCGGC CGCGCCGACG ACCTGGAACT CCTGGACTGG TTGCGCCTGC GTATCTGGCC CCTGGAGGGT GCCCACGATC CCGAGTCCCT TTATTACTCC GCCCTCCTCG GCATAGGTGA ACTCTTCCTG AGCGGGACCA CTACCATCGT GGATATGGAA ACCGTCCACC ATACAGAGGC AGCCATCGAG GCCATCGCCC AAAGCGGCAT CCGAGCCATC ACCGGCAAAG TGATGATGGA TTTTGGCGAA GACGTTCCGG AGACCCTCAG GGAAACCACG GAGGCATCCC TGCAGGAGAG CGTCCAATTG CTTGAGAAGT GGCACGGGCA CGATAACGGC CGCATCCAGT ACGCCTTCGA GCCCCGCTTT GTGGTCTCCT GTACCGAGGA ATTATTATTA AAGGTCCGGG ATCTCGCCCG CAAGTACGGT GTTAAAATCC ATACCCACGC CTCGGAAAAC TTGGGTGAAT GCGCCCTGGT GGAAAAGCTC CACCACCGGC GTAACGTCCT CTATCTTGAC GATATCGGCC TCACCGGCCC CGGCCTCATC CTTGTCCACT GCATCTGGCT CGATGAAGAA GAGAAAGATA TTCTGGCTCG CACTGGTACC AAGGTGGTTC ATTGCCCTTC CTCTAACTTA AAGATGGCTT CCGGTATCTG CCCCGTACCG GATCTTTTGA GCCGGGGGAC GGTGGTTTCC CTGGCCGCCG ATGGCGCGCC CTGCAATAAC AACCTGGACG CTTTCATGGA AATGCGGCTG GCGGCGCTAA TTCAAAAGCC TGTCCACGGC CCTACGTCCA TGCCGGCGTC TGTAATCTTT GAAATGGCCA CCCTGGGTGG GGCCCGAGCC ATGGGAATGG AAAAGGAAAT CGGCAGCCTG GAAGTGGGGA AGAAGGCTGA CCTGGCCCTG GTCTCCGTGG ATGGCCTCCA TACCCAGCCA GAAGACGGCG TTAACGTCTA CACCCAGCTG GTCTACCAGG CTAAAGGGTC GGACGTCACC CTGACCATGG TCGACGGCAA AATCGTCATG GAGAAGGGCG AACTGAAGAC CATAGATGTC GATGAAGTCC GGCGGAAGGC CAACCAGGCC ATCCAGCGTG TCGCCCACAG AGCCGGGCTG GCTTAG
|
Protein sequence | MTILIKNGTL VTMNPQREVY QGNIYIEDDR IAAIGQTPAT ADRIIEAKGQ LVIPGLVQPH IHLCQTLFRG RADDLELLDW LRLRIWPLEG AHDPESLYYS ALLGIGELFL SGTTTIVDME TVHHTEAAIE AIAQSGIRAI TGKVMMDFGE DVPETLRETT EASLQESVQL LEKWHGHDNG RIQYAFEPRF VVSCTEELLL KVRDLARKYG VKIHTHASEN LGECALVEKL HHRRNVLYLD DIGLTGPGLI LVHCIWLDEE EKDILARTGT KVVHCPSSNL KMASGICPVP DLLSRGTVVS LAADGAPCNN NLDAFMEMRL AALIQKPVHG PTSMPASVIF EMATLGGARA MGMEKEIGSL EVGKKADLAL VSVDGLHTQP EDGVNVYTQL VYQAKGSDVT LTMVDGKIVM EKGELKTIDV DEVRRKANQA IQRVAHRAGL A
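The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a rough sketch, not the database's own method. It assumes the gene sequence is already listed in coding orientation (it begins with ATG and ends with TAG, so no reverse complement is applied despite the minus strand) and that the 10-character blocks are separated only by whitespace. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments of the standard genetic code in TCAG order.
# NCBI translation table 11 uses the same assignments; it differs in the
# set of permitted start codons, which this sketch does not model.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = "ATGACTATAT TAATCAAAAA CGGCACCCTG"
print(translate(fragment))  # prints: MTILIKNGTL
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence, or passing it to `gc_fraction` and comparing with the GC content field, gives a quick consistency check on the record.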