Gene Moth_1224 details

Gene Information

Locus tag: Moth_1224
Symbol:
ID: 3832859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1261375
End bp: 1262700
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 58%
IMG OID: 637829159
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_430081
Protein GI: 83590072
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
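
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1261375
end_bp = 1262700
gene_length_bp = 1326
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with exactly one stop codon that is not translated.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codon count agrees with the protein length
```

Under these assumptions all three checks print True for this record.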


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATAT TAATCAAAAA CGGCACCCTG GTCACCATGA ACCCCCAGCG GGAGGTGTAT 
CAGGGAAACA TCTATATTGA AGACGATCGC ATTGCCGCCA TCGGCCAGAC GCCGGCCACC
GCCGACCGGA TCATCGAGGC TAAGGGCCAG TTAGTCATCC CCGGCCTCGT CCAGCCCCAT
ATCCACCTCT GTCAGACCCT CTTCCGCGGC CGCGCCGACG ACCTGGAACT CCTGGACTGG
TTGCGCCTGC GTATCTGGCC CCTGGAGGGT GCCCACGATC CCGAGTCCCT TTATTACTCC
GCCCTCCTCG GCATAGGTGA ACTCTTCCTG AGCGGGACCA CTACCATCGT GGATATGGAA
ACCGTCCACC ATACAGAGGC AGCCATCGAG GCCATCGCCC AAAGCGGCAT CCGAGCCATC
ACCGGCAAAG TGATGATGGA TTTTGGCGAA GACGTTCCGG AGACCCTCAG GGAAACCACG
GAGGCATCCC TGCAGGAGAG CGTCCAATTG CTTGAGAAGT GGCACGGGCA CGATAACGGC
CGCATCCAGT ACGCCTTCGA GCCCCGCTTT GTGGTCTCCT GTACCGAGGA ATTATTATTA
AAGGTCCGGG ATCTCGCCCG CAAGTACGGT GTTAAAATCC ATACCCACGC CTCGGAAAAC
TTGGGTGAAT GCGCCCTGGT GGAAAAGCTC CACCACCGGC GTAACGTCCT CTATCTTGAC
GATATCGGCC TCACCGGCCC CGGCCTCATC CTTGTCCACT GCATCTGGCT CGATGAAGAA
GAGAAAGATA TTCTGGCTCG CACTGGTACC AAGGTGGTTC ATTGCCCTTC CTCTAACTTA
AAGATGGCTT CCGGTATCTG CCCCGTACCG GATCTTTTGA GCCGGGGGAC GGTGGTTTCC
CTGGCCGCCG ATGGCGCGCC CTGCAATAAC AACCTGGACG CTTTCATGGA AATGCGGCTG
GCGGCGCTAA TTCAAAAGCC TGTCCACGGC CCTACGTCCA TGCCGGCGTC TGTAATCTTT
GAAATGGCCA CCCTGGGTGG GGCCCGAGCC ATGGGAATGG AAAAGGAAAT CGGCAGCCTG
GAAGTGGGGA AGAAGGCTGA CCTGGCCCTG GTCTCCGTGG ATGGCCTCCA TACCCAGCCA
GAAGACGGCG TTAACGTCTA CACCCAGCTG GTCTACCAGG CTAAAGGGTC GGACGTCACC
CTGACCATGG TCGACGGCAA AATCGTCATG GAGAAGGGCG AACTGAAGAC CATAGATGTC
GATGAAGTCC GGCGGAAGGC CAACCAGGCC ATCCAGCGTG TCGCCCACAG AGCCGGGCTG
GCTTAG
 
Protein sequence
MTILIKNGTL VTMNPQREVY QGNIYIEDDR IAAIGQTPAT ADRIIEAKGQ LVIPGLVQPH 
IHLCQTLFRG RADDLELLDW LRLRIWPLEG AHDPESLYYS ALLGIGELFL SGTTTIVDME
TVHHTEAAIE AIAQSGIRAI TGKVMMDFGE DVPETLRETT EASLQESVQL LEKWHGHDNG
RIQYAFEPRF VVSCTEELLL KVRDLARKYG VKIHTHASEN LGECALVEKL HHRRNVLYLD
DIGLTGPGLI LVHCIWLDEE EKDILARTGT KVVHCPSSNL KMASGICPVP DLLSRGTVVS
LAADGAPCNN NLDAFMEMRL AALIQKPVHG PTSMPASVIF EMATLGGARA MGMEKEIGSL
EVGKKADLAL VSVDGLHTQP EDGVNVYTQL VYQAKGSDVT LTMVDGKIVM EKGELKTIDV
DEVRRKANQA IQRVAHRAGL A
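
The sequences above are displayed in space-separated blocks of ten letters, so they need to be joined before any analysis. The sketch below shows one way to do that, and to compute a GC fraction and a translation. It is an illustration, not the method used to produce this record: the function names are hypothetical, and the codon table is built by hand from the standard genetic code, whose amino-acid assignments translation table 11 shares (the two differ in their permitted start codons).

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, exactly as displayed in the record.
first_blocks = "ATGACTATAT TAATCAAAAA CGGCACCCTG"
dna = clean_sequence(first_blocks)
print(translate(dna))  # MTILIKNGTL
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence through clean_sequence and then gc_fraction or translate would allow the GC content and Protein Length fields to be checked in the same way.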