Gene Moth_0460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0460 
Symbol 
ID3830889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp462427 
End bp463650 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content60% 
IMG OID637828395 
Productamidohydrolase 
Protein accessionYP_429334 
Protein GI83589325 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000769094 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATT TGCTAATCCG AAGAGCAAGG CTCATGGACG GCCAGGGGAC AGTGGATATA 
GCCATCAAAG ACGGATATAT TGTTGCTGCC GGCAATAATG TGGCGGGGTC GGCCCGGCAG
ATGGTTGATG CCTCCGGCAG GCTCCTGATA CCGGCCTTTG TCGACGCCCA TACCCACCTG
GATAAAGCCC TGACGGCGAC AGACGGCGGC GCCGGTTCCC TGGAAGCAGC CATCGAAGAC
TTCCAGCGCC GGAGTAAAGA TATAGATAAA AACGACCTGC TGGCCAGGGG CCGCCAGGTA
CTGCGGTTGG CCCTGCGCCA CGGTACCACA GCCATGCGCA CCCACATCAC CGTCAGTGAG
AACCTGGGCC TGCGGGGCAT TGAGGCTGCC CTGGAATTGA GGGAAGAGTT TGCCGGTAAG
GTTGACCTCC AGGTGATTGC CATGTTCAGC GGTCCGGAGC CGGAGCCGGC ACCCCCTTTA
AAGGAACTCC TAGAGGAAGC CCTGCGGCTG GGGGTAGACG GCCTGGGCGG GGCACCCCAT
CTCTCGCCTG GTATGCAACA ATGGGTGGAC TATATCTTCG AACTGGCCGG CAAATACGAT
GTTCCCATTG ACCTGCACGC CGACGAGACT GACGCTCCTT CGGTGGCTTC CCTGGAGTAT
ATAGCCAGTA AGACTATTCA GGCAGGTTAC CAGGGCCGGG TGGTTGCCGA CCACTGCTGT
GGCCTGGCGG CAGTTGATGA AGCTACTGCC GGCCGTACCA TAGCCGCCGT CAAGGAGGCC
GGCCTGAGTA TCATTACCTT ACCCTCCTGC AACCTCTACC TGATGGGCCG TAACGATAAA
GGACTGGTCC GCCGGGGGGT GACCCGGGTA CGGGAACTCC AGGCCGCCGG CGTCAATGTC
GCCTACGCCT CCGACAACAT CCGCGATGCC TTCCGGCCCT TTGGTAATGC CAACATGCTG
GAGGAAGGCC TGATCACCGC CCAGGTTTTG CAGATGGGTA CCCCGGCGGA GCTCGAACAG
GTCATGAAGA TGGGCACCTA TAACGCCGCC GCTGCCATGG GATTGCAGGA TTACGGCATC
AAGGTCGGCG CCAGGGCCGA CCTGGTCCTC CTGGATGCCA CCACCCCGGC CGGGGCGATT
ATAGGCCAGG TGGAGAAGGT CTGCGTCATT AAAGGCGGCC GGGTGGCCGT GCGCAATGAT
AAAAAATCCG ATATCATTAT CTAA
 
Protein sequence
MNDLLIRRAR LMDGQGTVDI AIKDGYIVAA GNNVAGSARQ MVDASGRLLI PAFVDAHTHL 
DKALTATDGG AGSLEAAIED FQRRSKDIDK NDLLARGRQV LRLALRHGTT AMRTHITVSE
NLGLRGIEAA LELREEFAGK VDLQVIAMFS GPEPEPAPPL KELLEEALRL GVDGLGGAPH
LSPGMQQWVD YIFELAGKYD VPIDLHADET DAPSVASLEY IASKTIQAGY QGRVVADHCC
GLAAVDEATA GRTIAAVKEA GLSIITLPSC NLYLMGRNDK GLVRRGVTRV RELQAAGVNV
AYASDNIRDA FRPFGNANML EEGLITAQVL QMGTPAELEQ VMKMGTYNAA AAMGLQDYGI
KVGARADLVL LDATTPAGAI IGQVEKVCVI KGGRVAVRND KKSDIII