Gene Moth_0963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0963 
Symbol 
ID3831238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp994889 
End bp996175 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID637828892 
Productamidohydrolase 
Protein accessionYP_429821 
Protein GI83589812 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000377976 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAAGA TTTTAATCAA GGATTGTACC ATCGTTCCGA TAAGCGGTCC GGTAATTGGG 
AAGGGGGTTA TCGCCATTAA TGACGACCGC CTCCATTATG TCGGCCCGGC AGGCGGTTTG
CCGGCCGGCT GGCAGGCGGA TACCGTAATT GATGCCGGCG ATATGGTGGC CCTGCCGGGC
CTGGTCAATG CCCATACCCA CGCCGCCATG ACCCTGCTGC GGAGCTACGC CGACGACCTG
CCACTTAAAC AATGGTTGGA AGAAAAAATC TGGCCCCGCG AGGACCGGCT GGAGCGGGAA
GATATCTACT GGGGTAGCAA AATCGCCCTG CTGGAGATGA TCCGCTCCGG CACCACCACC
TTTGCCGATA TGTATTTCCA TATGGATGCC GTGGCCGGGG CAGTGGTGGA AGCCGGGCTG
CGGGCCAGCC TTTGCCAGGG ATTGATAGGC CTTCAGGATA CCAGCAACAA ACGGCTGGAA
GCAGGCATCA GTATGGTAAA AGAGTGGCAT GGCGCTGGTG AGGGACGGAT TACCACCATG
CTGGGTCCCC ACGCCCCCAA TACCTGCACG CCGGAGTATT TAACCAGGGT TGCCGAGACA
GCGGCCGGGC TGGGGGTGGG GTTACATATC CACCTGGCGG AAACGCGGGG AGAGGTCGAA
GACGTAAAAG CCCGTTACGG GGCTACCCCG GTAGCCCTGG TCAACAAGCT GGGTCTCCTG
GACCTGCCAG TCCTGGCGGC CCACTGCGTC CACCTGACCA CCGAGGAAAT CGCTATCCTG
GCCGAGAAAA AGGTCGGCGT GGCCCACTGC CCGGAGAGTA ATCTTAAACT GGCCAGCGGC
GTGGCCCCGG TAAAGGAGAT GCTGGCTGCC GGCGTCAACG TGGCCATCGG CACTGACGGC
GCCTCCAGCA ACAATAACCT GGACATGGTG GCTGAGACCC GGACGGCGGC CCTGCTGGCT
AAAGGCATTA CCGGCGACCC CACGGTAGTA CCCGCCCACC AGGCCCTGGT TATGGCCACC
CTGAACGGTG CCCGGGCCCT GGGCCTGGAA AAGGAGATCG GCACCCTGGA AGCGGGTAAA
AAGGCCGACC TGATCCTGGT GGACATGCGC CAGCCCCACT TGATGCCGCC CAACGATGTC
GAGGCCAACC TGGTTTACGC CGCCCGGGGA AGCGACGTGG ATACGGTTAT CGTTAACGGG
AAGATTCTCA TGGCCAGGGG TGAGGTTAAG ACCCTGGATG CCGAAGAGAT TTACGCCCAG
GTGACGAAGA GGATGCACAA AGGGTAA
 
Protein sequence
MGKILIKDCT IVPISGPVIG KGVIAINDDR LHYVGPAGGL PAGWQADTVI DAGDMVALPG 
LVNAHTHAAM TLLRSYADDL PLKQWLEEKI WPREDRLERE DIYWGSKIAL LEMIRSGTTT
FADMYFHMDA VAGAVVEAGL RASLCQGLIG LQDTSNKRLE AGISMVKEWH GAGEGRITTM
LGPHAPNTCT PEYLTRVAET AAGLGVGLHI HLAETRGEVE DVKARYGATP VALVNKLGLL
DLPVLAAHCV HLTTEEIAIL AEKKVGVAHC PESNLKLASG VAPVKEMLAA GVNVAIGTDG
ASSNNNLDMV AETRTAALLA KGITGDPTVV PAHQALVMAT LNGARALGLE KEIGTLEAGK
KADLILVDMR QPHLMPPNDV EANLVYAARG SDVDTVIVNG KILMARGEVK TLDAEEIYAQ
VTKRMHKG