Gene Moth_0462 details


Gene Information

Locus tag: Moth_0462
Symbol:
ID: 3830891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 464606
End bp: 465811
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 62%
IMG OID: 637828397
Product: amidohydrolase
Protein accession: YP_429336
Protein GI: 83589327
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
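
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 464606
end_bp = 465811
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Both comparisons hold for the values listed here: 465811 - 464606 + 1 = 1206 bp, and 1206 / 3 - 1 = 401 aa.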


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000842595
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCT TCCACATTAG CCCGGAGACC AACCTGGACC TCCTGGTTGA CAACCTCTGG 
GACGGCCAGC AGGAAGGGTA CCGGCAGCAG GTAGTCATCA GCATCCGCCG GGGGCGGATC
GCTGCTGTCC AGCCCAGAGA AATGCAGGGA ACGGGAAACG GTACCCAACA GATGCGCCTT
ACAGGACTTA CGGTCCTGCC CGGCTTAATC GACGCCCACG TCCACCTGGC CCTCGATGGC
ATCGATTTTC AGGCTTCCCT GGACCGCTGG CAGGACCCGC TCCTCAGGGA GAGGGTCCTG
GCCCGGGCCC TGCGGGTCAC CCTGGAGCAT GGCCTGGTGG CCATCAGGGA CGGCAGCGAC
CGGGAAGGTC TCAACCTCCA GGCCCGGGAA TGGGTCCGTG CCGGCAAGTA CCCGGGCCCC
CGGGTGGTAG CGACAGGAAT GGCCGTCCAT AAAAAAGGAA AATATGGTTC TTTCCTTGGC
CCTGGCACCA CTGACCCGGC CTCAATCAGG GAACTGGTTA CTAGCCTGGT AAACCGGAAC
GTCGACCAGG TTAAAGTGGT TGTTTCCGGC CTGGTTACCT TCCACCGTTA CGGGGAGGTT
GGCAGTCTGG AGTTTGCTAC TGCTGAATTG GTTGAAGCCG TCAAGACGGC CCATGCCGCC
GGGCGACCGG TGATGGCCCA TGTTAACTCG GCCCCCGGCG TAGACCTGGC CCTGGCCGCC
GGGGTAGATA GCATCGAGCA CGGCTATTTC CTCACGACGG CCCAGCTGGA GACTATGGCT
GCCAGGGGTA CTTTCTGGGT ACCGACGGTA GCCGCTATCG CCAACCGCCT GCACACCGCG
AAAAGAGAGG TCTACCCGGA AAGGGAAATT GATATAATCC GGCGGACCCA GGAATCCCAG
CAGGAGATGG TTGCCCGGGC CCACCGCCTG GGAGTAAAGC TGGTGGTAGG CACCGATGCC
GGTGCCCCCG GTGTCTACCA CGGGGAATCC TACCTGGATG AACTGTTGTA CTGGTACCAG
GCCGGTATCC CGGCGGCGGC CATCCTCCGG GCGGCTACGG TCACGGCTGC CGCTGCCCTG
GGCCTGGACG GGGAACTGGG GCAAATCCGT CCCGGCTACC GGCCCTGCCT GATAGCCGTC
CGGGGTAACC CCCTGGAGAA TTTAAGGGTC CTGGCGCAAC CCGAAATGGT TTTTATTGAT
AATTGA
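
The gene sequence is printed in blocks of ten bases, so the blocks need to be joined before any analysis. A minimal Python sketch follows, assuming the sequence has been pasted in as a string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short demo string reuses only the first block and the last line of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Demo fragment: first block and last line of the gene sequence,
# not the full gene.
fragment = clean_sequence("ATGAAGCTCT\nAATTGA")

print(fragment)
print(fragment.startswith("ATG"))  # begins with the ATG start codon
print(fragment.endswith("TGA"))    # ends with the TGA stop codon
print(gc_fraction(fragment))
```

Running the same helpers on the full gene sequence gives a GC fraction that can be compared with the GC content field of the record.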
 
Protein sequence
MKLFHISPET NLDLLVDNLW DGQQEGYRQQ VVISIRRGRI AAVQPREMQG TGNGTQQMRL 
TGLTVLPGLI DAHVHLALDG IDFQASLDRW QDPLLRERVL ARALRVTLEH GLVAIRDGSD
REGLNLQARE WVRAGKYPGP RVVATGMAVH KKGKYGSFLG PGTTDPASIR ELVTSLVNRN
VDQVKVVVSG LVTFHRYGEV GSLEFATAEL VEAVKTAHAA GRPVMAHVNS APGVDLALAA
GVDSIEHGYF LTTAQLETMA ARGTFWVPTV AAIANRLHTA KREVYPEREI DIIRRTQESQ
QEMVARAHRL GVKLVVGTDA GAPGVYHGES YLDELLYWYQ AGIPAAAILR AATVTAAAAL
GLDGELGQIR PGYRPCLIAV RGNPLENLRV LAQPEMVFID N