Gene Information |
Locus tag | Moth_0462 |
Symbol | |
ID | 3830891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 464606 |
End bp | 465811 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828397 |
Product | amidohydrolase |
Protein accession | YP_429336 |
Protein GI | 83589327 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000842595 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCT TCCACATTAG CCCGGAGACC AACCTGGACC TCCTGGTTGA CAACCTCTGG GACGGCCAGC AGGAAGGGTA CCGGCAGCAG GTAGTCATCA GCATCCGCCG GGGGCGGATC GCTGCTGTCC AGCCCAGAGA AATGCAGGGA ACGGGAAACG GTACCCAACA GATGCGCCTT ACAGGACTTA CGGTCCTGCC CGGCTTAATC GACGCCCACG TCCACCTGGC CCTCGATGGC ATCGATTTTC AGGCTTCCCT GGACCGCTGG CAGGACCCGC TCCTCAGGGA GAGGGTCCTG GCCCGGGCCC TGCGGGTCAC CCTGGAGCAT GGCCTGGTGG CCATCAGGGA CGGCAGCGAC CGGGAAGGTC TCAACCTCCA GGCCCGGGAA TGGGTCCGTG CCGGCAAGTA CCCGGGCCCC CGGGTGGTAG CGACAGGAAT GGCCGTCCAT AAAAAAGGAA AATATGGTTC TTTCCTTGGC CCTGGCACCA CTGACCCGGC CTCAATCAGG GAACTGGTTA CTAGCCTGGT AAACCGGAAC GTCGACCAGG TTAAAGTGGT TGTTTCCGGC CTGGTTACCT TCCACCGTTA CGGGGAGGTT GGCAGTCTGG AGTTTGCTAC TGCTGAATTG GTTGAAGCCG TCAAGACGGC CCATGCCGCC GGGCGACCGG TGATGGCCCA TGTTAACTCG GCCCCCGGCG TAGACCTGGC CCTGGCCGCC GGGGTAGATA GCATCGAGCA CGGCTATTTC CTCACGACGG CCCAGCTGGA GACTATGGCT GCCAGGGGTA CTTTCTGGGT ACCGACGGTA GCCGCTATCG CCAACCGCCT GCACACCGCG AAAAGAGAGG TCTACCCGGA AAGGGAAATT GATATAATCC GGCGGACCCA GGAATCCCAG CAGGAGATGG TTGCCCGGGC CCACCGCCTG GGAGTAAAGC TGGTGGTAGG CACCGATGCC GGTGCCCCCG GTGTCTACCA CGGGGAATCC TACCTGGATG AACTGTTGTA CTGGTACCAG GCCGGTATCC CGGCGGCGGC CATCCTCCGG GCGGCTACGG TCACGGCTGC CGCTGCCCTG GGCCTGGACG GGGAACTGGG GCAAATCCGT CCCGGCTACC GGCCCTGCCT GATAGCCGTC CGGGGTAACC CCCTGGAGAA TTTAAGGGTC CTGGCGCAAC CCGAAATGGT TTTTATTGAT AATTGA
|
Protein sequence | MKLFHISPET NLDLLVDNLW DGQQEGYRQQ VVISIRRGRI AAVQPREMQG TGNGTQQMRL TGLTVLPGLI DAHVHLALDG IDFQASLDRW QDPLLRERVL ARALRVTLEH GLVAIRDGSD REGLNLQARE WVRAGKYPGP RVVATGMAVH KKGKYGSFLG PGTTDPASIR ELVTSLVNRN VDQVKVVVSG LVTFHRYGEV GSLEFATAEL VEAVKTAHAA GRPVMAHVNS APGVDLALAA GVDSIEHGYF LTTAQLETMA ARGTFWVPTV AAIANRLHTA KREVYPEREI DIIRRTQESQ QEMVARAHRL GVKLVVGTDA GAPGVYHGES YLDELLYWYQ AGIPAAAILR AATVTAAAAL GLDGELGQIR PGYRPCLIAV RGNPLENLRV LAQPEMVFID N
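
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Moth_0462).
length = gene_length(464606, 465811)
print(length)                  # 1206, matches "Gene Length"
print(protein_length(length))  # 401, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.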