Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1966 |
Symbol | |
ID | 3831148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2047566 |
End bp | 2049335 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637829897 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_430807 |
Protein GI | 83590798 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGAGA AGAAACGCGC CGGCGCCCTG CGGGATGTTG ATCCCTTTAC CGTGGAAATT ATCAAGGATT CCCTGATAGC CATTGGCGAT GAGATGTTCG TGGCCCTCCA GCGCACCAGC AAGAGCACCA TTATCTACGA GGTCCTCGAC TATGCCTGCG GCCTCACCGA CAACCAGGCC CGGTTAATTA CCCAGGGCAA CGGGGTGACC GGTTTTCTGG GTACCCTGGA TTTTTCCGTC AAGGACGCCA TCAATAAATT TGCTGCCCGG GGCCTCTTGA ACCCCGGGGA TGTCATTATC ACCAACGATC CTTACGGCGG CGGCGGTACC CACCTCTCCG ACGTCTGCCT GGTCATGCCC ATCTTCTATG ATGGCGAGAT CGTTGCCTAT TCGGCCAACA AGGCCCACTG GACCGAAGTG GGCGGTAAGG ATCCGGGCAG CTGGACAACC GACTCCACCG AGGTTTACCA GGAGGGGCTG CAGTTTCCCT GCATTAAAAT ATTCGAAGGC GGCAAGCCAA TCCAGAGTCT GGTGGATCTC ATCGCCGCCA ACGTGCGCAT GCCGGATATG ACCCTGGGGG ACCTCTGGGC CGGGGTCGCC GCCCTGCGGG TGGGCGAGAG ACGGTTCGTG GAGCTCTGCG ATAAATACGG CAAGGACCTG GTCCTGGCCA GCATCGAGCA GCTCCTGGAC AACGGGGAGC GGCTGGTACG CCTGGAATTA GCCAAATTGC CCAAGGGCGT GTACGAGGCT GAAGACTACA TCGATGATGA CGGCCTGGGT AACGGCCCCT TCCGGGTCTG CGTCAAAGTG ACCATTACTG ACGACGAGTT CATCTGTGAC TTCCGGGGCA CCCATCCCCA GGTGCCCGGG CCGGTGAATT GTAGCTATAC CGGCCTGGTG GCCGGGGCGC GGTGCATCTT TAAGGCCATT ACCAACCCGG CCATACCTGC CAATGAGGGG ACCTTCCGGC CCCTGAAGAT CATTACCGAG CCCCGGACCA TTTTTACGGC CGAGCGGCCT GCCGCCGTCT CCACCTACTG GGAGACTATG CTCTACGTCA CCGATCTCAT CTGGAAGGCC CTGGCGCCGG CGATACCGGA GCGCTTAACT GCTGGGCACT TCCTCAGCGT CTGCGCCGAT GTCCTGGCCA ATATCCACCC CGACACCGGC GAACTGGCCC TCCTGGTGGA ACCAACGGCC GGGGGATGGG GCGCCGGCAA GTATAAAGAC GGCGAGAACG GCATGGTCTG CATCGGCGAC GGTGAAACTT ATATTATTCC GGTGGAGGTC GCCGAAACAC GTTACGGTTT TCTGGTCGAC CAGTACGCTT TAAATACCGC CGGTGGCGGT GCCGGCGAGA AGCGTGGCGG CACAGGGGTC ATCCGCGACT ATCGCATCCT GGCTGATGAG GCCTACTTCA CCGCCACCTT CGGCCGGCAC AAGTTCCTGC CCTGGGGTAT GGACGGTGGC AAACCGGGCA CGCGGAACGA AGTGCGGGTT ATTTCAGCCC ACGGTGAACC GGAACGGGTC TTTGGCAAGT GCGCCCGCCT GCGCCTGCGC CGGAACGATG TAGTCCGCCT GATTACCGGC TGCGGTGGAG GATATGGCAA CCCTTATCGC CGGCCGGTGG AGAAGGTCCA GGAGGATGTC CGCAACGGGT ATATCACCCT GACCCAGGCC TGGGAAGATT ACGGTGTGCG CTTGAACCCC GAAACCCTGG CGGTCGAGGA ACTGGCGCCC CAGCGGCAAG ATAATAGAGC AAAGGATTAG
|
Protein sequence | MEEKKRAGAL RDVDPFTVEI IKDSLIAIGD EMFVALQRTS KSTIIYEVLD YACGLTDNQA RLITQGNGVT GFLGTLDFSV KDAINKFAAR GLLNPGDVII TNDPYGGGGT HLSDVCLVMP IFYDGEIVAY SANKAHWTEV GGKDPGSWTT DSTEVYQEGL QFPCIKIFEG GKPIQSLVDL IAANVRMPDM TLGDLWAGVA ALRVGERRFV ELCDKYGKDL VLASIEQLLD NGERLVRLEL AKLPKGVYEA EDYIDDDGLG NGPFRVCVKV TITDDEFICD FRGTHPQVPG PVNCSYTGLV AGARCIFKAI TNPAIPANEG TFRPLKIITE PRTIFTAERP AAVSTYWETM LYVTDLIWKA LAPAIPERLT AGHFLSVCAD VLANIHPDTG ELALLVEPTA GGWGAGKYKD GENGMVCIGD GETYIIPVEV AETRYGFLVD QYALNTAGGG AGEKRGGTGV IRDYRILADE AYFTATFGRH KFLPWGMDGG KPGTRNEVRV ISAHGEPERV FGKCARLRLR RNDVVRLITG CGGGYGNPYR RPVEKVQEDV RNGYITLTQA WEDYGVRLNP ETLAVEELAP QRQDNRAKD
|
| |