Gene Moth_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1966 
Symbol 
ID3831148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2047566 
End bp2049335 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content61% 
IMG OID637829897 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_430807 
Protein GI83590798 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAGA AGAAACGCGC CGGCGCCCTG CGGGATGTTG ATCCCTTTAC CGTGGAAATT 
ATCAAGGATT CCCTGATAGC CATTGGCGAT GAGATGTTCG TGGCCCTCCA GCGCACCAGC
AAGAGCACCA TTATCTACGA GGTCCTCGAC TATGCCTGCG GCCTCACCGA CAACCAGGCC
CGGTTAATTA CCCAGGGCAA CGGGGTGACC GGTTTTCTGG GTACCCTGGA TTTTTCCGTC
AAGGACGCCA TCAATAAATT TGCTGCCCGG GGCCTCTTGA ACCCCGGGGA TGTCATTATC
ACCAACGATC CTTACGGCGG CGGCGGTACC CACCTCTCCG ACGTCTGCCT GGTCATGCCC
ATCTTCTATG ATGGCGAGAT CGTTGCCTAT TCGGCCAACA AGGCCCACTG GACCGAAGTG
GGCGGTAAGG ATCCGGGCAG CTGGACAACC GACTCCACCG AGGTTTACCA GGAGGGGCTG
CAGTTTCCCT GCATTAAAAT ATTCGAAGGC GGCAAGCCAA TCCAGAGTCT GGTGGATCTC
ATCGCCGCCA ACGTGCGCAT GCCGGATATG ACCCTGGGGG ACCTCTGGGC CGGGGTCGCC
GCCCTGCGGG TGGGCGAGAG ACGGTTCGTG GAGCTCTGCG ATAAATACGG CAAGGACCTG
GTCCTGGCCA GCATCGAGCA GCTCCTGGAC AACGGGGAGC GGCTGGTACG CCTGGAATTA
GCCAAATTGC CCAAGGGCGT GTACGAGGCT GAAGACTACA TCGATGATGA CGGCCTGGGT
AACGGCCCCT TCCGGGTCTG CGTCAAAGTG ACCATTACTG ACGACGAGTT CATCTGTGAC
TTCCGGGGCA CCCATCCCCA GGTGCCCGGG CCGGTGAATT GTAGCTATAC CGGCCTGGTG
GCCGGGGCGC GGTGCATCTT TAAGGCCATT ACCAACCCGG CCATACCTGC CAATGAGGGG
ACCTTCCGGC CCCTGAAGAT CATTACCGAG CCCCGGACCA TTTTTACGGC CGAGCGGCCT
GCCGCCGTCT CCACCTACTG GGAGACTATG CTCTACGTCA CCGATCTCAT CTGGAAGGCC
CTGGCGCCGG CGATACCGGA GCGCTTAACT GCTGGGCACT TCCTCAGCGT CTGCGCCGAT
GTCCTGGCCA ATATCCACCC CGACACCGGC GAACTGGCCC TCCTGGTGGA ACCAACGGCC
GGGGGATGGG GCGCCGGCAA GTATAAAGAC GGCGAGAACG GCATGGTCTG CATCGGCGAC
GGTGAAACTT ATATTATTCC GGTGGAGGTC GCCGAAACAC GTTACGGTTT TCTGGTCGAC
CAGTACGCTT TAAATACCGC CGGTGGCGGT GCCGGCGAGA AGCGTGGCGG CACAGGGGTC
ATCCGCGACT ATCGCATCCT GGCTGATGAG GCCTACTTCA CCGCCACCTT CGGCCGGCAC
AAGTTCCTGC CCTGGGGTAT GGACGGTGGC AAACCGGGCA CGCGGAACGA AGTGCGGGTT
ATTTCAGCCC ACGGTGAACC GGAACGGGTC TTTGGCAAGT GCGCCCGCCT GCGCCTGCGC
CGGAACGATG TAGTCCGCCT GATTACCGGC TGCGGTGGAG GATATGGCAA CCCTTATCGC
CGGCCGGTGG AGAAGGTCCA GGAGGATGTC CGCAACGGGT ATATCACCCT GACCCAGGCC
TGGGAAGATT ACGGTGTGCG CTTGAACCCC GAAACCCTGG CGGTCGAGGA ACTGGCGCCC
CAGCGGCAAG ATAATAGAGC AAAGGATTAG
 
Protein sequence
MEEKKRAGAL RDVDPFTVEI IKDSLIAIGD EMFVALQRTS KSTIIYEVLD YACGLTDNQA 
RLITQGNGVT GFLGTLDFSV KDAINKFAAR GLLNPGDVII TNDPYGGGGT HLSDVCLVMP
IFYDGEIVAY SANKAHWTEV GGKDPGSWTT DSTEVYQEGL QFPCIKIFEG GKPIQSLVDL
IAANVRMPDM TLGDLWAGVA ALRVGERRFV ELCDKYGKDL VLASIEQLLD NGERLVRLEL
AKLPKGVYEA EDYIDDDGLG NGPFRVCVKV TITDDEFICD FRGTHPQVPG PVNCSYTGLV
AGARCIFKAI TNPAIPANEG TFRPLKIITE PRTIFTAERP AAVSTYWETM LYVTDLIWKA
LAPAIPERLT AGHFLSVCAD VLANIHPDTG ELALLVEPTA GGWGAGKYKD GENGMVCIGD
GETYIIPVEV AETRYGFLVD QYALNTAGGG AGEKRGGTGV IRDYRILADE AYFTATFGRH
KFLPWGMDGG KPGTRNEVRV ISAHGEPERV FGKCARLRLR RNDVVRLITG CGGGYGNPYR
RPVEKVQEDV RNGYITLTQA WEDYGVRLNP ETLAVEELAP QRQDNRAKD