Gene Information |
Locus tag | Moth_0990 |
Symbol | |
ID | 3830866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1017408 |
End bp | 1019081 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828919 |
Product | hydantoinase/oxoprolinase |
Protein accession | YP_429848 |
Protein GI | 83589839 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
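The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record for Moth_0990.
length_bp = gene_length(1017408, 1019081)
print(length_bp)                  # 1674
print(protein_length(length_bp))  # 557
```

Both results match the Gene Length (1674 bp) and Protein Length (557 aa) fields in the record.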
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.454811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGTAG GCCTGGATAT GGGTGGTACC CATACCGATG TCGTCCTCAT AGCCGACGGC CGGGTGCAAC GCTACTGCAA GACACCGACC GACCCTGAAG ATTACCTCCA TACCGTTACC AGGGCCCTGG ACGAGGTCCT GGAGGATGTT GCAACCGGCG ATGTACAGCG CCTGAACCTC AGCACCACTG TCTGCACCAA TGCCATCGTG ACCGGCAGGA CCAGCCCGGT AGGCTTGCTC CTGGAACCGG GGCCGGGATT AAATCCCGCC GGGCTGACCT GCGGCGCTAA AAATTTTATC CTCTCGGGCG CCATCGATCA TCGCGGTCGC CCTACCTCTT CCCTGGTAGC AACCGAAATC GAAGCTGCCG ATAAAGAGCT AAGGAAAGCA GCCATCCGTC ACCTGGCCAT AGTAGGTAAG TTTTCCACCC GCAACCCGGA ACATGAATTG CAGATCAAGG AAGCCCTTTC CCCGACCTAT GACTTTATCA CCCTGGGCCA TCGCCTGTCG GGACGTTTGA ACTACCCCAG GCGGGTTTTT ACCGCCTATC TGAACAGTGC CGTAGCATCT ATCTATGATG CCTTTGCCAC CGCCATAAAT GCCTATGCCG ATGGCAGGCA ACTGCCAGTA GTACCGGATA TACTAAAAGC CGATGGTGGC ACCCTCTCCC TGGCCGCATC CCGGTCCCTC CCGGTGGAGA CCATCCTCTC CGGTCCCTCG GCCAGTATTA TGGGAGCCCT GGCCCTGGCG CCCTCTTCCC GGGATACTAT TATTCTCGAT ATCGGCGGTA CGACCACTGA TATCGCCTTC CTGGCCGACG GTGTCCCCCT CTTTGAACCT TTAGGCATAA CCCTGGCCGG TTACCCAACC CTGGTACGCG CCCTTTACAG TTATTCCTTG GGTCTGGGCG GCGACAGCTG CCTGCGGGTC AGGGACGGGC GCCTGAGCAT CGGTCCGGAG CGCCTGGGGC CGGCCCTGGC CCTGGGCGGC CCCGCCCCTA CTCCCACCGA CGCCCTGATT ACCATGGGCC GCCTGGACCT GGGGGATAAA GGGGCTGCCC GCCGGGGCCT GGATCAGTTA GGGGATCAAC TGGGCCTCAA TACCGGGGAG GTGGCTAATG CCATTATCAA ACAAATGGCC GGCGAGATTG CCCGCCAGAC CCGGGCCCTC CTTGAGAGAA TTAATAGCCG CCCGGTATAT ACCGTGCGGG AAGTCCTGGA GGACAAAAAA CTACGCCCGG AACAGGTAAT CGTCATCGGT GCCCCGGCGC CCCTCCTGGC GGCCGAGCTG GAAGCCGCTT TCGGCCTCCC GGTGGTAGCA CCCTCCCTGG CCGGGGTTGC CAATGCCATT GGCGCCGCCC TGAGCCAGCC CACTACGGAA ATCACCCTCC AGGCCGATAC AGAGCAGGGC TTCCTGACGA TTCCCGAAGA GGGCATCAGG GAGAAAGTCC AACGCGGTTT CAACCTGGAG GCAGCCCGCC AGCGGGCCCT GGCAGCCCTG CAGGATCGCC TGCGGCGCCT GGCACCGGCG GCAGCCGGTT CCGAATTGGA AGTAGTTGAG GAACAATCCT TTAACATGGT CTCCGGCTTT TATACCACGG GCAAAAACAT CAGAGTCAAA GTCCAGGTTA AACCCCAGGT CAGCCCCCTG GAGGGAGGTG AGCATAATGT ATAA
|
Protein sequence | MYVGLDMGGT HTDVVLIADG RVQRYCKTPT DPEDYLHTVT RALDEVLEDV ATGDVQRLNL STTVCTNAIV TGRTSPVGLL LEPGPGLNPA GLTCGAKNFI LSGAIDHRGR PTSSLVATEI EAADKELRKA AIRHLAIVGK FSTRNPEHEL QIKEALSPTY DFITLGHRLS GRLNYPRRVF TAYLNSAVAS IYDAFATAIN AYADGRQLPV VPDILKADGG TLSLAASRSL PVETILSGPS ASIMGALALA PSSRDTIILD IGGTTTDIAF LADGVPLFEP LGITLAGYPT LVRALYSYSL GLGGDSCLRV RDGRLSIGPE RLGPALALGG PAPTPTDALI TMGRLDLGDK GAARRGLDQL GDQLGLNTGE VANAIIKQMA GEIARQTRAL LERINSRPVY TVREVLEDKK LRPEQVIVIG APAPLLAAEL EAAFGLPVVA PSLAGVANAI GAALSQPTTE ITLQADTEQG FLTIPEEGIR EKVQRGFNLE AARQRALAAL QDRLRRLAPA AAGSELEVVE EQSFNMVSGF YTTGKNIRVK VQVKPQVSPL EGGEHNV
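The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal sketch, assuming the gene sequence is listed in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used here; the helper `translate` is illustrative and only the first 30 and last 15 bases of the record are checked.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final 15 bases of the gene sequence.
print(translate("ATGTATGTAG GCCTGGATAT GGGTGGTACC"))  # MYVGLDMGGT
print(translate("GAGCATAATGTATAA"))                   # EHNV
```

The two outputs agree with the start (MYVGLDMGGT) and end (EHNV) of the listed protein sequence.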