Gene Information |
Locus tag | Moth_2046 |
Symbol | |
ID | 3831192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2136820 |
End bp | 2137872 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829975 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_430885 |
Protein GI | 83590876 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
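The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the listed values are consistent with that convention) and a single terminal stop codon on an unspliced CDS. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2136820, 2137872)
print(length_bp, protein_length(length_bp))  # prints: 1053 350
```

Under these assumptions the computed values match the Gene Length (1053 bp) and Protein Length (350 aa) fields of this record.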
| |
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.200017 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGCTG GGGAAGGCTT CACCTATGCT GCCGCCGGCG TGGATATCGC CGCCGGCAAC CGCGCCGTCG AATTAATGAA AGAGCATGTC CGCCGGACCT TCCGGCCGGG GGTGCTGGGG GACCTGGGTG GCTTCGGCGG CCTCTTTGCC CTGGAGGCGG GGCGTTACCG GCAGCCGGTG CTGGTGGCCG GGACCGACGG GGTGGGGACG AAGCTGAAGA TAGCCTTTAG CCTGGACCGG CACGACACCA TCGGCATCGA CGCCGTGGCC ATGTGCGTCA ACGACATCCT GGTCCAGGGG GCCGAACCCC TCTTTTTCCT GGACTACCTG GCCGTGGGCA AAATGGTGCC GGAAAGGGTG GCCCGGATTG TCGCCGGGGT GGCCGAAGGG TGCCGCCGGG CTGGTTGCGC CCTCATCGGC GGCGAGACGG CCGAGATGCC CGGCTTTTAC CGGGAGGATG AATACGACCT GGCCGGATTT GCCGTCGGTG TTGTGGAACG GGAGGAGCTT TTGGACGGTA GCCGCATCCG GCCCGGGCAG GTAGTCCTGG GACTGGCATC CAGTGGCCTG CATTCCAATG GCTTTTCCCT GGCCCGGCGG GTTTTGTTGG TGGAAGCCGG TTATACCCTG GAACGGAAAC TGCCGGAACT GGGCCGAACC CTGGGAGAAG AACTCCTGGA ACCGACGCGG ATTTATGTGG CCAGTATTTT ACCCTTGCTG AAAGAAGGAC TTATAAAGGG CCTGGCCCAC ATCACCGGCG GTGGGCTGAT TGAGAACCCG CCGCGCATCC TGCCGCCGGG TTGCAGCCTG CGCCTGGATC GGCGCAGCTG GCCGGTGCCG CCCGCTTTCC GGCTCATCCA GGCTACCGGC AGGGTGCCTG AGGAAGAAAT GTACCGCACC TTCAATATGG GTCTGGGGAT GCTGGTGGTG GTTGAGGAAA GCGACGCCGG CAGGGTAAAG TCCCGGTTGG AGGCCGCAGG CGAGAAGGTT TTTGTCGTTG GTGAGGTTAT TCCCGGCCGG CGGGAAGTGG AATTTGTGCC CGGTTTGCAA TAA
Protein sequence | MVAGEGFTYA AAGVDIAAGN RAVELMKEHV RRTFRPGVLG DLGGFGGLFA LEAGRYRQPV LVAGTDGVGT KLKIAFSLDR HDTIGIDAVA MCVNDILVQG AEPLFFLDYL AVGKMVPERV ARIVAGVAEG CRRAGCALIG GETAEMPGFY REDEYDLAGF AVGVVEREEL LDGSRIRPGQ VVLGLASSGL HSNGFSLARR VLLVEAGYTL ERKLPELGRT LGEELLEPTR IYVASILPLL KEGLIKGLAH ITGGGLIENP PRILPPGCSL RLDRRSWPVP PAFRLIQATG RVPEEEMYRT FNMGLGMLVV VEESDAGRVK SRLEAAGEKV FVVGEVIPGR REVEFVPGLQ
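The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hedged illustration, not the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which this sketch does not handle). The gene sequence as listed starts with ATG and ends with TAA, so it appears to be given in coding orientation already, and no reverse complement is applied. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (first base varies slowest, bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGTAGCTG GGGAAGGCTT CACCTATGCT"))  # prints: MVAGEGFTYA
```

Passing the full Gene sequence field to `translate` should give a string that can be compared, after `clean`, with the Protein sequence field, and `gc_percent` on the same input can be compared with the GC content field.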