Gene Moth_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2046 
Symbol 
ID3831192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2136820 
End bp2137872 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content64% 
IMG OID637829975 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_430885 
Protein GI83590876 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.200017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGCTG GGGAAGGCTT CACCTATGCT GCCGCCGGCG TGGATATCGC CGCCGGCAAC 
CGCGCCGTCG AATTAATGAA AGAGCATGTC CGCCGGACCT TCCGGCCGGG GGTGCTGGGG
GACCTGGGTG GCTTCGGCGG CCTCTTTGCC CTGGAGGCGG GGCGTTACCG GCAGCCGGTG
CTGGTGGCCG GGACCGACGG GGTGGGGACG AAGCTGAAGA TAGCCTTTAG CCTGGACCGG
CACGACACCA TCGGCATCGA CGCCGTGGCC ATGTGCGTCA ACGACATCCT GGTCCAGGGG
GCCGAACCCC TCTTTTTCCT GGACTACCTG GCCGTGGGCA AAATGGTGCC GGAAAGGGTG
GCCCGGATTG TCGCCGGGGT GGCCGAAGGG TGCCGCCGGG CTGGTTGCGC CCTCATCGGC
GGCGAGACGG CCGAGATGCC CGGCTTTTAC CGGGAGGATG AATACGACCT GGCCGGATTT
GCCGTCGGTG TTGTGGAACG GGAGGAGCTT TTGGACGGTA GCCGCATCCG GCCCGGGCAG
GTAGTCCTGG GACTGGCATC CAGTGGCCTG CATTCCAATG GCTTTTCCCT GGCCCGGCGG
GTTTTGTTGG TGGAAGCCGG TTATACCCTG GAACGGAAAC TGCCGGAACT GGGCCGAACC
CTGGGAGAAG AACTCCTGGA ACCGACGCGG ATTTATGTGG CCAGTATTTT ACCCTTGCTG
AAAGAAGGAC TTATAAAGGG CCTGGCCCAC ATCACCGGCG GTGGGCTGAT TGAGAACCCG
CCGCGCATCC TGCCGCCGGG TTGCAGCCTG CGCCTGGATC GGCGCAGCTG GCCGGTGCCG
CCCGCTTTCC GGCTCATCCA GGCTACCGGC AGGGTGCCTG AGGAAGAAAT GTACCGCACC
TTCAATATGG GTCTGGGGAT GCTGGTGGTG GTTGAGGAAA GCGACGCCGG CAGGGTAAAG
TCCCGGTTGG AGGCCGCAGG CGAGAAGGTT TTTGTCGTTG GTGAGGTTAT TCCCGGCCGG
CGGGAAGTGG AATTTGTGCC CGGTTTGCAA TAA
 
Protein sequence
MVAGEGFTYA AAGVDIAAGN RAVELMKEHV RRTFRPGVLG DLGGFGGLFA LEAGRYRQPV 
LVAGTDGVGT KLKIAFSLDR HDTIGIDAVA MCVNDILVQG AEPLFFLDYL AVGKMVPERV
ARIVAGVAEG CRRAGCALIG GETAEMPGFY REDEYDLAGF AVGVVEREEL LDGSRIRPGQ
VVLGLASSGL HSNGFSLARR VLLVEAGYTL ERKLPELGRT LGEELLEPTR IYVASILPLL
KEGLIKGLAH ITGGGLIENP PRILPPGCSL RLDRRSWPVP PAFRLIQATG RVPEEEMYRT
FNMGLGMLVV VEESDAGRVK SRLEAAGEKV FVVGEVIPGR REVEFVPGLQ