Gene Moth_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2048 
Symbol 
ID3831194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2139444 
End bp2141645 
Gene Length2202 bp 
Protein Length733 aa 
Translation table11 
GC content63% 
IMG OID637829977 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_430887 
Protein GI83590878 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.156991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGCCTG AAATCTACCG GAGAATGGGC CTCACTGATG CAGAATTTGA GAAAGTTAAG 
GCCATCCTGG GCCGGGAACC CAACTACGTC GAACTGGGCA TGTTTGCCGT TATGTGGTCA
GAGCACTGCG GCTATAAAAG CTCCCGTTCG GTACTGAAGC TGTTCCCGAC GAAGGCCCCC
TGGGTCCTCC AGGGCCCGGG GGAGAACGCC GGCATCGTGG ACATCGGTGA CGGCCAGGCC
GTGGTCTTTA AAATCGAAAG CCACAACCAT CCCTCGGCCA TAGAACCCTT CCAGGGGGCG
GCCACCGGGG TGGGGGGCAT CGTCCGCGAC ATCTTTGCCA TGGGGGCGCG GCCCATTGCC
GTCCTCAATT CCTTACGTTT CGGACCCCTG GACGACTCCC GCACCCGGTA CCTCATGGGC
GGGGTGGTGG GCGGCATCGC CTTTTACGGC AACTGCCTGG GCCTGCCCAC GGTGGCCGGG
GAGGTTTATT TTGAACCCTC GTATGCCTGT AACCCCCTGG TCAACGTCAT GGCCGTGGGC
TTAATCGAGC AAAAGAATAT CCGCCGGGGT ACGGCTGCCG GAGTGGGCAA CGCCGTCATG
CTAATAGGTG CTCGCACCGG CCGCGACGGC ATCCACGGGG CCACCTTCGC CTCCGAGGAA
TTGAGCGAGG CTTCCGAGGA GCGACGGCCC TCCGTCCAGG TAGGCGACCC CTTCCGGGAA
AAGCTCCTCA TCGAAGCCTG CCTGGAAATA ATAAATGAAG ATTTAATCAT CGGCATGCAG
GATATGGGCG CCGCCGGTAT TACCAGCTCC TCCTGCGAGA TGGCGGCCCG GGCCGGCACC
GGCATGGAGA TTGACATCGC CCTGGTGCCG CGGCGGGAAG AAGGTATGAC CCCCTACGAG
GTCATGCTGT CGGAATCCCA GGAACGGATG CTCTTGGTAC CCAAAAAAGG CGCTGAAGAG
AGGATCCGGG CCATCTGCCG GCGCTGGGGC CTGGAAGCGG TAATCATCGG CCGGGTAACG
GGCGACGGCC TGATGCGCAT CATGGAGAAC GGCCGGGTGG TGGCCGAGGT GCCGGCCAAG
GCCCTGACGG ACCAGTGCCC GGTCTATGAA CGGGAACGCC GCCGGCCGGC CTACCTGGAT
GAAGTACGGC AAAGGGACTT AAGCCGCCTG CCGGAACCGG AGGATTACGG CCGGGTGCTC
CTGGGGCTGC TGGCGGCGCC GAACCTGGCC AGTAAAGAGT GGGTTTACCG CCAGTATGAT
TATATGGTCC GGACGGATAC GGTGGCCGGG CCCGGGGGAG ACGCCGCGAT CTTAAGGGTT
AAAGGCACGA GCAAAGGCCT GGCCCTGACG GTGGACGGCA ATGGTCGCTA CTGTTACCTG
GACCCGGAGC GGGGCGGGGC CATCGCCGTT GCCGAGGCGG CCCGGAACCT CGCCTGTGTG
GGGGCGCGGC CCCTGGCCAT CACCGATTGC CTGAACTTCG GTAATCCCGA AAAGCCCGAG
GTGGCCTGGC AGTTCTATCA GGCCGTGAGC GGCATGAGCC GGGCCTGTGA GGTTTTGCGG
ACCCCGGTAA CCGGCGGTAA CGTCAGCTTT TATAACGAAA CCGAAAGCGG AGCTATCTAC
CCCACCCCGG TGGTGGGCAT GGTCGGCCTG CTGCCGGATA TAGAAAAACG CTGTGGCATT
GGTTTCCGCC GGGAAGGCGA CCTGTTGATC CTTATGGGCG AGACTTACCC GGAGATAGGC
GGCAGCGAGT ACCTGGCCAC ATTCCATGGC CTGGTGGCCG GAGAACCGCC TGCCCTGGAC
CTGGAGCGGG AAAAGGCCGT CCAGGCCCTG GTCCGGGAGG TTATCGCCGC CGGGCTAGCA
ACAGCAGCCC ATGACTGCGC CGAAGGGGGT CTGGCCGTGG CCCTGGCGGA AAGCGCCCTG
GCCGGAGGCC TGGGGGCGGA GGTGGAACTG GCGAGCGACC TGCGGCCCGA TTTTCTCCTC
TTCAGTGAGA GCCAGTCCAG GATACTGCTG GCGGTAGCAC CTGAGGCCAG GGACAGGGTG
CTGGATCTGG CCCGAGAAAA GGGCGTGCGG GCATCGGTCA TTGGCCGGTG CGGGGGCCAC
AGCCTGGTGG TAAGGATAAA CGGCAGGACA TTGTTCAATC TAAGCCTTGA GGAGATGGGG
AAACAATGGC GGGAGAGCAT ACCGGTGCTG ATGGCCCGGT AA
 
Protein sequence
MEPEIYRRMG LTDAEFEKVK AILGREPNYV ELGMFAVMWS EHCGYKSSRS VLKLFPTKAP 
WVLQGPGENA GIVDIGDGQA VVFKIESHNH PSAIEPFQGA ATGVGGIVRD IFAMGARPIA
VLNSLRFGPL DDSRTRYLMG GVVGGIAFYG NCLGLPTVAG EVYFEPSYAC NPLVNVMAVG
LIEQKNIRRG TAAGVGNAVM LIGARTGRDG IHGATFASEE LSEASEERRP SVQVGDPFRE
KLLIEACLEI INEDLIIGMQ DMGAAGITSS SCEMAARAGT GMEIDIALVP RREEGMTPYE
VMLSESQERM LLVPKKGAEE RIRAICRRWG LEAVIIGRVT GDGLMRIMEN GRVVAEVPAK
ALTDQCPVYE RERRRPAYLD EVRQRDLSRL PEPEDYGRVL LGLLAAPNLA SKEWVYRQYD
YMVRTDTVAG PGGDAAILRV KGTSKGLALT VDGNGRYCYL DPERGGAIAV AEAARNLACV
GARPLAITDC LNFGNPEKPE VAWQFYQAVS GMSRACEVLR TPVTGGNVSF YNETESGAIY
PTPVVGMVGL LPDIEKRCGI GFRREGDLLI LMGETYPEIG GSEYLATFHG LVAGEPPALD
LEREKAVQAL VREVIAAGLA TAAHDCAEGG LAVALAESAL AGGLGAEVEL ASDLRPDFLL
FSESQSRILL AVAPEARDRV LDLAREKGVR ASVIGRCGGH SLVVRINGRT LFNLSLEEMG
KQWRESIPVL MAR