Gene Moth_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0076 
Symbol 
ID3832685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp76460 
End bp77428 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content58% 
IMG OID637828008 
Productribose-phosphate pyrophosphokinase 
Protein accessionYP_428958 
Protein GI83588949 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0462] Phosphoribosylpyrophosphate synthetase 
TIGRFAM ID[TIGR01251] ribose-phosphate pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGGTTA TAAAAAAGCA AATGGAAACC GAGAGCGGAC GCCTGAAGAT CTTCACCTGC 
AACGCCAACC CCAAACTGGC CGAGGAAATC GGCGCCTACC TGGGTGTGCC CCTGGGAGCG
GCCAAGGTAA AACGCTTTAG CGATGGGGAA ATAAGCGTCG TTATTGACGA GAGCGTGCGG
GGGGAGGATG TTTTCGTCAT CCAGCCCACC TGTGAACCCG TCAATGACAA TCTGATGGAA
CTCTTGATCA TGATCGATGC CCTGCGCCGG GCTTCCGCCA GGCGGATTAC GGCCGTCATA
CCCTACTATG GCTACGCCCG CCAGGAGCGC AAGACCAGGG CCCGGGACCC TATCTCCGCC
AAGCTGGTGG CCAATCTCAT TACCGCCGCA GGCGCCCACC GGGTCCTGAC CATGGACCTG
CACGCGGCGG CCATCCAGGG ATTTTTTGAT ATTCCGGTAG ATCACCTGAC GGCAGTCCCC
ATCCTGGCCG ATTACTTTAA CAGCAAGGGG TTTGAAAAGG CGGTGATTGT TTCCCCGGAC
CTGGGGGGCG TGACCAGGGC GCGTAACTTC GCCGAGCGCA TAGGCGCTGA GATCGCCATT
ATTGACAAGC GGCGGCCGGC GCCCAACGTC GCTGAGATCA TGAACCTCAT CGGCGATGTG
AAAAATAAAA CGGTCATCAT GATTGATGAC CTCATCGACA CCGCCGGGAC CATCTGCCTA
GGAGCTAAAG CCCTGATGGA GCAGGGCGCC TGCGCCGTTT ATGCCTGTTG TACCCATCCG
GTCCTATCCG GACCGGCCCG GGAACGCCTG ATGGCCTCTC CCCTGCAGGA GGTAGTTGTC
TGCAATACCA TTCCTGTCCC GGAGGGGAAA GAAATACCCA AGCTGCATCG CCTCTCCGTA
GCTCCCCTCC TGGGGGAAGC CATTATTCGC ATCCACGAAG ACCTCTCGGT CAGTAAACTT
TTCGATTAA
 
Protein sequence
MEVIKKQMET ESGRLKIFTC NANPKLAEEI GAYLGVPLGA AKVKRFSDGE ISVVIDESVR 
GEDVFVIQPT CEPVNDNLME LLIMIDALRR ASARRITAVI PYYGYARQER KTRARDPISA
KLVANLITAA GAHRVLTMDL HAAAIQGFFD IPVDHLTAVP ILADYFNSKG FEKAVIVSPD
LGGVTRARNF AERIGAEIAI IDKRRPAPNV AEIMNLIGDV KNKTVIMIDD LIDTAGTICL
GAKALMEQGA CAVYACCTHP VLSGPARERL MASPLQEVVV CNTIPVPEGK EIPKLHRLSV
APLLGEAIIR IHEDLSVSKL FD