Gene Moth_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2037 
Symbol 
ID3831412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2126561 
End bp2127730 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content67% 
IMG OID637829966 
ProductATP phosphoribosyltransferase regulatory subunit 
Protein accessionYP_430876 
Protein GI83590867 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00443] ATP phosphoribosyltransferase, regulatory subunit 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAGCA ACCTTCCCCT CCAGCTACCG GCCGGGGTAA GTGATCTTCT ACCGCCGGAG 
GCGGCCGCCC TCCGGCAGCT GGAGCAGCGA CTGTTAAATT GTTTTCGCAG CTGGGGTTAC
CAGGAAGTAA TGACCCCGAC CTTTGAATTC GCCACTACCT TTCAGGCCGG CTCCCCGGCC
GGGGAAGAAG GGGCCCTCTA CAAGTTTATC GACCGCCAGG GGCGGGTCCT GGCCCTACGG
CCGGAAATGA CCGCCCCCAT TGCCCGCCTG GTGGCAACCT CCCTGCGGCG CCGGGAACTA
CCCCTGCGCC TGGGGTATAG CGCCCGGGTC TTCCGCTATG AGGAGCCCCA GGCCGGCCGC
CGGCGGGAGT TTCACCAGGC CGGCGTCGAA CTTATTGGCG CCGGGGGAGT GGCCGGAGAT
GTCGAGATTA TCGCCCTGGC GGTGGAGAGT CTGGCGCAGG CCGGTCTGGA GGATTTCCGG
CTGGGCCTGG GCCAGGTGGC CGTGACCAAA GGCGTTCTCC AGGATCTGGC TCTGCCGCCC
GAAGCGGTGG CCGGTATCAA ATCCGCCCTG GCCAGCAAGG ACCTGGTAGC CCTGGAACGG
ATCTATGATG AGTACCATCT GGAAGGTGAA CGCCGGCGGC GCTTGGAGCT GCTGGCCACC
ATCCACGGTG GCCGGGAAGC CCTGGAGGAA GCCCGGGCTT GTTTCGGCCG GACGGCTGCG
GCCGCTTCCC TGGCAGAGTT GTCCCGGGTC TGGGAGGCCC TGGGGGCCGC CGGCCTGGAG
AAGTGGCTTT TTATCGACCT TGGTATTCTG CGGGACTTTG ATTATTATAC GGGCATTGTC
TTTGAAGGTT ATGTGCCGGG CCTGGGAGCC CCGGTTTGCG GCGGCGGCCG TTATGACGGC
CTGCTGGCCC AGTTCGGTTA TCCCTGCCCG GCTACGGGTT TTGCCCTGGG CCTGGAGCGG
TTGCTCCTGG CCCGGGGAGA GACGGCACCG GCCTCGCTTG CGGGAGGCTA CCTGGTGGCC
GGGCGGGACC TGGCTGCCCT CCTGAAAAGG GCGCGGGAAT TGCGCAGCAA AGGAACGGCG
GTAGTTCTCG ACGGCGAGAG CCGGAGCCGC CAGGAGGCGG CAGCCCGGGC CGCCGCCCGC
GGCTTGAACC TGGAATGGAT CGGGGAGTAA
 
Protein sequence
MASNLPLQLP AGVSDLLPPE AAALRQLEQR LLNCFRSWGY QEVMTPTFEF ATTFQAGSPA 
GEEGALYKFI DRQGRVLALR PEMTAPIARL VATSLRRREL PLRLGYSARV FRYEEPQAGR
RREFHQAGVE LIGAGGVAGD VEIIALAVES LAQAGLEDFR LGLGQVAVTK GVLQDLALPP
EAVAGIKSAL ASKDLVALER IYDEYHLEGE RRRRLELLAT IHGGREALEE ARACFGRTAA
AASLAELSRV WEALGAAGLE KWLFIDLGIL RDFDYYTGIV FEGYVPGLGA PVCGGGRYDG
LLAQFGYPCP ATGFALGLER LLLARGETAP ASLAGGYLVA GRDLAALLKR ARELRSKGTA
VVLDGESRSR QEAAARAAAR GLNLEWIGE