Gene Moth_1101 details


Gene Information

Locus tag: Moth_1101
Symbol:
ID: 3833067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1128838
End bp: 1129788
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 61%
IMG OID: 637829029
Product: response regulator receiver domain-containing protein
Protein accession: YP_429958
Protein GI: 83589949
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2038] NaMN:DMB phosphoribosyltransferase
TIGRFAM ID: [TIGR03160] nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase
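
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the listed gene length is consistent with that reading) and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative and are not database field identifiers.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1128838
end_bp = 1129788
gene_length_bp = 951
protein_length_aa = 316

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length = codons - 1

print(span == gene_length_bp)
print(remainder == 0)
print(expected_protein_length == protein_length_aa)
```

All three checks agree with the record: 951 bp is 317 codons, one of which is the stop, leaving 316 residues.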


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGAAACTGC TTGACCAGAC CCTGCAAAGG ATTAAGCCCT TGGACGCAAG GGCCATGGCG 
AAGGCCCAGG CCCACCTTGA TGAACTCACC AAACCCCCGG GAAGCCTGGG AGCCCTGGAG
GATATTGCCA GACGTTTGGC GGGGATCAGG GGGGAAGTCC CCCGCCGATT GTCCCGTAAA
GCCCATATCC TCATGGCCGG GGATCACGGC GTGGTCGCCG AAGGGGTCAG CGCTTTTCCC
CAGGAAGTAA CCCCTCAGAT GGTATTTAAT TTCAGCCGGG GCGGGGCGGC CATCAACGTC
CTGGCCCGCC ACGCCAGCGC TGAGCTGGTT CTTGTCGATA TAGGTGTCGC CAGCGATCTC
CCTGAACTTC CGGGGTTACT GAAACGTAAA GTGGCACCGG GAACGGCCAA CCTGGCCCGG
GGTCCGGCCA TGACCAGGGA ACAGGCCATT GCCGCCCTGG AGGTGGGCAT CGAGGTAGCC
AGTGCCAAAA TCGAAGCCGG TAATGAGTTG CTGGGAATTG GGGAAATGGG GATCGGTAAT
ACCACCCCCA GTTCGGCTAT CCTGGCGGTC TTTAGCGGCC GGCCGGTGGA GGAGATTACC
GGCCGGGGTA CGGGGGTGGA TGCCAACCGG TTACGGCTGA AGATCAAAGC CATTCAACAG
GGTCTGGCCA TAAATAAACC TAATCCTGAT GATCCCCTGG ATGTCCTGGC CAAGGTTGGG
GGCCTGGAGA TTGCCGGCAT GGCCGGGGTA ATCCTGGCCG GGGCAGCAAT GCGGGTGCCG
GTAATCATCG ATGGCTTTAT CTCCGGAGCG GCAGCCCTGG TGGCGACGCG GCTGGCACCC
CTGGCGGGTG AATTTATCCT GGCTTCCCAT CTCTCAGAGG AACCGGGCCA TGCGGTGGCC
CTGGAACTGA TGGGTCTTAA GCCCATGCTG ACCATGCAGA TGCGCCTGTG A
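
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before it can be measured. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a demonstration, so its GC fraction need not match the whole-gene figure listed in the record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short demonstration.
first_line = "TTGAAACTGC TTGACCAGAC CCTGCAAAGG ATTAAGCCCT TGGACGCAAG GGCCATGGCG"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(seq), 2))
```

Passing the full sequence block to `clean_sequence` in the same way gives a string whose length can be compared with the Gene Length field.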
 
Protein sequence
MKLLDQTLQR IKPLDARAMA KAQAHLDELT KPPGSLGALE DIARRLAGIR GEVPRRLSRK 
AHILMAGDHG VVAEGVSAFP QEVTPQMVFN FSRGGAAINV LARHASAELV LVDIGVASDL
PELPGLLKRK VAPGTANLAR GPAMTREQAI AALEVGIEVA SAKIEAGNEL LGIGEMGIGN
TTPSSAILAV FSGRPVEEIT GRGTGVDANR LRLKIKAIQQ GLAINKPNPD DPLDVLAKVG
GLEIAGMAGV ILAGAAMRVP VIIDGFISGA AALVATRLAP LAGEFILASH LSEEPGHAVA
LELMGLKPML TMQMRL
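
The gene begins with TTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted initiation codons, and an initiating codon is read as methionine. The sketch below illustrates this on the first ten codons of the gene. The function name `translate_cds` is hypothetical; the codon table is built from the standard 64-letter amino acid string in TCAG order, which table 11 shares with the standard code.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        residues.append(aa)
    return "".join(residues)


# First ten codons of the gene sequence in this record.
print(translate_cds("TTGAAACTGC TTGACCAGAC CCTGCAAAGG"))  # MKLLDQTLQR
```

The output matches the first ten residues of the protein sequence above. The sketch handles only unambiguous A, C, G and T bases; a sequence containing other symbols would raise a `KeyError`.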