Gene Moth_2396 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2396 
Symbol 
ID3830763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2516791 
End bp2517642 
Gene Length852 bp 
Protein Length283 aa 
Translation table11 
GC content67% 
IMG OID637830315 
ProductHemK family modification methylase 
Protein accessionYP_431221 
Protein GI83591212 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.299117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGC GGCAAGCCCT GGGGGAGGCC GTCCGGCGCC TGGCGGCCGG GGGGGTTGAA 
CGCCCGCGGC TGGAGGCGGA AGTCCTTCTC GGGTGGGCCT GTAGTTTAAC CCGGCCCCGC
CTCCTGGCCC GCCTGGAGGA GGAACTGGCA CCGGCAGCCG CAGGACGGTT CTGGCAGGCA
ATTGACCGCC GGGCAGCCGG TTACCCCCTC CAGTACCTCA CCGGACACCA GGAATTTATG
TCCCTGGACT TTAAAGTCAC TCCGGCGGTT TTAATCCCCC GCCAGGATAC CGAAGTGGTG
GTGGAGGCTG TCCTTGAGCG TCTGGACCCC TGCGAGAGCT ATACCATCGC CGACTGCGGT
ACGGGCAGCG GGGCCATTGC CCTGAGCCTG GCCCATTACC TGCCCCGGGC CCGGGTTTAC
GCCACGGACA TCAGCCCGGC GGCCCTGACG GTGGCCCAGG AAAACGCCAG GAAACTGGGG
CTGGCGGCCA GGGTAACCCT TCTCCAGGGT GATTTTTTGG CGCCCCTGCG GGGTTTAAAG
CTCGACGCCC TTGTGGCCAA CCCCCCCTAC ATACCCACTG CCGCCCTGCC AGGGCTGCCC
GCGGATGTCC GCTCTGAACC GCGCCTGGCC CTGGACGGCG GGCCCGACGG CCTGGATGCC
TACCGGTTCC TCCTGCCGGG GGCGGCAGGA CTTTTGCGGC CCGGCGGTCT CCTGGCCCTG
GAAATCGGCT CCGACCAGGG ACAGGCCGTA AAGGACCTGG CCCGGGCCGT GGGAGCCTAT
CGCAACGAAC AGGTTTTACC AGATTATGCC GGCCGCGATC GTTGTTTCCT GGCTTATCGC
CGGGAAGAAT AA
 
Protein sequence
MTLRQALGEA VRRLAAGGVE RPRLEAEVLL GWACSLTRPR LLARLEEELA PAAAGRFWQA 
IDRRAAGYPL QYLTGHQEFM SLDFKVTPAV LIPRQDTEVV VEAVLERLDP CESYTIADCG
TGSGAIALSL AHYLPRARVY ATDISPAALT VAQENARKLG LAARVTLLQG DFLAPLRGLK
LDALVANPPY IPTAALPGLP ADVRSEPRLA LDGGPDGLDA YRFLLPGAAG LLRPGGLLAL
EIGSDQGQAV KDLARAVGAY RNEQVLPDYA GRDRCFLAYR REE