Gene Moth_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1421 
Symbol 
ID3832249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1465669 
End bp1466751 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content54% 
IMG OID637829357 
ProductO-methyltransferase family protein 
Protein accessionYP_430277 
Protein GI83590268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.513829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATC ACCCGCAAAG TTTAATGGAC CTGGCTTGCC CCCAGGGGGT TGAAAGGATT 
GACAGCATTA CAGCCGGTTA TCAGGCTTAC CAGGTACTGA GGGCCGCCCT GGAACTGGGG
CTGTTTGATT GGTTGGCGGA AAATGGTCCC GGCTGCCGGG AGGAGATCAC CACTGCCCTC
AAGTTAAACG GCATGTTTAC CCGTAGTTTT CTCCAGGCCT TGGTGGACCT CGGCTTTTTA
ACCTGCAAAG GCGAAAAATA CAGGTTAACC GAATTGGCGA GAGATTTCTT GGTGCGGCGG
AGCCCTTGCT ACCAGGGAGA TCTATTCTTG AGCACCGCCC GGCCTGATTC CTGGTGGAAT
AACTTTAAAG ACACCCTTAC CGTCATAAAA CCCCCGGAAC AGGACTTTGA TGCCGTTCCA
ACCCCCGATT TTATTAAAGC CCTGGCCCAG CGTTCCCTCC GGGGAGAGTT GCAGGCAGTC
ACCCGCAGCA TAGTCGCCTG GGAAGGGTTT AGAGGGGCCA GGACGCTCCT TGACCTGGGA
GGCGGGCACG GTTTTTATGC CATAGCCCTG TGCCAGGTCA ATCCTAACCT CAGAGCCGTT
GTTTTCGATA AACCCCACAT TATTGCCTGC ACCAGGGAAT TTATCCGGCA GTACGGCCTG
GAAGACCGGG TGATAGTCCA GGGGGGCGAT GCGTGTTCGG AAGAATGGGG AGGAGGCTAT
GATATAGTCC TTATTTCTCA TTTGCTTTAC AAGTACCGCA AAGAATTAGC GGCATTTATT
GGTAAAGCCT TTACCGCCCT GAAGCCCGGC GGCCTGCTGG CGTGCAATCA CTGGTTCTGC
GCCCCGGGTT GTGGATCAGA GGGAGATGGT TTGCGGGAAC TCGATAGATC CATCCATAGC
TTTGGCCATC CCCTGTGCCA TATGGAGGAA TTCAATAACC TGTTGGCTAC TACCGGCTTT
AGCCTGTGGC AGTTACTTGA TGTTCCCAGC GCCTATGGTA TGGCGAAATT GCACCTTGCT
GTTAAAAAAG GATTGGCATC AACAAAGGCT ATGATGCCGG GGAGTTGCAG CGCCTGCTGC
TAA
 
Protein sequence
MSNHPQSLMD LACPQGVERI DSITAGYQAY QVLRAALELG LFDWLAENGP GCREEITTAL 
KLNGMFTRSF LQALVDLGFL TCKGEKYRLT ELARDFLVRR SPCYQGDLFL STARPDSWWN
NFKDTLTVIK PPEQDFDAVP TPDFIKALAQ RSLRGELQAV TRSIVAWEGF RGARTLLDLG
GGHGFYAIAL CQVNPNLRAV VFDKPHIIAC TREFIRQYGL EDRVIVQGGD ACSEEWGGGY
DIVLISHLLY KYRKELAAFI GKAFTALKPG GLLACNHWFC APGCGSEGDG LRELDRSIHS
FGHPLCHMEE FNNLLATTGF SLWQLLDVPS AYGMAKLHLA VKKGLASTKA MMPGSCSACC