Gene Moth_0898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0898 
Symbol 
ID3831440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp934045 
End bp934980 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content65% 
IMG OID637828829 
Productmethionyl-tRNA formyltransferase 
Protein accessionYP_429758 
Protein GI83589749 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0223] Methionyl-tRNA formyltransferase 
TIGRFAM ID[TIGR00460] methionyl-tRNA formyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.212157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTAG TGTTCATGGG AACCCCCGAT TTTGCCGTTC CCTCCCTGCA GGCTCTGGTG 
GCAGCTGGCC ATGAATTTGC TGCCGTAATC ACCCAGCCCG ATCGGCCTCG GGGGCGGGGC
AAGAAACTCC TGCCGCCACC CGTAAAGAGT ACGGCCCTGG CCGCCGGGCT GCCGGTGCGC
CAGCCATCTG ACATGAAGGA CAGGGAGTTT TTGGAGGACT TGCGGCTATT GCAGCCGGAG
TTAATCGTGG TAGTGGCCTT CGGCCGCATT CTCTCGCGGG AGATCCTGGA CCTGCCGGCG
CGGGGATGCG TTAACCTGCA CGCCTCCTTG TTACCGCGCT ACCGGGGAGC GGCTCCCATC
CACCGGGCCG TGATGAACGG GGAAGTTGAA ACCGGGGTGA CCACCATGTG GATGGCACCG
CAACTGGACG CCGGCGACAT CATCCTCCAG GAGAAGCTGC CCATCCCACC GGAGGCCACG
ACGGGGGAGA TCCATGACCG TCTGGCTGAG GTGGGGGCCG GACTCCTGGT ACATACCCTG
GAATTGATAG CAGCCAGCCG GGCGCCGCGC CTACCCCAGG ACGAGGCCCT GGCCACCTAC
GCACCGCCGC TTAAACCGGA GGAAGAAGTA ATTCACTGGG AGCAGCCGGC GCAGGTTATC
TATAACCAAA TCCGGGGCCT GAACCCCTGG CCGGGGGCTT ATACCCTGCG GTCCGGGGAA
CGGCTAAAGA TATACGGCGC CCGGCTAACT GACCCGTCTG CCATCGGCAG GGCGGGGCGG
GTCGTGGAGG TCGGCCGGGA AGGGTTCGTG GTCCAGGCCG GGACCGGTCG GCTACTGGTC
ACCTCCGTGC AGCCGCCGGG GAAGAAGATC ATGCCGGCCT CCGCCTACCT GCAAGGGTAC
CCCATGGTAC CCGGGGAGAT TCTGGGATGC GTATAA
 
Protein sequence
MRLVFMGTPD FAVPSLQALV AAGHEFAAVI TQPDRPRGRG KKLLPPPVKS TALAAGLPVR 
QPSDMKDREF LEDLRLLQPE LIVVVAFGRI LSREILDLPA RGCVNLHASL LPRYRGAAPI
HRAVMNGEVE TGVTTMWMAP QLDAGDIILQ EKLPIPPEAT TGEIHDRLAE VGAGLLVHTL
ELIAASRAPR LPQDEALATY APPLKPEEEV IHWEQPAQVI YNQIRGLNPW PGAYTLRSGE
RLKIYGARLT DPSAIGRAGR VVEVGREGFV VQAGTGRLLV TSVQPPGKKI MPASAYLQGY
PMVPGEILGC V