Gene Moth_1751 details


Gene Information

Locus tag: Moth_1751
Symbol:
ID: 3832896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1805870
End bp: 1806892
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 60%
IMG OID: 637829675
Product: phenylalanyl-tRNA synthetase, alpha subunit
Protein accession: YP_430595
Protein GI: 83590586
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0016] Phenylalanyl-tRNA synthetase alpha subunit
TIGRFAM ID: [TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit
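
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1805870
end_bp = 1806892
gene_length_bp = 1023
protein_length_aa = 340

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3

print(span == gene_length_bp)           # coordinates agree with gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop equals protein length
```

Under those assumptions all three checks hold for this record: the span is 1023 bp, which is 341 codons, or 340 amino acids plus one stop codon.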


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0018761
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.793002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAGAAG TTATCGCCAA AATTCGGGAG GAAGCCCTGG CCCGGGTGGC AGCGGCCACC 
AGCAGCGAGG AACTGGAGGC CTTACGGGTC CGGTACCTGG GAAAAAAGGG TGAACTGACC
CGGGTTTTAC GGGGGATGGG TAAACTTCCG CCTGAAGAGC GGCCCCGGGT AGGACAGATG
GCCAACAAGG TACGGGAAGA GCTGGAAGGG GCTCTGAAAG AACACCGGGA GAATCTCTCC
CGCCGCGAAC AGGCGGAGCG CTTGCGGGCG GAAGCCCTTG ATGTGACCCT GCCGGGACGT
CCGGTTACCA GGGGCAACCG TCACCCCCTT TATCAGATAT TGAACGAAAT CAAGGCCGTT
TTTATCGGCC TGGGGTTTGA CGTCGCCGAG GGGCCGGAGG TGGAGAGCGA CTACTATAAC
TTTGAGGCCC TGAACTTACC CAAGGAGCAC CCGGCGCGGG ATATGCAGGA TTCCTTTTAC
ATTACCGAAG ACGTACTCCT GCGTACCCAT ACCTCCCCGG TACAAGTGAG GGTGATGGAA
GCGCGGCATC CCCAACTGCC AATCCGTATT ATTGCGCCGG GTAAAGTCTA CCGGCGCGAC
GATGACGCCA CCCACTCCCC CTTGTTCCAC CAGGTGGAGG GCCTGCTGGT GGACCGGCGG
GTGACCTTCG GCGACCTTAA AGGCACCTTG ATGGCTTTTT TAAAGCAGAT GTTCGGCGAA
CAGGTCCGGG TGCGTTTCCG GCCCAGCTAT TTCCCCTTCA CCGAGCCCAG CGCGGAAGTA
GATATGTCCT GCGTCATGTG CGGCGGCAGC GGCTGTCGTG TCTGTTCCCA CACCGGCTGG
CTGGAGATCC TTGGCTGCGG TATGGTTCAC CCTAAGGTTT TAAGCATGTC CGGCTACGAC
CCGGAGGAGG TCAGCGGCTT TGCCTTTGGC CTGGGCGTGG AGCGGGTGGC CATGCTGAAG
TACGGCATCG ACGACCTGCG CCTCTTCTAT GAAAACGACC TGCGCTTCCT GCGGCAGTTT
TAA
 
Protein sequence
MLEVIAKIRE EALARVAAAT SSEELEALRV RYLGKKGELT RVLRGMGKLP PEERPRVGQM 
ANKVREELEG ALKEHRENLS RREQAERLRA EALDVTLPGR PVTRGNRHPL YQILNEIKAV
FIGLGFDVAE GPEVESDYYN FEALNLPKEH PARDMQDSFY ITEDVLLRTH TSPVQVRVME
ARHPQLPIRI IAPGKVYRRD DDATHSPLFH QVEGLLVDRR VTFGDLKGTL MAFLKQMFGE
QVRVRFRPSY FPFTEPSAEV DMSCVMCGGS GCRVCSHTGW LEILGCGMVH PKVLSMSGYD
PEEVSGFAFG LGVERVAMLK YGIDDLRLFY ENDLRFLRQF
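
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate codons into amino acids. It is an illustration, not the method used to produce this record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code, so this string serves for both.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(text):
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = clean("ATGTTAGAAG TTATCGCCAA AATTCGGGAG")
print(translate(first_blocks))  # MLEVIAKIRE
```

The ten codons in the example translate to the first block of the protein sequence listed above. Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` would allow the reported GC content and protein sequence to be compared against the nucleotide sequence.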