Gene Moth_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1709 
Symbol 
ID3833159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1745040 
End bp1746281 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content56% 
IMG OID637829634 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_430554 
Protein GI83590545 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000941896 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.172619 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATGATT ATTTCGGAGG ATGGTTTAAG TTGGAACAAG AAGTCGCACG GCAGTTAAGG 
ATCCTCCGCC GGGGAGTGGC TGAGATTGTG CCTGAGGAGG ATTTGCAGGC CAAACTCAGA
AAATCCCTGG CCACCGGTAA ACCCTTAAAG GTTAAACTGG GTTTAGACCC TACGGCCCCG
GATATTCATC TGGGTCATAC GGTGGTGCTC CAGAAACTGC GCCAGTTTCA GGAATTGGGC
CACCAGGTGA TAATCATCAT CGGCGATTTT ACCGGGCGTA TAGGCGATCC CACCGGTAAA
TCGGAAACCA GGCGCCAGCT TACAGAAGCA GAAATCCTGG CCAATGCCGA AACCTATAAG
GAACAGATTT TTAAAGTACT GGACCCAGAG CAAACCCGGG TGACCTTTAA CAGCCACTGG
CTGGGCAAGC TTACCTTTGC CGAAGTCATT GAACTGGCAG CTAGGACGAC GGTGGCCCGC
ATGCTGGAGC GGGACGATTT TGCCCGCCGG TTCCAGGAAA ATCGTCCTAT CAGCATCCAT
GAGTTTTTTT ACCCCCTGAT GCAGGGTTAT GATTCCGTGG CCCTGGCTGC AGATGTCGAA
CTTGGGGGTA CGGATCAGAA GTTTAACCTC CTTATGGGCC GTCACCTGCA GCGTGAATAT
GGCCAGGAGC CCCAGGTGGC CATGATGATG CCCATCCTCC CCGGCCTGGA CGGCGTACAG
AAGATGAGCA AGAGCCTGGG GAACTATATC GGTATCAAGG AATCCCCCCG GGAGATGTAC
GGTAAGACCA TGTCCCTCCC TGATGAACTC ATGCTCACCT ATTACGAGCT GGTGACGGCA
GTGCCCCTGG AGGAGCTGGC AGCCATCAGG CAGGGCCTGG CCAGCGGCAG CCTGCACCCC
AGGGATGCCA AAATGCGCCT GGCCCGGGAG ATAGTAGCCA TGTATCACAC TCCGGAAGCG
GCCCTGGAGG CGGAGAGGGA ATTTCGCCAG GTCTTCCAGC AGCATGACCT GCCTGATGAT
ATGCCGGAAT TAACGATTAA AGAAGACAGG GTGTGGCTGC CCCGGCTCAT GGTCCAGGCC
GGGCTGGCTC CCAGCACCAG CGAGGCCCGG CGCCTGATCC GCCAGGGTGC AGTAAAGATC
GACGGTGAAC GGGTAACCGA TCCTGACACC GAGGTTGAGG TCAGGGAGGG CCAGGTCCTC
CAGGCCGGTA AACGTAAATT TGCCCGGCTG CACACATTTT AA
 
Protein sequence
MHDYFGGWFK LEQEVARQLR ILRRGVAEIV PEEDLQAKLR KSLATGKPLK VKLGLDPTAP 
DIHLGHTVVL QKLRQFQELG HQVIIIIGDF TGRIGDPTGK SETRRQLTEA EILANAETYK
EQIFKVLDPE QTRVTFNSHW LGKLTFAEVI ELAARTTVAR MLERDDFARR FQENRPISIH
EFFYPLMQGY DSVALAADVE LGGTDQKFNL LMGRHLQREY GQEPQVAMMM PILPGLDGVQ
KMSKSLGNYI GIKESPREMY GKTMSLPDEL MLTYYELVTA VPLEELAAIR QGLASGSLHP
RDAKMRLARE IVAMYHTPEA ALEAEREFRQ VFQQHDLPDD MPELTIKEDR VWLPRLMVQA
GLAPSTSEAR RLIRQGAVKI DGERVTDPDT EVEVREGQVL QAGKRKFARL HTF