Gene Moth_1676 details


Gene Information

Locus tag: Moth_1676
Symbol: aspS
ID: 3831947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1712258
End bp: 1714060
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 63%
IMG OID: 637829601
Product: aspartyl-tRNA synthetase
Protein accession: YP_430521
Protein GI: 83590512
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0173] Aspartyl-tRNA synthetase
TIGRFAM ID: [TIGR00459] aspartyl-tRNA synthetase, bacterial type
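
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record above.
start_bp = 1712258
end_bp = 1714060
gene_length_bp = 1803
protein_length_aa = 600

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be the stop codon.
codons = gene_length_bp // 3
encoded_residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("residues match protein length:", encoded_residues == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1803 bp, which corresponds to 601 codons, i.e. 600 residues plus the stop codon.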


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00150774
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTAGTGG TTGAGGGTTT GGCCGGACTC CACCGCAGCC ACGGCTGCGG GGAGTTGACG 
GCGGCTGATG CCGGCAAGGA AGTCACCCTG ATGGGCTGGG TCCACCGGCG CCGGGATCAC
GGCGGCCTGA TTTTTATTGA CCTCCGGGAT CGCTCCGGCC TGGTGCAGGT AGTTTGCGAC
CCGAAGTCCG GGCCGGCCTT TCAAAAGGCA GAAGAAGTAC GCAACGAGTA TGTGGTGGCC
GTCAGGGGCC TGGTGCGCCG TCGGCCCGAG GGGACTGTCA ACCCCAAACT GCCTACCGGC
GAAATCGAAG TGGTGGCCGA GGAATTTCGC CTGCTAAACC GGGCCAAGAC GCCGCCCTTT
TATATCGACG ACGGCATTGA TGTGGATGAA GCCTTACGCC TGCGTTACCG GTACCTGGAC
CTGCGCCGGC CGGAGATGCA GCGGCTCCTT TACCTGCGTT ACCGGACTAC CAGGGCCATC
CGCGACTTCC TGGACGCCCG CGGTTTCTGG GAGATTGAAA CCCCCATGCT GACCCGCAGC
ACACCGGAAG GGGCCCGGGA CTTCCTGGTG CCCAGCCGGC TGCGACCCGG GGAGTTCTTC
GCCCTGCCCC AATCCCCCCA GCTTTTCAAA CAGATCCTCA TGGTGGCCGG GGTAGAGCGG
TACTTTCAGA TCGTCCGCTG CTTCCGGGAT GAGGACCTGC GGGCCGACCG CCAGCCGGAG
TTTACCCAGC TGGATATGGA GATGTCCTTT GTCCAGCGGG AGGATATCCT CAAACTAGTC
GAGGAACTCA TGGCCTATGT TTTCCGGGAG ACCCTGGGAG TGGAGCTGGC CCTGCCCCTG
CCACGCCTAA CTTACCGGGA GGCCATGGAC CGTTACGGCT CCGACAAGCC CGATATCCGT
TTCGGTATGG AGATAGTAGA CGTATCCGAC CTGGTGGCCG GCTGCGGCTT TAAAGTTTTT
GCCGAGGCCG TTGCCCGCGG GGGCGTGGTA CGCGGTCTCT GCGCCCCGGG CTGCGCCGGG
TACTCCCGTC GGGAACTGGA TGAGCTGACC AGGCAGGCGG CGGTCTTCGG CGCTAAAGGG
CTGGCCTGGA TGGCCGTCAC CCCCGAGGGC ATCCGTTCCC CCATTGCCAA GTTCTTCACC
TCCGGCGAGC TGGAGGGCCT GGTCGCCCGC CTGGCAGGCA AGCCCGGCGA CCTCCTCCTC
TTTGTGGCCG ATACGGAAAC GACGGCCGCC ACGGCCCTGG GAGCCCTGCG CCTGGAAATG
GGGCGGCGCC TGCACCTCTA TGATCCGGAA CAGCTGGCCT TTACCTGGGT GACGGAGTTT
CCCCTCCTGG AATACAGTGC AGAAGAGAAG CGTTATGTAG CCGTGCACCA TCCCTTTACC
ATGCCCATGG AAGAAGACTG GCCCCTGCTG GACAGCGACC CCCTGCGGGT CCGGGCCCTG
GCCTACGACC TGGTCTTAAA CGGCGTCGAG CTGGGCGGGG GCAGTATCCG GATTCACCGC
CGGGACATCC AGGAGAAGAT GTTTAATCTG CTGGGCTTTA CCCCGGAGGC CGCACGGGAT
AAATTCGGGT TTTTGCTGGA TGCCTTTGAA TACGGTACCC CGCCCCACGG CGGCATCGCC
TTTGGCCTGG ACCGGATGCT GATGCTTATG GCCCGGCGGG ATACTATCCG CGATTGCATC
CCTTTCCCCA AGACCCAGAG CGGCACCTGC CTGATGACGG CGGCCCCCTC GGGCGTATCC
CCGGAGCAAC TCCAGGAACT CCACCTACGG AGCACGGCAC GAAAGAGTAC CAATCCTGCC
TGA
 
Protein sequence
MVVVEGLAGL HRSHGCGELT AADAGKEVTL MGWVHRRRDH GGLIFIDLRD RSGLVQVVCD 
PKSGPAFQKA EEVRNEYVVA VRGLVRRRPE GTVNPKLPTG EIEVVAEEFR LLNRAKTPPF
YIDDGIDVDE ALRLRYRYLD LRRPEMQRLL YLRYRTTRAI RDFLDARGFW EIETPMLTRS
TPEGARDFLV PSRLRPGEFF ALPQSPQLFK QILMVAGVER YFQIVRCFRD EDLRADRQPE
FTQLDMEMSF VQREDILKLV EELMAYVFRE TLGVELALPL PRLTYREAMD RYGSDKPDIR
FGMEIVDVSD LVAGCGFKVF AEAVARGGVV RGLCAPGCAG YSRRELDELT RQAAVFGAKG
LAWMAVTPEG IRSPIAKFFT SGELEGLVAR LAGKPGDLLL FVADTETTAA TALGALRLEM
GRRLHLYDPE QLAFTWVTEF PLLEYSAEEK RYVAVHHPFT MPMEEDWPLL DSDPLRVRAL
AYDLVLNGVE LGGGSIRIHR RDIQEKMFNL LGFTPEAARD KFGFLLDAFE YGTPPHGGIA
FGLDRMLMLM ARRDTIRDCI PFPKTQSGTC LMTAAPSGVS PEQLQELHLR STARKSTNPA
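
The sequences above are printed in blocks of ten residues separated by spaces and line breaks, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical and introduced here only for illustration; the demo input is just the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence above, used only as a short demo input.
demo = clean_sequence("ATGGTAGTGG TTGAGGGTTT")
print(len(demo), demo.startswith("ATG"))
print(f"GC fraction of demo fragment: {gc_fraction(demo):.2f}")
```

To check the full record, paste the complete gene sequence into `clean_sequence` and compare `len(...)` with the listed gene length and `gc_fraction(...)` with the listed GC content.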