Gene Moth_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0021 
Symbol 
ID3831894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp21982 
End bp23265 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content60% 
IMG OID637827948 
Productseryl-tRNA synthetase 
Protein accessionYP_428904 
Protein GI83588895 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0172] Seryl-tRNA synthetase 
TIGRFAM ID[TIGR00414] seryl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.410192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000214163 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCTGGATA TTAGAGTAGT ACGCCAGGAC CCGGAGGGGG TGGCCAGGGG ACTGGCCCGC 
CGGGGCCTGG CTGCTGGCCT GGAACACTTT CTGGAACTGG ATGCCAGGCG CAGGAACCTC
CTGGTGGAGG TTGAGGGACT GAAAAATAGA CGCAACCAGG TCTCGGCCGA GGTGGCCCGA
CTGAAGAAGA GCGGGCAGGA CGCGACAGAG TTGATCGCCT CCATGCGCGA AGTCGGTGAG
CGCATTAAAG AGCTGGATGA AAAAGTCCGG GCTGTGGAAG AAGAGCTTCG GCAGGAGATG
CTGAAACTAC CCAACATCCC CCATGCCTCA GTTCCCGACG GCTTAAGCGA TGCCGACAAC
AAGCCGGTGC GTCACTGGGG TGAATTGCCA CGCTTTAAAT TTGAACCCCG GCCCCATTGG
GAGATAGCAG AAAACCTGGG GATTGTTGAT TTCGAGCGCG GCGGTAAGGT GGCCGGGGCC
CGTTTTGTCT TCTATCGCGG CGCCGGGGCG CGGTTGGAGC GGGCTTTGAT CAACTTCATG
CTGGATCTGC ATACCATCAA GCATGGTTAT ACAGAGATTT TCCCACCATA TATGGTAAAC
AGCGCCAGCA TGCTGGGTAC CGGCCAGCTG CCCAAGTTTG CTGAGGATAT GTTTCATGTC
GAGGGGACGG ACTATTACCT CATTCCTACG GCCGAGGTGC CCGTCACTAA TCTCTACCGG
GGGGAGATCC TGCCCGGTGA GAAGCTGCCT ATCTACCATG TTGCCTACAG CGCCTGCTTC
CGGGCCGAGG CCGGTGCCGC CGGCAGGGAT ACCCGCGGGC TCATCCGCCA GCACCAGTTT
AACAAGGTGG AACTGGTCAA GTTCACCCGG CCCGAGGATT CCTATGAAGA ACTGGAGAAG
CTGACCCGGG ACGCCGAGGA GGTTTTGCAG CTTCTGGGCC TGCCCTACCG GGTGGTAGCC
CTCTGCGCCG GCGACCTGGG CTTTTCGGCG GCCAAGACCT ACGACCTGGA GGTCTGGATG
CCAGGCACCG CTTGCTACCG CGAGATTTCC TCCTGCAGCA ACTTTGAAGA CTTCCAGGCC
CGCAGGGCTG ATATCCGCTT CCGGCCCGGC CCCAAGGAAA AGCCGCGCCT GGTCCATACT
TTAAACGGCT CCGGGGTGGC CGTCGGCCGG ACGGTAGCCG CCATCCTGGA AAACTACCAG
CAGGAGGATG GTACGGTGCT AATCCCCCCC GCTCTGAAAC CGTATATGGG GGGCCTTACT
TCCATCCAGC CGGAGCAGGA TTAA
 
Protein sequence
MLDIRVVRQD PEGVARGLAR RGLAAGLEHF LELDARRRNL LVEVEGLKNR RNQVSAEVAR 
LKKSGQDATE LIASMREVGE RIKELDEKVR AVEEELRQEM LKLPNIPHAS VPDGLSDADN
KPVRHWGELP RFKFEPRPHW EIAENLGIVD FERGGKVAGA RFVFYRGAGA RLERALINFM
LDLHTIKHGY TEIFPPYMVN SASMLGTGQL PKFAEDMFHV EGTDYYLIPT AEVPVTNLYR
GEILPGEKLP IYHVAYSACF RAEAGAAGRD TRGLIRQHQF NKVELVKFTR PEDSYEELEK
LTRDAEEVLQ LLGLPYRVVA LCAGDLGFSA AKTYDLEVWM PGTACYREIS SCSNFEDFQA
RRADIRFRPG PKEKPRLVHT LNGSGVAVGR TVAAILENYQ QEDGTVLIPP ALKPYMGGLT
SIQPEQD