Gene Moth_1677 details


Gene Information

Locus tag: Moth_1677
Symbol:
ID: 3831948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1714054
End bp: 1715325
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 64%
IMG OID: 637829602
Product: histidyl-tRNA synthetase
Protein accession: YP_430522
Protein GI: 83590513
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
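
The coordinate and length fields above are mutually consistent, which can be checked with a minimal Python sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no amino acid; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1714054
end_bp = 1715325

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1272 423
```

Under those assumptions the result matches the listed Gene Length (1272 bp) and Protein Length (423 aa).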


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0038851
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTAACCA GTCGCCCGCG GGGCACGGAA GATATCCTGC CGGAAGAGGT CGGCCGCTGG 
TATCTCCTGG AGAACACGGC CCGGGAGGTC AGCCGCCTCT ACGGCTACCG GGAGATCCGG
ACGCCCATCT TTGAACATAC CGAGCTTTTT AACCGCGGCG TGGGGGATAC CAGCGATATC
GTCGAAAAGG AGATGTACAC CTTCATTGAT CGCGGTGACC GCAGCCTGAC CCTGAGGCCG
GAAGGCACGG CCCCGGTGGT ACGGGCCTTT GTGGAACACA GCCTGGAGGC CCGCGGCCTG
CCGGTAAAGC TGTTCTACCT GGGGCCCATG TTCCGCTACG GCCGGCCCCA GGCCGGACGG
CTGCGCCAGT TTCATCAATT CGGCGTCGAG GCCTTTGGTT CCCGGGACCC GGCCCTGGAC
GCCGAGGTCA TCGCCCTGGC CATGGATTTT TATACCCGCC TGGGGCTAAA GGACCTGGAG
CTGCATCTAA ACAGTGTCGG ATGCCCGGCC TGCCGCCCGG CCCACCGGGA GAAACTCAAG
GCCTACCTGC GTCCCCGCCT GGAAGAGCTC TGTCCTACCT GTCAGGGCCG CTTCGAGCGC
AACCCCCTGC GGATCTTTGA CTGCAAGAGC CCGGCTTGCC AGGAGATTGT CAGGGAGGCG
CCCACGGTGA CCGCTTCCCT GTGCCCCGAT TGCGCCGGGC ACTTTCACCG GGTCCAGGAG
TATTTGAAAG CCCTGGGTAT CGAATTTATT CTGGACGAGC ATTTGGTCCG GGGGCTGGAC
TACTATACCA AGACGGCCTT TGAAATTATG GTGAAGGGTA TCGGCGCCCA GAGCTCCATT
GGCGGCGGCG GCCGCTACGA CGGCCTGGTG GCGGCCCTGG GTGGCAAGCA GGTACCGGGG
ATTGGTTTTG GCTTAGGCCT GGAGCGGGTC CTGCTGGCCC TGGAGATCCA AGGCCAGGAA
CCGCCGCCGG AGGGGGGAGT GGACGTCCTG GTGGTGACTG CCGGCACCGG CGTGGACCTG
GCGGCCTTCC GTTTACTGGC CGGCCTTAGA GCCGCCGGCA TCCGGGCCGA TAAGGATTAC
CTGGAACGCA GCCTCAAGGG CCAGATGAAG TATGCCAATC GCTACCCGGC CAGGATGGCG
GTAATCCTGG GCGAGGAAGA GCTGGCCCGG GGCCGGGTTT CAGTACGCCG CCTGGACGCC
GGCAGCCAGG AAGAGGTCCC CCTGGCAGCG GTAGTAGATT ATTGCAGGAA AATGAAGGAG
AGTGGATGGT AG
 
Protein sequence
MLTSRPRGTE DILPEEVGRW YLLENTAREV SRLYGYREIR TPIFEHTELF NRGVGDTSDI 
VEKEMYTFID RGDRSLTLRP EGTAPVVRAF VEHSLEARGL PVKLFYLGPM FRYGRPQAGR
LRQFHQFGVE AFGSRDPALD AEVIALAMDF YTRLGLKDLE LHLNSVGCPA CRPAHREKLK
AYLRPRLEEL CPTCQGRFER NPLRIFDCKS PACQEIVREA PTVTASLCPD CAGHFHRVQE
YLKALGIEFI LDEHLVRGLD YYTKTAFEIM VKGIGAQSSI GGGGRYDGLV AALGGKQVPG
IGFGLGLERV LLALEIQGQE PPPEGGVDVL VVTAGTGVDL AAFRLLAGLR AAGIRADKDY
LERSLKGQMK YANRYPARMA VILGEEELAR GRVSVRRLDA GSQEEVPLAA VVDYCRKMKE
SGW
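
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch that translates codons one at a time. It assumes the standard codon assignments, which translation table 11 shares for every codon (table 11 differs in which codons may act as start codons, and that does not matter here because the gene begins with ATG). The `translate` helper and the constant names are hypothetical, written for this illustration rather than taken from any library.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: standard codon assignments, shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = ["".join(triplet) for triplet in product(BASES, repeat=3)]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGTTAACCA GTCGCCCGCG GGGCACGGAA"))  # MLTSRPRGTE
# Last 12 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("AGTGGATGGT AG"))  # SGW
```

The two fragments reproduce the first ten residues (MLTSRPRGTE) and the last three residues (SGW) of the protein sequence listed above.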