Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1677 |
Symbol | |
ID | 3831948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1714054 |
End bp | 1715325 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637829602 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_430522 |
Protein GI | 83590513 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0038851 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAACCA GTCGCCCGCG GGGCACGGAA GATATCCTGC CGGAAGAGGT CGGCCGCTGG TATCTCCTGG AGAACACGGC CCGGGAGGTC AGCCGCCTCT ACGGCTACCG GGAGATCCGG ACGCCCATCT TTGAACATAC CGAGCTTTTT AACCGCGGCG TGGGGGATAC CAGCGATATC GTCGAAAAGG AGATGTACAC CTTCATTGAT CGCGGTGACC GCAGCCTGAC CCTGAGGCCG GAAGGCACGG CCCCGGTGGT ACGGGCCTTT GTGGAACACA GCCTGGAGGC CCGCGGCCTG CCGGTAAAGC TGTTCTACCT GGGGCCCATG TTCCGCTACG GCCGGCCCCA GGCCGGACGG CTGCGCCAGT TTCATCAATT CGGCGTCGAG GCCTTTGGTT CCCGGGACCC GGCCCTGGAC GCCGAGGTCA TCGCCCTGGC CATGGATTTT TATACCCGCC TGGGGCTAAA GGACCTGGAG CTGCATCTAA ACAGTGTCGG ATGCCCGGCC TGCCGCCCGG CCCACCGGGA GAAACTCAAG GCCTACCTGC GTCCCCGCCT GGAAGAGCTC TGTCCTACCT GTCAGGGCCG CTTCGAGCGC AACCCCCTGC GGATCTTTGA CTGCAAGAGC CCGGCTTGCC AGGAGATTGT CAGGGAGGCG CCCACGGTGA CCGCTTCCCT GTGCCCCGAT TGCGCCGGGC ACTTTCACCG GGTCCAGGAG TATTTGAAAG CCCTGGGTAT CGAATTTATT CTGGACGAGC ATTTGGTCCG GGGGCTGGAC TACTATACCA AGACGGCCTT TGAAATTATG GTGAAGGGTA TCGGCGCCCA GAGCTCCATT GGCGGCGGCG GCCGCTACGA CGGCCTGGTG GCGGCCCTGG GTGGCAAGCA GGTACCGGGG ATTGGTTTTG GCTTAGGCCT GGAGCGGGTC CTGCTGGCCC TGGAGATCCA AGGCCAGGAA CCGCCGCCGG AGGGGGGAGT GGACGTCCTG GTGGTGACTG CCGGCACCGG CGTGGACCTG GCGGCCTTCC GTTTACTGGC CGGCCTTAGA GCCGCCGGCA TCCGGGCCGA TAAGGATTAC CTGGAACGCA GCCTCAAGGG CCAGATGAAG TATGCCAATC GCTACCCGGC CAGGATGGCG GTAATCCTGG GCGAGGAAGA GCTGGCCCGG GGCCGGGTTT CAGTACGCCG CCTGGACGCC GGCAGCCAGG AAGAGGTCCC CCTGGCAGCG GTAGTAGATT ATTGCAGGAA AATGAAGGAG AGTGGATGGT AG
|
Protein sequence | MLTSRPRGTE DILPEEVGRW YLLENTAREV SRLYGYREIR TPIFEHTELF NRGVGDTSDI VEKEMYTFID RGDRSLTLRP EGTAPVVRAF VEHSLEARGL PVKLFYLGPM FRYGRPQAGR LRQFHQFGVE AFGSRDPALD AEVIALAMDF YTRLGLKDLE LHLNSVGCPA CRPAHREKLK AYLRPRLEEL CPTCQGRFER NPLRIFDCKS PACQEIVREA PTVTASLCPD CAGHFHRVQE YLKALGIEFI LDEHLVRGLD YYTKTAFEIM VKGIGAQSSI GGGGRYDGLV AALGGKQVPG IGFGLGLERV LLALEIQGQE PPPEGGVDVL VVTAGTGVDL AAFRLLAGLR AAGIRADKDY LERSLKGQMK YANRYPARMA VILGEEELAR GRVSVRRLDA GSQEEVPLAA VVDYCRKMKE SGW
|
| |