Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1919 |
Symbol | |
ID | 3830843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1990752 |
End bp | 1991765 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829852 |
Product | tryptophanyl-tRNA synthetase |
Protein accession | YP_430762 |
Protein GI | 83590753 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0180] Tryptophanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00233] tryptophanyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00690514 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAA TCCAAACCAA GAAAAAAGGA CGAATACTGA CCGGGGACCG GCCGACAGGG AAGTTGCACC TCGGACATTA TGTAGGCAGC CTCATCAACC GGGTGCGGTT ACAGGATGAG TATGATACCT TTCTGATTAT CGCCGATGTG CAAGCATTGA CTACCAATTT CGAGGAACCC GAAAAGCTGG CCCGTGATGT CCGGGAAGTC GCCCTGGACT ACCTGGCGGC AGGGATCGAC CCGGAGAAAA GCACCATTTT CGTCCAGTCT CTGGTACCGG AAATTGCCGA ACTAACTATC TTTTACTCCA TGATTATCAC CGTCAATACC CTGCGCCATA ACCCGACCAT CAAGTCAGAA GCCGCCCAAA GGGGCTATAC CGACATGACC TACGGCTTCC TGGGTTATCC GGTAAGCCAG GCGGCGGATA TTACTTTCTG CAAGGCCAAC CTGGTGCCCG TAGGTGAGGA CCAGTTGCCC CACATTGAGT TGACCCGGAA GCTTGTCCGT CGCTTTAACA GCCTCTACGG CCCGGTCCTG GTAGAGCCCG AGGCCCTGGT AGGGGAAGTG CCGCGTCTGG TTGGCCTGGA TGGAGCGGCC AAGATGAGCA AATCCCTGGA TAATGCCATC AACCTATCCG ACCCGCCGGA AGAGGTCGAA CGCCGGGTCA AAAATGCAGT AACTGACCCG GCCCGCATCC GGGCTACCGA TCCCGGCCAC CCCGATATCT GTACCGTTTT TGCTTATCAT ACCGCTTTCA ATAAGCCGGT GATTCCGGAG ATCGAAGAAT CTTGTAAAAA AGGCGCCATC GGCTGCGTGG CCTGTAAAAA GCGGTTAACA GCCACCCTCA ACGAACTGCT GGAGCCCATG CGGGAACGAA GGGCCAGGTA CGAAGCCAAC CCTAAATTGG TTGATGAAAT CCTCCTGGCC GGGACGGCGC GCGCCCGGGC GGTGGCGAAA GAAACTATGG CCCAGGTCCG GGAAGCCATG AAGATTAATT ATTTTCCCGG TTAG
|
Protein sequence | MAEIQTKKKG RILTGDRPTG KLHLGHYVGS LINRVRLQDE YDTFLIIADV QALTTNFEEP EKLARDVREV ALDYLAAGID PEKSTIFVQS LVPEIAELTI FYSMIITVNT LRHNPTIKSE AAQRGYTDMT YGFLGYPVSQ AADITFCKAN LVPVGEDQLP HIELTRKLVR RFNSLYGPVL VEPEALVGEV PRLVGLDGAA KMSKSLDNAI NLSDPPEEVE RRVKNAVTDP ARIRATDPGH PDICTVFAYH TAFNKPVIPE IEESCKKGAI GCVACKKRLT ATLNELLEPM RERRARYEAN PKLVDEILLA GTARARAVAK ETMAQVREAM KINYFPG
|
| |