Gene Information |
Locus tag | GM21_2230 |
Symbol | thrS |
ID | 8137568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2600064 |
End bp | 2601974 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869844 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_003022037 |
Protein GI | 253700848 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
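The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates (the record does not state this) and that the CDS ends with a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2600064
end_bp = 2601974
gene_length_bp = 1911
protein_length_aa = 636

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
expected_aa = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the table: 1911 bp corresponds to 637 codons, which is 636 residues plus the stop.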
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.00712554 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGAGA TAAACGTCAC GCTACCGGAT GGTTCCCAAA GACCCCTGCC CGCAGGTGCC TCCATTTTCG ATCTCGCCGC ATCCATCGGA GCGGGGCTTG CCAAGGCGGC CATCGCCGGG AAGATCGACG GCAACCTGGT CGACCTCAAC ACACCTCTTG CCGACGGCGC CCGTGTCGAG ATCGTCACCG AGAAAAGCCC CGAGGCGCTT GAGATCATCA GGCACTCCAC CTCGCACCTT ATGGCGCAGG CCGTAAAGGC GCTCTTCCCG CAGGCGAAGG TGACCATCGG TCCCGCCATC GAGACCGGTT TTTACTACGA CTTCGACGTC GACCACCCCT TCACCCCGGA AGACCTCGAG AAGATAGAAG AAAAGATGCG CGAGCTCGCC AAGGCCGATC TCAAGATCGA GCGCAGGGAG CTCACCAGCG CCGACGCCAT AGCCCTCTTC AAGGGGATGG GCGAGGACTA CAAGGTCGAA CTGATCGAGG ACCTGGGGGC GGACAAGGTT TCGCTTTACA GCCAGGGCGA TTTCGTCGAC CTCTGCCGCG GGCCGCATCT CCCCAAGACC TCCTACATCA AGGCCTTCAA GCTCACCTCC ATAGCCGGCG CCTACTGGCG CGGCGACGAG AAGCGCGCCA TGCTGCAGCG CGTCTACGGC ACCGCCTTCG GCGACAAGAA GGAGCTCGAA GCCTATCTGG CCAGGATCGA AGAGGCGAAA AAGCGCGACC ACCGCAAGCT CGGCCGCGAA CTGGACCTCT TCTCCTTCAA CGACGAGGTC GGCGCAGGGC TCGTGATCTG GCACCCCAAA GGGGCTATGC TCCGCACCAT CCTCGAGGAC TTCGAGAGGA AGGAGCACCT AAAGCGCGGC TACGACATCG TTCTTGGTCC GCAGATCCTC AAGACCGAAC TCTGGCAGCG CTCCGGGCAC TACGAGAACT ACCGCGAGAA CATGTATTTC ACCACGGTGG ACGAGCAGAG CTACGGCGTG AAGCCGATGA ACTGCCTGGC CCACATGATG ATCTACCGCT CGCAGCTTCG CTCTTACCGC GACCTGCCGC TGCGCTACTT CGAGCTCGGC ACTGTGCACC GCCACGAGCG CGCCGGCGTT CTGCACGGCC TTCTGCGCGT GCGCGGCTTC ACCCAGGACG ACGCCCATAT CCTCTGCACC CCAGATCAGC TCGACGCCGA GATCAAAGGG GTCATCCAGT TCGTCACCGA GGTGATGGGT ATCTTCGGCT TCGAGTTCGA GATGGAGCTT TCCACCCGTC CCGAGAAGTC GATCGGCTCC GACGACGCCT GGGAGCTTGC CACGAGCGCT CTCCTGAACG CGCTCAAGGA CTCGGGCCGC CCTTACGAAA TCAACGAGGG GGACGGCGCG TTCTACGGTC CGAAGATCGA CATCAAACTG CGTGACGCGC TTGACAGAAG ATGGCAATGT GCTACAATCC AGTGCGATTT TACCCTCCCG GAGCGTTTCG ATCTCACCTA CGTCGACGCA GACGGTGAAA AGAAGCGCCC CGTCATGGTG CACAGGGTCA TCCTGGGCGC CATCGAACGC TTCATCGGTG TCCTCATCGA ACATTTCGCT GGAAACTTCC CGACTTGGCT GGCACCGGTT CAGGCGACCA TCGTTACGGT CACCGACAAC CAGATTCCGT ACGCGCAGGC GGCGTTCGAC AAGCTGCGCG CGGCCGGGAT AAGGGTGCAG AAAGATTTCA GGAACGAGAA GCTTGGCTTC AAGATTCGCG AAGCCCAGCT CCAGAAGATA CCGTACATGC TGGTGGTGGG GGACAAGGAG 
GTCGAGAGCG GCATGCTGGC GCCGCGATTC CGCGACGGCA AGAACCTCGA GTCCATGACC CCGGAGCAGT TCATTACTTT TATCGAAAAC GAAGTCAAAA GTTATAAATA A
|
Protein sequence | MNEINVTLPD GSQRPLPAGA SIFDLAASIG AGLAKAAIAG KIDGNLVDLN TPLADGARVE IVTEKSPEAL EIIRHSTSHL MAQAVKALFP QAKVTIGPAI ETGFYYDFDV DHPFTPEDLE KIEEKMRELA KADLKIERRE LTSADAIALF KGMGEDYKVE LIEDLGADKV SLYSQGDFVD LCRGPHLPKT SYIKAFKLTS IAGAYWRGDE KRAMLQRVYG TAFGDKKELE AYLARIEEAK KRDHRKLGRE LDLFSFNDEV GAGLVIWHPK GAMLRTILED FERKEHLKRG YDIVLGPQIL KTELWQRSGH YENYRENMYF TTVDEQSYGV KPMNCLAHMM IYRSQLRSYR DLPLRYFELG TVHRHERAGV LHGLLRVRGF TQDDAHILCT PDQLDAEIKG VIQFVTEVMG IFGFEFEMEL STRPEKSIGS DDAWELATSA LLNALKDSGR PYEINEGDGA FYGPKIDIKL RDALDRRWQC ATIQCDFTLP ERFDLTYVDA DGEKKRPVMV HRVILGAIER FIGVLIEHFA GNFPTWLAPV QATIVTVTDN QIPYAQAAFD KLRAAGIRVQ KDFRNEKLGF KIREAQLQKI PYMLVVGDKE VESGMLAPRF RDGKNLESMT PEQFITFIEN EVKSYK
|
| |
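To illustrate how the gene sequence relates to the protein sequence, here is a minimal sketch that translates the first and last few codons listed above. It uses the standard codon assignments, which translation table 11 shares for sense codons; the helper name `translate` and the constants are hypothetical, and the sequence is assumed to be given in coding orientation (it begins with ATG and ends with TAA as listed).

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 21 nucleotides, copied from the Gene sequence field.
first_30 = "ATGAACGAGA TAAACGTCAC GCTACCGGAT"
last_21 = "GAAGTCAAAA GTTATAAATA A"

print(translate(first_30))
print(translate(last_21))
```

The two fragments can be compared with the start (`MNEINVTLPD`) and end (`EVKSYK`) of the protein sequence field.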