Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1227 |
Symbol | |
ID | 7083887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1361288 |
End bp | 1363036 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698243 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_002354882 |
Protein GI | 217969648 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.463936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCCA GCCAGTTCCA TCTCTTCACC CTCAAGGAAG CCCCCTCCGA CGCCGAGGTC GTCAGCCAGA AGCTGATGCT GCGCGCCGGC ATGATCCGCA AGGTCGCCGC CGGCATCTAC AGCTACATGC CGATGGGCCT GCGCGCGATC CGCAAGGTCG AGGCCATCGT GCGCGAGGAG ATGGACCGCG CCGGCGCCAT GGAACTGATC ATGCCGATGG TGCAGCCCGC CGAGCTGTGG GACGAGACCG GGCGCTGGGA CAAGATGGGC GACGAGCTGC TGCGCTTCAA GGACCGCCAC GAGCGCGACT TCGCGCTGCA ACCCACCTCC GAGGAGGTCG TCACCGACAT CGCGCGCCAG GAGCTCAAGA GCTACCGCCA GCTGCCGAAG AACTTCTACC AGATCCAGAC CAAGTTCCGC GACGAACGCC GGCCGCGCTT CGGCGTCATG CGCGGGCGCG AATTCACCAT GAAGGACGCC TACTCCTTCG ACCGCAGCGA GGAAGCCGCC GGGCGCAGCT ACGACATCAT GTTCGCCGCC TACAAGCGCA TCTTCGACCG CCTCGGCCTC GAATACCGCG CGGTGGCGGC CGACACCGGC GCCATCGGCG GCGACCGCTC GCACGAGTTC CAGGTCATCG CCGACACCGG CGAGGACGCC ATCGTCTACT GCCCCGACTC CGACTACGCT GCCAACATCG AGCTCGCCGA GGCGGTGTCC CTGCTCGCCC GCCGCGCAGA GCCCGCCCGG GCGCTGGCAA AAACCCCCAC ACCGGGCAAG GCGACCTGCG AGGACGTCGC CGCGCTGCTG GGCGTGCCGC TGGCGACCAC GGTGAAGTCG CTGGTACTCG CCACCGACGA TGTCGATGAA AAGGGCAAGC CCGCCGGCGT CACCGTGTGG CTGCTGCTGG TGCGCGGCGA CCACGCCCTC AACGAGGTCA AGGCCGGCAA GCTGGAAGGC CTCAAGGCAG GCTTCCGCTT CGCCACCGAG GCCGAGATCC TCGAACACTT CGGCTGCAAG CCGGGCTACC TCGGCCCCAT CGGTCTGAAG AAGCCGGTCA AGGTCATCGC CGACCGCACG GTCGCCCACA TGGCCGACTT CATCTGCGGC GCCAACGAGG CCGACTTCCA CTTCACCGGC GTGAACTGGG GCCGCGACCT GCCCGAGCCC GACCAGGTCG CCGACCTGCG CAACGTCGTC GAGGGCGACC CCAGCCCGGA CGGCCGCGGC GTGCTCGCCA TCCAGCGCGG CATCGAGGTC GGTCACGTGT TCTACCTCGG CACCAAGTAC TCCAAGGCGA TGAACGCCAC CTTCCTCGAC GAGGACGGCA AGCCCAAGCA CTTCGAAATG GGCTGCTACG GCATCGGCGT GACCCGCATC CTGGGCGCCG CGATCGAGCA GAACCACGAC GCCCGCGGCA TCGTCTGGCC GCGCGCGATC GCACCCTTCG AGGTGGTGGT CTGCCCGGTC GGCTGGGGCA AGTCGCAGGC GGTGCGCGAC GAGGCGCAGA AGCTCTACGA CGCGCTCGTC GCCACCGGCG TCGACGCCAT CCTCGACGAC CGCGACGAGC GCCCGGGCGT GATGTTCGCC GACTGGGAAC TGATCGGCGT GCCGCACCGC GTCACCATCG GCGACCGCGG GCTCAAGGAA GGCGTGATCG AATACCAGGG CCGGCGCGAT GCCGAAGCCG CCAAGCTGCC CGCAGGCGAG GTCCTCGGCC ACGTGCTCGA GCGCCTGAAC CAGGGCTGA
|
Protein sequence | MRASQFHLFT LKEAPSDAEV VSQKLMLRAG MIRKVAAGIY SYMPMGLRAI RKVEAIVREE MDRAGAMELI MPMVQPAELW DETGRWDKMG DELLRFKDRH ERDFALQPTS EEVVTDIARQ ELKSYRQLPK NFYQIQTKFR DERRPRFGVM RGREFTMKDA YSFDRSEEAA GRSYDIMFAA YKRIFDRLGL EYRAVAADTG AIGGDRSHEF QVIADTGEDA IVYCPDSDYA ANIELAEAVS LLARRAEPAR ALAKTPTPGK ATCEDVAALL GVPLATTVKS LVLATDDVDE KGKPAGVTVW LLLVRGDHAL NEVKAGKLEG LKAGFRFATE AEILEHFGCK PGYLGPIGLK KPVKVIADRT VAHMADFICG ANEADFHFTG VNWGRDLPEP DQVADLRNVV EGDPSPDGRG VLAIQRGIEV GHVFYLGTKY SKAMNATFLD EDGKPKHFEM GCYGIGVTRI LGAAIEQNHD ARGIVWPRAI APFEVVVCPV GWGKSQAVRD EAQKLYDALV ATGVDAILDD RDERPGVMFA DWELIGVPHR VTIGDRGLKE GVIEYQGRRD AEAAKLPAGE VLGHVLERLN QG
|
| |