Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0572 |
Symbol | |
ID | 7406913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 644828 |
End bp | 646546 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714955 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_002572471 |
Protein GI | 222528589 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00211854 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTT CAGAGCTTTT TATGCCAACT TTGAAAGAGA CACCTTCAGA TGCAGAGATA GAATCACACA AGCTTATGCT AAGGTCTGGT TTTATGAGAC AGCTTTCCTC TGGCATTTAT GTTTATCTTC CACTTGGTTA CAGAGTTTTA AGAAAGATAG AAAACATTGT AAGAGAAGAG ATGGATAGAA GCGGTGCACA AGAGGTTCAT ATGTCTGCAC TTATGCCAAA AGAACTTTGG GAAGAGTCGG GTAGGTGGGC TGTATTTGGC CCTGAAATGT TTAGGATAAA AGATAGAAAC GAAAGGGAAT ACTGTTTGGG TCCCACTCAC GAAGAAGCAT TTACTTATAT AGTCAGAAAC GAGATTTCAT CTTACAGGGA CCTTCCTAAA ATTTTGTATC AAATCCAAAC AAAGTTTCGT GATGAGAGAA GACCGCGGTT TGGTGTTATG CGTTGCAGAG AGTTTACAAT GAAGGATGCT TATTCTTTTG ACATTGATGA GAACGGCTTG GACATATCAT ACCAAAAGAT GTATGATGCA TATGTGAGGA TATTCAAAAG ATGCGGGCTT GATGTAAAGA TTGTCGAGGC AGACACAGGT GCAATGGGCG GGGCAAGCTC ACACGAGTTT ATGGTTCCAT CATCTGTTGG TGAGGCAGAG ATTGCATATT GTAAAGCTTG CGGATATGCT GCAAACTTGG AAAAGGCAGA GTGTTTGGAT GAGCCTGTTG AAAACAAAGA GGAGCCAAAA GAAAAACAGG AAGTCTACAC TCCAAATGTG AGAACAATTG AGGAGCTTGT GAGTTTTCTT GGCATCGACA GCACAAGGTT TGTAAAAACA ATGATTTACA AGGCAGATGA CAAGTTTGTT GCAGTGCTGG TAAGGGGAGA TAGAGAGGTA AATGAGACAA AGCTAAAAAA TCTTTTAAAA GCTACAGACC TTGAACTTGC ATCGGCAGAG GATGTAGAGA AAATTACAGG TGCAAAAGTA GGATTTGCCG GTCCAATTGG TCTTTCGATA GATGTGTACG CAGACAATGA AGTAAAATAT CTCAAAAACT TTGTGGTGGG TGCAAATAAG ACAGATTATC ATATAAAGAA TGTTAATCTT TCAGATTTTA AAGTAACAAA GTTTACAGAC CTCAGGAATA TAACCCAAGA CGACCTTTGT CCAAAATGTC GCTCTCAGAA GGTGACAATT GAAAGAGGAA TTGAGGTTGG ACATATATTT AAGCTTGGCA CTAAATATAC ACAGGCATTT AACTGCGTGT ACACAGATGA AAAAGGCGAA AAGAAGCTCA TGATAATGGG ATGTTATGGT ATTGGTATCA ACAGGACAGC TGCAGCTATC ATTGAACAGA TGCACGATGA AGATGGTATA ATTTGGCCAA TTACAGTTGC ACCGTATGAG GTAATTATTG TGCCTGTAAA TGTAAAAGAT GAAAATCAGA AAAAGATTGC ATTTGAGATT TATGAAAACC TTCAGAGAAA TGGAGTTGAG GTTTTGATTG ATGACAGAGA TGAAAGAGCA GGTGTTAAGT TCAAGGATGC AGATTTGATA GGAATTCCGT TTAGAGTTAC TGTTGGAAAG AAAATATCAG AGGGAAAACT TGAGATTAGA AATAGAAGAA CAAAGGAATC TTTTGAAGTT GAAATTGAGA AAGCAATAGA GGTTGTAATT AATCTAATCA GGGAAGAAAA AGCAAAATAC CAAATATAA
|
Protein sequence | MKVSELFMPT LKETPSDAEI ESHKLMLRSG FMRQLSSGIY VYLPLGYRVL RKIENIVREE MDRSGAQEVH MSALMPKELW EESGRWAVFG PEMFRIKDRN EREYCLGPTH EEAFTYIVRN EISSYRDLPK ILYQIQTKFR DERRPRFGVM RCREFTMKDA YSFDIDENGL DISYQKMYDA YVRIFKRCGL DVKIVEADTG AMGGASSHEF MVPSSVGEAE IAYCKACGYA ANLEKAECLD EPVENKEEPK EKQEVYTPNV RTIEELVSFL GIDSTRFVKT MIYKADDKFV AVLVRGDREV NETKLKNLLK ATDLELASAE DVEKITGAKV GFAGPIGLSI DVYADNEVKY LKNFVVGANK TDYHIKNVNL SDFKVTKFTD LRNITQDDLC PKCRSQKVTI ERGIEVGHIF KLGTKYTQAF NCVYTDEKGE KKLMIMGCYG IGINRTAAAI IEQMHDEDGI IWPITVAPYE VIIVPVNVKD ENQKKIAFEI YENLQRNGVE VLIDDRDERA GVKFKDADLI GIPFRVTVGK KISEGKLEIR NRRTKESFEV EIEKAIEVVI NLIREEKAKY QI
|
| |