Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1044 |
Symbol | |
ID | 3831850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1071961 |
End bp | 1073682 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828972 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_429901 |
Protein GI | 83589892 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000019894 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCGAGCGA GTGAATTGCT GGCTCCGACC CTGAGGGAAA CCCCAGCCGA GGCCGAAATT GTTAGCCACC AGCTACTCCT CCGGGGCGGC TTTATCCGTA AGGCCGCCGC CGGCATCTAC ACCTACCTGC CCCTGGGTCG GCGGGTACTG GCCAAGATCG AGCAGATTAT CCGGGAAGAA ATGGACCGGG CCGGCGGGCA GGAAGTGGTC TTGCCCATTA TCCAGCCGGC CGAACTCTGG CAGGAAAGCG GCCGTTGGGA GGTCTACGGC GAAGAAATGT TTCGCCTCCA GGACCGGCAC CGGCGCCAGT TTTGCCTGGG TCCCACCCAC GAAGAGATTA TCACCGCCCT GGTACGGAGC GAAGTCACCT CCTACAAACA ACTCCCCCTG CTCCTGTACC AGATTCAAAA TAAATACCGG GATGAGCGCC GGCCCCGCTT TGGTCTCCTG CGGGGTCGGG AGTTTATCAT GAAGGACCTC TATTCCTTTG ACCTGGACCA GGAAGGGCTT AACCAGAGCT ACCGGAAGAT GTACCAGGCC TACAGTAATG TTTTCCGTCG CTGTGGTCTG GATTTCCGCC CGGTCCAGGC TGATAGCGGT GCTATCGGCG GCAATTACAG CCACGAATTT ATGGCCCTGG CCACCGCCGG TGAGGCCCTG CTGGTTTATT GCCGGGAGTG CGATTATGCG GCCAATGTGG AAATCGCCGT GGCAAAAGCC CTGCCCATGA TAGCGACGGA AAATCCCGCT CCTTTAAAGG AAGTGGCTAC ACCGGGGCAA AAGACGGTGG CGGAAATCTG CACCTTCCTG GAGGTCACCC CGGACAGGCT CATCAAAACC CTCTTTTACG AGGCCGACGG CCAGCTTATT GCTGCCCTGG TTCGCGGCGA CAGGGAGCTC AACGAAGTCA AGCTCCAGAA TCATCTCGGC TGCCGGCACC TGCTCCTGGC AGACCCTGAA AGGGTGCGGA AGGCCACCGG GGCGCCGGTC GGCTTTGTCG GCCCGGTGGG CTTGCAGGGT ATACCCCTTT ACGCCGACCT GGAAATACCC TACCTGGTCA ACGGGGTGGC TGGTGCCAAC CGGGAGGGCT ACCACCTGGT AAACGTCAAC CCGGGCCGGG ACTTTAACCC CACAGCTGTG GTCGACATCC GCCAGGTGGA GGCCGGGGAA CCCTGTCCCC AGTGCGGTGC CCCCCTGGCC CAGGCCCGGG GGATCGAGGT TGGCCAGGTC TTCCAGTTAG GAACCAAATA TAGCGGCGCC CTGGGAGCCA ATTATACCGA CGCCCGGGGC CAGGAGCATC CCATCGTGAT GGGCTGCTAT GGTATTGGCG TTAGCCGGAC CATGGCGGCG ATTGTCGAGC AATGCCACGA CGACCAGGGG ATTATCTGGC CTTTGAGCGT TGCTCCCTAC CAGGTGGTTA TTATCCCGGC CTCCCTGAAG GATGACGGCC AGCGGCAAGT GGCCGAAGGG CTGTACCGGG AACTGGCCGC CGCCGGGGTG GAAGTCGTCT ATGACGACCG GGATGAACGG GCCGGTCTCA AGTTTGTCGA GGCGGACCTC ATCGGTTATC CCCTGCGGAT AACCGTCGGC AAGAGGACCA TCACCAGCGG CACGGTGGAC GTTAAATGGC GGTCCCGGAA GGAGGAAACA CCGCTGCCCC TGGAGGGGCT GTCGGCGCAG ATCCAGGCCT TGCTGGCCCG GGAGATGGAA AAGTACCGGT AA
|
Protein sequence | MRASELLAPT LRETPAEAEI VSHQLLLRGG FIRKAAAGIY TYLPLGRRVL AKIEQIIREE MDRAGGQEVV LPIIQPAELW QESGRWEVYG EEMFRLQDRH RRQFCLGPTH EEIITALVRS EVTSYKQLPL LLYQIQNKYR DERRPRFGLL RGREFIMKDL YSFDLDQEGL NQSYRKMYQA YSNVFRRCGL DFRPVQADSG AIGGNYSHEF MALATAGEAL LVYCRECDYA ANVEIAVAKA LPMIATENPA PLKEVATPGQ KTVAEICTFL EVTPDRLIKT LFYEADGQLI AALVRGDREL NEVKLQNHLG CRHLLLADPE RVRKATGAPV GFVGPVGLQG IPLYADLEIP YLVNGVAGAN REGYHLVNVN PGRDFNPTAV VDIRQVEAGE PCPQCGAPLA QARGIEVGQV FQLGTKYSGA LGANYTDARG QEHPIVMGCY GIGVSRTMAA IVEQCHDDQG IIWPLSVAPY QVVIIPASLK DDGQRQVAEG LYRELAAAGV EVVYDDRDER AGLKFVEADL IGYPLRITVG KRTITSGTVD VKWRSRKEET PLPLEGLSAQ IQALLAREME KYR
|
| |