Gene Moth_1044 details

Gene Information

Locus tag: Moth_1044
Symbol:
ID: 3831850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1071961
End bp: 1073682
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 61%
IMG OID: 637828972
Product: prolyl-tRNA synthetase
Protein accession: YP_429901
Protein GI: 83589892
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
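
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1071961, 1073682)
print(length_bp)                  # 1722
print(protein_length(length_bp))  # 573
```

Both values match the Gene Length and Protein Length fields of this record.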


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000019894
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCGAGCGA GTGAATTGCT GGCTCCGACC CTGAGGGAAA CCCCAGCCGA GGCCGAAATT 
GTTAGCCACC AGCTACTCCT CCGGGGCGGC TTTATCCGTA AGGCCGCCGC CGGCATCTAC
ACCTACCTGC CCCTGGGTCG GCGGGTACTG GCCAAGATCG AGCAGATTAT CCGGGAAGAA
ATGGACCGGG CCGGCGGGCA GGAAGTGGTC TTGCCCATTA TCCAGCCGGC CGAACTCTGG
CAGGAAAGCG GCCGTTGGGA GGTCTACGGC GAAGAAATGT TTCGCCTCCA GGACCGGCAC
CGGCGCCAGT TTTGCCTGGG TCCCACCCAC GAAGAGATTA TCACCGCCCT GGTACGGAGC
GAAGTCACCT CCTACAAACA ACTCCCCCTG CTCCTGTACC AGATTCAAAA TAAATACCGG
GATGAGCGCC GGCCCCGCTT TGGTCTCCTG CGGGGTCGGG AGTTTATCAT GAAGGACCTC
TATTCCTTTG ACCTGGACCA GGAAGGGCTT AACCAGAGCT ACCGGAAGAT GTACCAGGCC
TACAGTAATG TTTTCCGTCG CTGTGGTCTG GATTTCCGCC CGGTCCAGGC TGATAGCGGT
GCTATCGGCG GCAATTACAG CCACGAATTT ATGGCCCTGG CCACCGCCGG TGAGGCCCTG
CTGGTTTATT GCCGGGAGTG CGATTATGCG GCCAATGTGG AAATCGCCGT GGCAAAAGCC
CTGCCCATGA TAGCGACGGA AAATCCCGCT CCTTTAAAGG AAGTGGCTAC ACCGGGGCAA
AAGACGGTGG CGGAAATCTG CACCTTCCTG GAGGTCACCC CGGACAGGCT CATCAAAACC
CTCTTTTACG AGGCCGACGG CCAGCTTATT GCTGCCCTGG TTCGCGGCGA CAGGGAGCTC
AACGAAGTCA AGCTCCAGAA TCATCTCGGC TGCCGGCACC TGCTCCTGGC AGACCCTGAA
AGGGTGCGGA AGGCCACCGG GGCGCCGGTC GGCTTTGTCG GCCCGGTGGG CTTGCAGGGT
ATACCCCTTT ACGCCGACCT GGAAATACCC TACCTGGTCA ACGGGGTGGC TGGTGCCAAC
CGGGAGGGCT ACCACCTGGT AAACGTCAAC CCGGGCCGGG ACTTTAACCC CACAGCTGTG
GTCGACATCC GCCAGGTGGA GGCCGGGGAA CCCTGTCCCC AGTGCGGTGC CCCCCTGGCC
CAGGCCCGGG GGATCGAGGT TGGCCAGGTC TTCCAGTTAG GAACCAAATA TAGCGGCGCC
CTGGGAGCCA ATTATACCGA CGCCCGGGGC CAGGAGCATC CCATCGTGAT GGGCTGCTAT
GGTATTGGCG TTAGCCGGAC CATGGCGGCG ATTGTCGAGC AATGCCACGA CGACCAGGGG
ATTATCTGGC CTTTGAGCGT TGCTCCCTAC CAGGTGGTTA TTATCCCGGC CTCCCTGAAG
GATGACGGCC AGCGGCAAGT GGCCGAAGGG CTGTACCGGG AACTGGCCGC CGCCGGGGTG
GAAGTCGTCT ATGACGACCG GGATGAACGG GCCGGTCTCA AGTTTGTCGA GGCGGACCTC
ATCGGTTATC CCCTGCGGAT AACCGTCGGC AAGAGGACCA TCACCAGCGG CACGGTGGAC
GTTAAATGGC GGTCCCGGAA GGAGGAAACA CCGCTGCCCC TGGAGGGGCT GTCGGCGCAG
ATCCAGGCCT TGCTGGCCCG GGAGATGGAA AAGTACCGGT AA
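
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field in Gene Information. The helper names are hypothetical, and only the first line fragment of the sequence is used as input here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 30 bases of the gene sequence, copied from the listing above.
fragment = "ATGCGAGCGA GTGAATTGCT GGCTCCGACC"
print(len(clean_sequence(fragment)))  # 30
print(gc_fraction(fragment))          # 0.6
```

Passing the full 1722 bp listing to `gc_fraction` and rounding to a percentage would give a value to compare against the reported GC content.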
 
Protein sequence
MRASELLAPT LRETPAEAEI VSHQLLLRGG FIRKAAAGIY TYLPLGRRVL AKIEQIIREE 
MDRAGGQEVV LPIIQPAELW QESGRWEVYG EEMFRLQDRH RRQFCLGPTH EEIITALVRS
EVTSYKQLPL LLYQIQNKYR DERRPRFGLL RGREFIMKDL YSFDLDQEGL NQSYRKMYQA
YSNVFRRCGL DFRPVQADSG AIGGNYSHEF MALATAGEAL LVYCRECDYA ANVEIAVAKA
LPMIATENPA PLKEVATPGQ KTVAEICTFL EVTPDRLIKT LFYEADGQLI AALVRGDREL
NEVKLQNHLG CRHLLLADPE RVRKATGAPV GFVGPVGLQG IPLYADLEIP YLVNGVAGAN
REGYHLVNVN PGRDFNPTAV VDIRQVEAGE PCPQCGAPLA QARGIEVGQV FQLGTKYSGA
LGANYTDARG QEHPIVMGCY GIGVSRTMAA IVEQCHDDQG IIWPLSVAPY QVVIIPASLK
DDGQRQVAEG LYRELAAAGV EVVYDDRDER AGLKFVEADL IGYPLRITVG KRTITSGTVD
VKWRSRKEET PLPLEGLSAQ IQALLAREME KYR
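
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons; since this gene begins with ATG, alternative start codons are not handled. All names in the code are illustrative.

```python
from itertools import product

# Codon order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGCGAGCGA GTGAATTGCT GGCTCCGACC"))  # MRASELLAPT
print(translate("AAGTACCGGT AA"))                     # KYR
```

The two outputs agree with the first ten and last three residues of the protein sequence in this record.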