Gene Mlg_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0539 
Symbol 
ID4268068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp585706 
End bp587412 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content70% 
IMG OID638125280 
Productprolyl-tRNA synthetase 
Protein accessionYP_741383 
Protein GI114319700 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00847816 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGAGCCA GCCTGTTTCC GCTCTCCACC AGCAAGGAGA CCCCCGCCGA CGCCGAGATC 
GTCAGCCACC AGCTCATGCT CCGGGCCGGG ATGATCCGCA AACTGGCGGC CGGCCTGTAC
ACCTGGACAC CGCTGGGGCT GCGGGTGCTG CGCAAGGTGG AGCAGATCGT GCGCGAGGAG
ATGGACCGCG CCGGCGCCCA CGAGCTGCTG ATGCCGGCGG TGCAGCCGGC GGAGCTCTGG
CAGGAGTCCA CCCGCTGGGA CAAGTACGGC CCCGAGCTGC TGCGGCTGAA GGACCGCCAC
GAGCGCGACT TCTGCTTCGG CCCCACCCAC GAGGAGGTGA TCACCGACTA CGTGCGCCGG
GAGGTGAAGA GCTACCGCCA GCTGCCGCTC AACCTCTACC AGATCCAGAC CAAGTTCCGC
GACGAGATCC GCCCCCGCTT CGGGGTCATG CGCGCCCGCG AGTTCCTGAT GAAGGACGCC
TACTCCTTCC ACCTGGACGA CGACTGCCTG GCGCGCACCT ACCAGGTGAT GTACGAAACC
TACACCCGGA TCTTCGAGCG CACCGGCCTG GTCTTCCGCG CGGTGGCGGC CGACTCGGGC
AACATCGGCG GCAGCGTCTC CCACGAGTTC CACGTGCTGG CCGAGTCCGG TGAGGACGCG
GTGGCCTTCT CCGATGAGAG CGACTACGCC GCCAACGTGG AGCTGGCCGA GGCGGTGGCC
CCGGCCGGCG AGGCCCCGCC ACCGGCCGAG ACCATGCGCC GGGTGGACAC CCCCGGGGCA
CGCACCATCG ACGACCTGGT CCGAGACTAC GGCCTGCCCA TCGAGAAGAC CGTCAAGACC
CTGGTGGTGC ACGGCGCCGA CGGTGGCCTG GTGGCCCTGC TGGTACGCGG CGACCACAGC
CTGAACGACG TCAAGGCCAC GACCCTGCCC CAGGTGGCCG AGCCGCTGGT GATGGCCGGC
GAGGAGGAGA TCCGCGCCGC GGTGGGCGCC GGCCCCGGCT CGCTGGGCCC GGTGGAACTG
CCCCTGCCCT GCGTGGTCGA CCGCAGCGTG GCAGTGATGA GCGACTTCGC CGCCGGCGCC
AACCAGGACG ACGCGCACTA TTTCGGTATC AACTGGGGCC GCGATGTGGC CCTGCCCGAG
GTGGCCGACC TGCGCGAGGT GGTGGCCGGC GACCCGAGCC CCGACGGCCG GGGCACCCTG
GAGATCGCCC GCGGCATCGA GGTGGGCCAT ATCTTCCAAT TGGGCCGGGA GTACAGCGAG
AAGATGAAGG CCACGGTGCT GAACGAGGCC GGCGACGCCC AGACCGTGAC CATGGGCTGC
TACGGCATCG GCGTCTCCCG CGTGGTGGCC GCCGCCATCG AGCAGAACCA CGACGACAAC
GGCATCATCT GGCCCGCGCC TATCGCCCCC TTCCAACTGG CCCTGGTGCC CATCGGCATG
AACCGTTCCG AGGCGGTGAC CGAGCAGGCC GAAAAGCTCT ACGCCGAGCT GCAGGCGGAG
GGCGTGGAGG TCTTTTTCGA TGACCGCGAC GCCCGCCCGG GGGTGAAGTT CGCCGACATG
GAACTGATCG GCATCCCCCA CCGGCTGGTG ATCGGTGACC GGGGGCTGAA AAACGGGGTG
GTGGAGTACC GGGGCCGGCG GGACAGCGAG AGCACCGATG TGCCGCTGGC GGAGCTGAGC
GCCTTCCTGC GGGAACGCCT GGGCTGA
 
Protein sequence
MRASLFPLST SKETPADAEI VSHQLMLRAG MIRKLAAGLY TWTPLGLRVL RKVEQIVREE 
MDRAGAHELL MPAVQPAELW QESTRWDKYG PELLRLKDRH ERDFCFGPTH EEVITDYVRR
EVKSYRQLPL NLYQIQTKFR DEIRPRFGVM RAREFLMKDA YSFHLDDDCL ARTYQVMYET
YTRIFERTGL VFRAVAADSG NIGGSVSHEF HVLAESGEDA VAFSDESDYA ANVELAEAVA
PAGEAPPPAE TMRRVDTPGA RTIDDLVRDY GLPIEKTVKT LVVHGADGGL VALLVRGDHS
LNDVKATTLP QVAEPLVMAG EEEIRAAVGA GPGSLGPVEL PLPCVVDRSV AVMSDFAAGA
NQDDAHYFGI NWGRDVALPE VADLREVVAG DPSPDGRGTL EIARGIEVGH IFQLGREYSE
KMKATVLNEA GDAQTVTMGC YGIGVSRVVA AAIEQNHDDN GIIWPAPIAP FQLALVPIGM
NRSEAVTEQA EKLYAELQAE GVEVFFDDRD ARPGVKFADM ELIGIPHRLV IGDRGLKNGV
VEYRGRRDSE STDVPLAELS AFLRERLG