Gene Hlac_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0447 
SymbolmetG 
ID7401065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp463085 
End bp465229 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content68% 
IMG OID643707511 
Productmethionyl-tRNA synthetase 
Protein accessionYP_002565119 
Protein GI222478882 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0143] Methionyl-tRNA synthetase 
TIGRFAM ID[TIGR00398] methionyl-tRNA synthetase
[TIGR00399] methionyl-tRNA synthetase C-terminal region/beta chain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.381782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACG ACGACTTCCC CACCGACGAT CCGGCGGTGG TGACCTGTGG ACTGCCGTAC 
GCCAACGGCG ACCTCCACAT CGGCCACCTC CGCACCTACG TCGGCGGCGA CGTGTACCGG
CGCGCGCTCG AACGACTCGG GCAGGAGACC GCGTTCGTCT CCGGCTCCGA CATGCACGGC
ACCCCCGTCG CGGTCAACGC CGAGCAGGAG GGGGTCTCCC CCGAGCAGTT CGCGCTCGAC
TGGCACGAGC AGTACGCCGC GACGTTCCCG AAGTTCAACG TCGAGTTCGA CAACTACGGC
CACACCCACG ACGAGACGAA CACGGCGGTG ACCCAACAGC TCGTCCGTGA CCTCGACGAG
GGCGGCCACC TCTACGAGAA GGAGATCATG GTCGCGTACG ACCCCGTCGA CGACCAGTTC
CTCCCGGACC GGTACGTCGA GGGGACCTGC CCCTACTGCG GTGCACACGC CCGCGGCGAC
GAGTGCGACG AGGGGTGCCA GCGCCACCTC GAACCCGGCG AGGTCGAGGA CCCCGAATCG
ACGATCACCG GCAACCCCGC CGAGTACCGC GAGCGCACCC ACCAGTTCTT CGAAGTCTCC
GCGTTCTCCG AGTACCTCTC CGGCTTCCTC GATCGGCTGG AGGGGACCTC GAACGCCCGG
AACCAGCCCC GCGAGTGGAT CGAGCAGGGG CTCCAAGACT GGTGTATCAC CCGCGACATG
GACTGGGGGA TCGACTACCC CGGCGAAAAC CCGCAGGACC TCGTCTTATA CGTCTGGGTC
GACGCCCCGA TCGAGTATAT CTCCTCGACG AAGCAGTACA CCGAGCGCGT CGGCGCCGAC
GCCTTCGACT GGGAGGCCGC GTGGAAGGAG GGCGCGAGCG ACGCGCACCC CGAGGGCGGC
GAGATCGTCC ACGTGATCGG CCGCGACATC ATTCAACATC ACACGATCTT CTGGCCCGCG
ATGCTGGAGG CGACCGACCA CACCGAGCCG CGCGCCGTGA TGGCGAGCGG CTTCGTCACC
CTCGGCGGCA AGGGCTTCTC CACGAGCCGC GACCGCGCGG TCTGGGCCGA CGAGTACCTC
GACGAGGGGT TCCATCCCGA CCCGCTGCGC TACTACCTCG CGACCAACGG CGGGTTCCAG
CAGGACGTGG ACTTCTCGTG GGAGAAGTTC CGCGACCGCG TCAACACCGA GCTGGTGGGG
ACCGTCGGCA ACTTCCTCTA CCGCTCGCTG CTCTTCGCCC ACCGCAACTA CGACGACGCG
CCCATCGCGG ACGCGACGAG CGACGAGGTC GCCGAGCGGA TCGAGGAGGC GATCGCCGAC
TTCGAGGCCG CCGTCAACGA CTACTCCGTG CGCGCGGTCG GCGACGCCGT CACCGACCTC
GCCCGGTTCG GCAACGAGTA CATCCAGCGC AGCGAGCCGT GGAAGCTCGT GGACGACGAC
CCCGAGGAGG CCGCGCAGGT CATCCACGAC TGCGTCGCGA TCGCGAAGGC GATCGCGGTC
CTGTTCGAGC CCATCGCACC CGAGAAGACC GAGCGTCTCT GGGACCAGCT CGGCGAGGAC
GGCTCGGTCC ACGAGACCAC CGTCGAGGCG GCCCGCGAGG GCCCCGCCGG CGACCTCGCG
GAGCCGACGG AGCTGTTCGA GCAGATCGAA GACGAGCGCG TCGAGGCGCT CAACGAGAAG
CTGGAGGCGC GCGCCGCCGA GGCGGAGGAC GGCGACGAAG CGGACGAGGA GAGCGGCGAC
GACGGCGAGG CGGACGACGG CAGCGACGAA GCGGACGACG ACACCACTGA CGAACCCGAC
ATGACCGACA TCGAACCCCT CAGCGACGAC CGCATCAGCT TCGACGACTT CCAGGAACTG
GACATCCGGA TCGGCCGGAT CGAGGAGGCG GAGGGTATCG AGGGCGCCGA CGACCTCCTG
AAGCTCCGCG TCGACCTCGG CGCGGAGACC CGGACGATCG TCGCGGGGCT CAAACAACTC
CACGACGTGG ACGACCTGCC CGGAACGAAG GTCGTCGTGC TCGCGAACAT GGAGAAGGCG
GAGCTGTTCG GCGTCGAGTC GAACGGGATG GTGCTCGCCG CCGGCGAGGA GGCCGACCTC
CTCACCACCT ACGAGGACGC CGGGCCGGGG ACGAAGGTGA AGTAA
 
Protein sequence
MSHDDFPTDD PAVVTCGLPY ANGDLHIGHL RTYVGGDVYR RALERLGQET AFVSGSDMHG 
TPVAVNAEQE GVSPEQFALD WHEQYAATFP KFNVEFDNYG HTHDETNTAV TQQLVRDLDE
GGHLYEKEIM VAYDPVDDQF LPDRYVEGTC PYCGAHARGD ECDEGCQRHL EPGEVEDPES
TITGNPAEYR ERTHQFFEVS AFSEYLSGFL DRLEGTSNAR NQPREWIEQG LQDWCITRDM
DWGIDYPGEN PQDLVLYVWV DAPIEYISST KQYTERVGAD AFDWEAAWKE GASDAHPEGG
EIVHVIGRDI IQHHTIFWPA MLEATDHTEP RAVMASGFVT LGGKGFSTSR DRAVWADEYL
DEGFHPDPLR YYLATNGGFQ QDVDFSWEKF RDRVNTELVG TVGNFLYRSL LFAHRNYDDA
PIADATSDEV AERIEEAIAD FEAAVNDYSV RAVGDAVTDL ARFGNEYIQR SEPWKLVDDD
PEEAAQVIHD CVAIAKAIAV LFEPIAPEKT ERLWDQLGED GSVHETTVEA AREGPAGDLA
EPTELFEQIE DERVEALNEK LEARAAEAED GDEADEESGD DGEADDGSDE ADDDTTDEPD
MTDIEPLSDD RISFDDFQEL DIRIGRIEEA EGIEGADDLL KLRVDLGAET RTIVAGLKQL
HDVDDLPGTK VVVLANMEKA ELFGVESNGM VLAAGEEADL LTTYEDAGPG TKVK