Gene Mlg_0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0555 
SymbolvalS 
ID4270310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp601592 
End bp604345 
Gene Length2754 bp 
Protein Length917 aa 
Translation table11 
GC content67% 
IMG OID638125296 
Productvalyl-tRNA synthetase 
Protein accessionYP_741399 
Protein GI114319716 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00036551 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAAGA CCTACCAGCC GCAGGCCATC GAGGAAAAGT GGTACGCCTA CTGGGAAGAG 
CACGGGCATT TCGCCCCCTC CGGCGAGGGT GAGCCCTACT GCATCATGAT CCCGCCACCC
AATGTGACGG GCACGCTGCA CATGGGCCAC GCCTTCCAGG ACACCATCAT GGATGCGCTG
ATCCGCTACC AGCGCATGTG CGGCCGCAAC ACCCTCTGGC AGCCCGGCAC CGACCACGCC
GGCATCGCCA CCCAGATGGT GGTGGAGCGC CAGCTGGAGG CCGAGGGCAA GTCCCGCTTC
GATCTGGGCC GCGAGAAGTT CCTCGAGCGG GTCTGGCAAT GGAAGGCGGA ATCCGGCGGC
ACCATCACCC GCCAGCTGCG GCGCATGGGG GCGTCGGTGG ACTGGTCCCG CGAGCGCTTC
ACCATGGACG AGGGGCTGTC CCGTGCGGTG CGCGAGGTCT TCGTGCGGCT CTACGAGGAC
GGCCTGATCT ACCGGGGCAA GCGGCTGGTC AACTGGGATC CGGTGCTGCA CACCGCCGTC
TCCGACCTGG AGGTCACCAG CCAGGAGGAA CAGGGCCACA TCTGGCACAT GCGCTACCCG
ATCTCCGACG GCTCGGGCCA CGTGGTGGTG GCCACCACCC GGCCGGAGAC CATGCTCGGC
GACACCGCCG TGGCGGTGCA CCCGGAGGAC GAGCGGTTCA GGCACCTGGT GGGCAAGACG
GTGGACCTGC CGCTCACCGG CCGTCAGATC CCGGTGATCG CCGACGACTA CGTGGACCCG
GAATTCGGCT CCGGCTGCGT CAAGATCACC CCGGCCCACG ACTTCAACGA CTACGCCGTG
GGCGAGCGCC ACGACCTGGC GAAGATCAAC ATCCTCACCC CGGATGCCGC CATCAACGAG
CACGCGCCCA AGGCCTACCA GGGGCTGGAC CGGTTCGAGG CGCGCAAGCG CATCGTGGCC
GACTTGGATG AACTGGGCCT GCTGGAGAAG GTGGAGGACC ACACACTGAT GGTGCCGCGC
GGCGACCGCA GCGGCGCCGT GCTGGAGCCC TTCCTCACCG ACCAGTGGTA CGTCCGCGCC
GAACCGCTGG CCCGGCCGGC CATCGAGGCG GTGGAAGCGG GCCGCATCCG GTTTATCCCC
GGGAACTGGG ACCGCACCTA CTACGAGTGG ATGAATAACA TCCAGGACTG GTGCATCAGC
CGCCAGATCT GGTGGGGGCA CCGCATCCCC GCCTGGTACG ACGCCGAAGG CCATGTCTAC
GTGGGCCGCA CCGAGGAGGA GGTGCGCGCA CGGCACAACC TGGGCAACGT GCCGCTGCAC
CAGGACGAGG ACGTGCTGGA CACCTGGTTC AGCTCCGCGC TCTGGCCCTT CTCCACCCTG
GGCTGGCCCG ACGAAACCGA GGCGCTACGC ACCTTCTACC CCACCTCGGT GCTGGTCACC
GGCTTCGACA TCATCTTCTT CTGGGTCGCC CGGATGATCA TGATGGGCCT GCACTTCATG
GACGACGTGC CCTTCCGGGA GATCTACATC CACGGCCTGG TGCGCGACCC GGACGGGCAG
AAGATGTCCA AGTCCAAGGG CAACGTCCTC GACCCGCTGG ACATCATCGA GGGCATCGAG
CTGGAAGAGC TGGTGGCCAA GCGCACCAGC GGGCTGATGC AGCCGCAGAT GGCGAAGCGG
ATCGAAAAGG CCACCCGCAA GCAGTTCCCG GACGGCATCC CCGCCCACGG CACCGACGCC
CTGCGCTTCA CCTTCGCCTC ACTGGCCACC ACCGGCCGCG ATGTGGTCTT CGACCTGGGC
CGTGCTGAGG GCTACCGCAA TTTCTGCAAC AAGATCTGGA ACGCGGCCCG CTACGTGCTG
ATGAACACCG AGGGCCACGA CACCGGGGTG GCCGCCGAGG ACGTGAAGCT CACAGTGGCC
GACCGCTGGA TCATCTCGCA GCTCCAACGC ACCGAGCTGC GGGTGCGCGA GGCACTGGAC
GGTTACCGTC TGGACCTGGC CGCCCAGGCC ATCCACGAGT TCATCTGGGA CGAGTACTGC
GACTGGTACC TGGAGCTGTC CAAGCCGGTG CTCAACCAGT CCGACGACCC GGCCCTGCTG
CGCGGCACCC GCCGCACCCT GGTACGGGTG CTGGAGGCCG TCCTGCGGCT CACCCACCCG
ATCATGCCCT TCATCACCGA GGCCGTCTGG CAGCAAATCG CGCCGCTGGC GGGCAAGCAG
GGCGACAGCA TCATGACCCA GCCCTACCCG GTGGCCGAGG CCGGGCTGGT GGACAACGCC
GCCGAGGCGG ACATCGCCTG GATCAAGCAG TTCGTGCTGG GCGTGCGCCG CATCCGTGGC
GAGATGGATC TGTCACCGGC CAAGCCACTA CCGGTGCTGC TGCAGAACAC CGCCCCCGAG
GACCGCGCCC GGTTGGAGAA CTACGACGCC TTCCTCAAGA CCCTGGCCCG GCTGGAGTCC
ATCGAGGAGG TCACCGGCGA GGCGCCCGAA TCCGCCATCG CCCTGGTGGG CGACATGCAG
CTCCTGGTAC CCATGGCCGG CCTGATCGAC AAGGACGCCG AGCTGGCCCG CCTGGACAAG
GCCCGCCAGC GCCTGCAAAA GGAGGTGGCG CGCCTGGAGG GCAAACTGGG CAACGAGAAC
TTCATCACCA AGGCGCCGGA GGCGGTGGTC CAGAAGGAGC GCGACAAACT CGCCGACCAG
CGCTCCGCCC TGGAGAAGAT CGAGACCCAG CGCGAGAAGA TCGCCCGGGT CTGA
 
Protein sequence
MEKTYQPQAI EEKWYAYWEE HGHFAPSGEG EPYCIMIPPP NVTGTLHMGH AFQDTIMDAL 
IRYQRMCGRN TLWQPGTDHA GIATQMVVER QLEAEGKSRF DLGREKFLER VWQWKAESGG
TITRQLRRMG ASVDWSRERF TMDEGLSRAV REVFVRLYED GLIYRGKRLV NWDPVLHTAV
SDLEVTSQEE QGHIWHMRYP ISDGSGHVVV ATTRPETMLG DTAVAVHPED ERFRHLVGKT
VDLPLTGRQI PVIADDYVDP EFGSGCVKIT PAHDFNDYAV GERHDLAKIN ILTPDAAINE
HAPKAYQGLD RFEARKRIVA DLDELGLLEK VEDHTLMVPR GDRSGAVLEP FLTDQWYVRA
EPLARPAIEA VEAGRIRFIP GNWDRTYYEW MNNIQDWCIS RQIWWGHRIP AWYDAEGHVY
VGRTEEEVRA RHNLGNVPLH QDEDVLDTWF SSALWPFSTL GWPDETEALR TFYPTSVLVT
GFDIIFFWVA RMIMMGLHFM DDVPFREIYI HGLVRDPDGQ KMSKSKGNVL DPLDIIEGIE
LEELVAKRTS GLMQPQMAKR IEKATRKQFP DGIPAHGTDA LRFTFASLAT TGRDVVFDLG
RAEGYRNFCN KIWNAARYVL MNTEGHDTGV AAEDVKLTVA DRWIISQLQR TELRVREALD
GYRLDLAAQA IHEFIWDEYC DWYLELSKPV LNQSDDPALL RGTRRTLVRV LEAVLRLTHP
IMPFITEAVW QQIAPLAGKQ GDSIMTQPYP VAEAGLVDNA AEADIAWIKQ FVLGVRRIRG
EMDLSPAKPL PVLLQNTAPE DRARLENYDA FLKTLARLES IEEVTGEAPE SAIALVGDMQ
LLVPMAGLID KDAELARLDK ARQRLQKEVA RLEGKLGNEN FITKAPEAVV QKERDKLADQ
RSALEKIETQ REKIARV