Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_5159 |
Symbol | |
ID | 7116197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 5527100 |
End bp | 5529820 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643527852 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002423851 |
Protein GI | 218533035 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.174874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGACA AGACCTTCGA CCCCGCCGCC GTCGAGGCGC GCGTCTCCGC GGCCTGGGAA GAGGGCCAGG CCTTCCGCGC CGGCCGCCCC GAGCGGGCCG GCGCCGAGCC CTTCAGCATC GTGATCCCGC CGCCGAACGT GACGGGCTCG CTGCATATGG GCCACGCGCT CAACAACACG ATCCAGGACA TCCTCGTCCG CTTCGAGCGG ATGCGGGGCA AGGACGTGCT GTGGCAGCCG GGCACGGACC ACGCCGGCAT CGCCACGCAG ATGGTGGTCG AGCGCAAGCT GATGGAGACG CAGCAGCCGG GCCGCCGCGA GTTGGGACGC GAGGAATTCC TGCGCCGGGT CTGGGCCTGG AAGGAAGAAT CCGGCGGCAC GATCATCGGC CAGCTCAAGC GCCTCGGTGC CTCCTGCGAC TGGTCGCGGG AGCGCTTCAC CATGGATGAG GGCCTGAGCC GCGCCGTGCT CAAGACCTTC GTCGATCTGC ACGCGCAGGG GCTGATCTAC CGCGACAAGC GCCTCGTGAA CTGGGACCCG AAATTCCAGA CCGCGATCTC GGACCTCGAA GTGCAGCAGA TTGAGGTGAA GGGCCATCTC TGGCACTTCG ACTACCCCGT CGTGGACGAG GCCGGCACGC CGACGGGCGC GATCATCACC GTGGCGACGA CGCGCCCCGA GACGATGCTC GGCGATACGG CCGTCGCCGT CCATCCGGAT GACGAGCGCT ACCGCGACCT TGTCGGAAAA CGCGTCCGCC TTCCGCTGGT CAACCGGCTG ATCCCGATCG TCGCCGACGC CTATTCCGAT CCGGAGAAGG GGACCGGCGC GGTCAAGATC ACCCCGGCCC ACGACTTCAA CGACTTCGAG GTCGGGCGCC GCAACGGATG CCGCCCGATC AACGTCCTCG ATGCCGAGGC CCGCATCCAG ATCGCCGGCA ACGCCGATTT CCTCGACGGC GCCGAGCCGG AGGACGCCGC GCTGGCGCTC GACGGCCTCG ACCGGTTCGA GGCCCGCAAG CGCGTCGTCT CTCTGATGGA GGAGCGCGGC CTGCTGCGTC TGGTCGAGCC CAATACCCAC GCGGTGCCGC ATGGCGACCG TTCGGGCGTG GTGATCGAGC CGTACCTGAC CGACCAGTGG TACGTGAACG TCAAGCCGCT GGCCGAGCGC GCCCTCCAGG CAGTGCGCGA CGGGCAGACC CGCTTCGTCC CGGACAACTA CGAAAAGATC TTCTTCCAGT GGCTGGAGAA CATCGAGCCG TGGTGCGTCT CGCGGCAGCT CTGGTGGGGC CACCAGATCC CGGTCTGGTA CGATGCGGAG GGCGGCATCT TCGTCACCGA GAGCGAGGCG GACGCGGTCG CGCAGGCGAA AGCCAAGCAC GGCCGTGAGG TCGCGCTGAC CCGCGATCCC GACGTGCTCG ACACGTGGTT CTCCTCCGCG CTCTGGCCGT TCTCGACGCT CGGCTGGCCG GACAAGACGC CGGAGCTGGC CCGCTTCTAC CCCACCAACA CCCTGGTCAC GGGTAAGGAC ATCATCTTCT TCTGGGTCGC CCGGATGATG ATGATGGGCC TGCACCTCAC CGATCAGGCG CCGTTCGAGA CCGTCTACCT GCACACCCTC GTCCGCGACG AGAAGGGTGC GAAGATGTCG AAGTCGAAGG GCAACGTGGT CGACCCGGTC GATCTCATCG ACCGTTTCGG CGCCGACGCG CTGCGCTTCA CGCTCGCCGC GCTCGCCGCC CCCGGCCGCG ACATCAAACT CGGGCCGCAG CGGGTCGAGG GCTACCGTAA CTTCGCGACC AAGCTCTGGA ACGCGGCGCG CTTCGCCGAG ATGAACGGCT GCGAACTGAA GGCCGATTTC CGGCCCGAGG CTGTGCGCGA GACGCTCAAC GCCTGGGCGC TCACCGAGGC CGCCAAAGCG GTGACGGAGG TGGCGCAGGG CATCACGGTC TACCGCTTCA ACGATGCGGC GGCCGCGGCC TACCGCTTCG TCTGGAACGT GTTCTGCGAC TGGTATCTCG AACTCGCCAA ACCCGTGCTT CAGGGCGAGG GCGTCGATCC GGCGGCACGC GCCGAGACGC AGGCGACGGT CGCCTTCCTG ATCGACCAGA TCGCCAAGCT GCTGCACCCG TTCATGCCCT TCCTCACGGA GGAGCTGTGG GCGATCAAGG GGCAGGTGCT GCCGACACCG CGCGGCCTGC TCGCGCTCGA ATCCTGGCCC GAACTCTCGG CCTATACGAA CAAGCAGGCC GAGGAAGAGA TCGGCTGGCT GGTCGATCTG ATCTCCGAGG TCCGCTCGGC CCGCTCCGAG ACCAACGTGC CCGCCGGCGC CCAGGTGCCG CTGGTGCTGG TGGGCGCCGA TGAGGGCGTC CGCGCTCGGG TCGAGCGCTG GAGCGAGACG CTGACCCGCC TCGCCCGTCT CTCCGAGATC GGTTTCGCCG ACGCCGCGCC GAAGAACGCC GTCCAGCTCC TCGTGCGGGG CAGCGTGGCG GCCCTTCCGC TCGAAGGCAT CGTCGATCTC GCGGCCGAGG TCGCGCGGCT GAAGAAGGAA GCGGGCAAGG CGCGGGCCGA GATCGGCAAG ATCGACGGCA AGCTCGGCAA CGCCGACTTC CTCGCCCGCG CGCCGGAAGA GGTGGTGGAC GAGCAGCGCG AGCGCCGCGA CGCGGAGGCG GCCCGGCTCG TCAAGTTCGA GGAAGCGTTG GTCCGGCTCA GCGAGGCATG A
|
Protein sequence | MMDKTFDPAA VEARVSAAWE EGQAFRAGRP ERAGAEPFSI VIPPPNVTGS LHMGHALNNT IQDILVRFER MRGKDVLWQP GTDHAGIATQ MVVERKLMET QQPGRRELGR EEFLRRVWAW KEESGGTIIG QLKRLGASCD WSRERFTMDE GLSRAVLKTF VDLHAQGLIY RDKRLVNWDP KFQTAISDLE VQQIEVKGHL WHFDYPVVDE AGTPTGAIIT VATTRPETML GDTAVAVHPD DERYRDLVGK RVRLPLVNRL IPIVADAYSD PEKGTGAVKI TPAHDFNDFE VGRRNGCRPI NVLDAEARIQ IAGNADFLDG AEPEDAALAL DGLDRFEARK RVVSLMEERG LLRLVEPNTH AVPHGDRSGV VIEPYLTDQW YVNVKPLAER ALQAVRDGQT RFVPDNYEKI FFQWLENIEP WCVSRQLWWG HQIPVWYDAE GGIFVTESEA DAVAQAKAKH GREVALTRDP DVLDTWFSSA LWPFSTLGWP DKTPELARFY PTNTLVTGKD IIFFWVARMM MMGLHLTDQA PFETVYLHTL VRDEKGAKMS KSKGNVVDPV DLIDRFGADA LRFTLAALAA PGRDIKLGPQ RVEGYRNFAT KLWNAARFAE MNGCELKADF RPEAVRETLN AWALTEAAKA VTEVAQGITV YRFNDAAAAA YRFVWNVFCD WYLELAKPVL QGEGVDPAAR AETQATVAFL IDQIAKLLHP FMPFLTEELW AIKGQVLPTP RGLLALESWP ELSAYTNKQA EEEIGWLVDL ISEVRSARSE TNVPAGAQVP LVLVGADEGV RARVERWSET LTRLARLSEI GFADAAPKNA VQLLVRGSVA ALPLEGIVDL AAEVARLKKE AGKARAEIGK IDGKLGNADF LARAPEEVVD EQRERRDAEA ARLVKFEEAL VRLSEA
|
| |