Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3046 |
Symbol | |
ID | 4784968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3237556 |
End bp | 3240396 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640091617 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001022234 |
Protein GI | 124268230 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.490165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG ACCAGCCCAC CGACTACCGC GCCACGCTGA ACCTGCCCGA CACGCCCTTC CCGATGCGCG GCGACCTGCC CAAGCGCGAA CCGGGCTGGG TGAAGCAGTG GGAGCAGCAG GGCACCTACC AGCGCCTGCG CGACGCCCGC GTCGGCCGGC CGCGCTTCGT GCTGCACGAC GGCCCGCCCT ACGCCAACGG CCAGCTGCAC ATCGGCCACG CGCTGAACAA GGTGCTGAAG GACATGATCG TCAAGGCGCG TCAGCTGGCC GGCTACGACG CGCTCTACGT GCCGGGCTGG GACTGCCACG GCCTGCCGAT CGAGAACCAG ATCGAGAAGC TGCACGGCCG CGGCCTGGGG CGCGACGAGG TGCAGGCGAA GAGCCGTGCC TATGCCACCG AGCAGATCGA GCAGCAGCGG GCCGACTTCA AGCGCCTGGG CGTGCTCGGC GCCTGGGACC AGCCCTACCG CACGATGGAC TTCGGCAACG AGGCCGGCGA GATCCGCGCG TTGAAGCGCG TGATGGAGCG CGGCTTCGTC TACCGCGGCC TGAAGCCGGT GTACTGGTGC TTCGACTGCG GCAGCTCGCT GGCCGAGTTC GAGATCGAGT ACGAGAACAA GTCCTCGCCG ACGGTGGACG TGGCCTTCCT GGCCGCCGAG CCGCAGAAGC TCGCCGCCGC CTTCGGCCTG CCGGCGCTGG CCAAGGACGC GTTCGCGGTG ATCTGGACCA CGACGCCATG GACCCTTCCC GCCAACCAGG CGCTGAACCT CAACCCCGAG CTCGAGTACG CGCTGGTCGA CACCGAGCGC GGCCTGCTGC TGCTGGCCAA TGCGCTGGTG GAGAAGTGCC TGGCGCGCTA CGGCCTGGCC GGCACGGTGC TCGCCACGAC CGCCGGCCAG GCGCTCGAGG GCCTGGAGTT CCACCATCCG CTGGCCCACG TGCACCCGGG CTACGCGCGC CGCAGCCCGG TCTACCTGGC CGACTACGCC ACCGCCGAGG ACGGCACCGG CATCGTCCAC TCGGCGCCGG CCTACGGCGT GGAGGACTTC AACTCCTGCA TCGCGCACGG GATGAAGCAC GACGAGATCC TGAACCCCGT GCAGGGCAAT GGCGTGTACG CGGCCGAGCT GCCGCTGTTC GGCGGCCAGT TCATCTGGAA GGCCAACCCG CTGATCGTGC AGGCGCTGCA GGATGCCGGC CGGCTGATGG CGACCGCAAA GCTCGAGCAC AGCTACCCGC ACTGCTGGCG CCACAAGACG CCGGTGATCT ACCGCGCCGC GGCGCAGTGG TTCGTGCGCA TGGACGAGGG CGAGGGCGTG TTCACCGTCG ACAAGGCGCC GAAGACGCTG CGCCAGACCG CGCTGGCGGC GATCGACGCC ACCGCCTTCT ACCCCGAGAA CGGCCGCGCC CGACTGCGCG ACATGATCGC CAACCGGCCC GACTGGTGCA TCAGCCGCCA GCGCAACTGG GGCGTGCCGC TGCCTTTCTT CCTGCACAAG GTGAGCGGCG AGCTGCACCC CGACACGCTG GCGCTGATGG ACCGCGCCGC CGCGCTGGTG GCGCAGGGCG GCGTGGAGGC CTGGTCGCGG CTCGACCCGC GCGAGTGGCT GGGCGAGGCA GCCGGCGATT ACGCCAAGAG CACCGACATC CTCGATGTGT GGTTCGACTC CGGCTCGACC TTCTTCCACG TGCTGCGCGG CAGCCATGCC GGCGCCGGCC GCGACGACGG CGGGCCCGAG GCCGACCTCT ACCTCGAGGG CCACGACCAG CACCGCGGCT GGTTCCACAG CTCGCTGCTG ATCGCCTGCG CGATCGAGGG CCGTGCGCCC TACCGCGGCC TGCTGACGCA CGGTTTCGCG ACCGACGGCC AGGGCCGCAA GATGAGCAAA TCGCTCGGCA ACACCGTGGT GCCGCAGTCG GTGAGCGAGA AGCTGGGTGC CGAGATCATC CGGCTGTGGG TCGCCAGCAC CGACTACTCG GGCGACCTGA ACATCGACGA CAAGATCCTC GCACGCGTGG TCGACGCCTA CCGGCGCATC CGCAACACGC TGCGCTTCCT GCTCGCCAAC ACCAGCGACT TCGACCCGGC GACCGACGCG GTGCCGGACG AGCAGTTGCT GGAGATCGAC CGCTACGCGA TCGACCGCGC GGCGCAGCTG CAGGCCGAGA TCCTGGCGCA CTACGAGGTC TACGAATTCC ACCCGGTGGT CGCGAAACTG CAGGTCTACT GCAGCGAAGA CCTCGGTGCG TTCTACCTCG ACGTGCTGAA GGACCGGCTC TACACCACCG CCCCGAAATC GCTGGCGCGG CGCAGCGCGC AGACCGCGCT GCACCGCATC ACCGGCGCGA TGCTGCGCTG GATGGCGCCG TTCCTGAGCT TCACCGCCGA GGAGGCCTGG CCGATCTTCG CGCCGGGCGT GTCGCCGTCG ATCTTCACGC AGACCTATAC CCCCTTCGCG CCCCCCGATG CCGCGCGCCT GGACAAGTGG GCCCGCGTGC GCGAGATCCG CGATGCCGTC AACAAGGAGA TCGAGGCCGT CCGCACCGCC GGCGCGGTGG GCGCCTCGCT GCAGGCCACG GTGGCGGTCG GCGCGCCGGC CGACGACCTG GCGCTGCTGC AGTCACTGGG CGAGGACCTG AAGTTCGTGT TCATCACCTC GGCCGCCACC GCGGCGGCGG CCGACGCGCT GACGGTCGCG GTCACGCCGA GCAGCGCCGC CAAGTGCGAA CGCTGCTGGC ACTACCGCGA CGACGTCGGC GCCGACCCGG CCCACCCGAC GATCTGCGGC CGCTGCACCA ACAATCTCTA CGGTGCCGGC GAAAGCCGCA CGGTGGCCTG A
|
Protein sequence | MSTDQPTDYR ATLNLPDTPF PMRGDLPKRE PGWVKQWEQQ GTYQRLRDAR VGRPRFVLHD GPPYANGQLH IGHALNKVLK DMIVKARQLA GYDALYVPGW DCHGLPIENQ IEKLHGRGLG RDEVQAKSRA YATEQIEQQR ADFKRLGVLG AWDQPYRTMD FGNEAGEIRA LKRVMERGFV YRGLKPVYWC FDCGSSLAEF EIEYENKSSP TVDVAFLAAE PQKLAAAFGL PALAKDAFAV IWTTTPWTLP ANQALNLNPE LEYALVDTER GLLLLANALV EKCLARYGLA GTVLATTAGQ ALEGLEFHHP LAHVHPGYAR RSPVYLADYA TAEDGTGIVH SAPAYGVEDF NSCIAHGMKH DEILNPVQGN GVYAAELPLF GGQFIWKANP LIVQALQDAG RLMATAKLEH SYPHCWRHKT PVIYRAAAQW FVRMDEGEGV FTVDKAPKTL RQTALAAIDA TAFYPENGRA RLRDMIANRP DWCISRQRNW GVPLPFFLHK VSGELHPDTL ALMDRAAALV AQGGVEAWSR LDPREWLGEA AGDYAKSTDI LDVWFDSGST FFHVLRGSHA GAGRDDGGPE ADLYLEGHDQ HRGWFHSSLL IACAIEGRAP YRGLLTHGFA TDGQGRKMSK SLGNTVVPQS VSEKLGAEII RLWVASTDYS GDLNIDDKIL ARVVDAYRRI RNTLRFLLAN TSDFDPATDA VPDEQLLEID RYAIDRAAQL QAEILAHYEV YEFHPVVAKL QVYCSEDLGA FYLDVLKDRL YTTAPKSLAR RSAQTALHRI TGAMLRWMAP FLSFTAEEAW PIFAPGVSPS IFTQTYTPFA PPDAARLDKW ARVREIRDAV NKEIEAVRTA GAVGASLQAT VAVGAPADDL ALLQSLGEDL KFVFITSAAT AAAADALTVA VTPSSAAKCE RCWHYRDDVG ADPAHPTICG RCTNNLYGAG ESRTVA
|
| |