Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0617 |
Symbol | |
ID | 5835003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 679350 |
End bp | 682025 |
Gene Length | 2676 bp |
Protein Length | 891 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641366400 |
Product | hypothetical protein |
Protein accession | YP_001638102 |
Protein GI | 163850059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.113619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.189106 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA TCGATGAGCA GGCCGCTGCG GCCGTCTCCG ACGGCGAGGG CGCAGCGCCC CTGCTGCCGG CGGCTTCACG CTATGCCGGG GGCGCCGGGC CGGGCGGACG CGAGGGGCTG CACCGCATCG CACCCAACAA CCGCTTCGCG CGGGTGGGCT TGAGCTTTTC CGATCTTTCG CCCGGAACCT GGGCCTGCCT GCGATTCGGT CTTGCCTTCG ACGCGGACGA GGCGGCGGCT CTGGCCGTCG ATGCCGTCGC GGTCGGCTTC GACTTTCTAA CGGCGGACGG CTCCAGCCTC GATCTCGACC ACGTGCCGGG GCTGGCCCGC AGCCTGCTCG ATCCGCACGT CGCCTGGATT CCGGGGCCGG CGCTTCTCGG TGCGGAGCCG TTCCGCATCG CCTTCCGCGT GCCCGATCAG GCGCGCGGCG TTAGCGTCAC CCTGCGCTCG TGGCGCAACA CCGGCGACGT CGCCGTGGCC GACCCTGTCC TTCGTCTCGG CGCGCCCGAT CCAGCGCCGG CGCCGCGTCG GCGCCCGCTG TCCGGCGAGG GGCTGCGATT CGACTACGCG CTGGCCGAGG ACATCGCCCT GGTGCTGCGC GGCCAGATCT ACGCCGCGCG GGCCGACGAG CACGCCGCGC GGGTGCGCCT CGTCTACCGT GATGCGGCAG GAGCGGAGAT CGCGCCGCCC TACGCCAACG CCGTCTCCGT GCCGGGCCTC GGTGCGGTCA TCAACCTCTC GGCCCAGCCG CAGGCGCGCC GCTTCACCCT GACCCTGCGT CCGCCACCCG GTGCCGCAGC GGTCGCGCTC GATTTCGGTG CGTGGGAGGA TGCGCAGACC GCCTCGGAGA GCGAGCTGAT CGGCCAGCCC GAACTCGCGC TCGAAGACGA TTTCCGCCTC GAAAGCCTTT GCGACGACGA CGTCCTCGAT GCGCCGGCCT TCCTCGCCCG CCTCGCCGAC CGGCTCGGCC TGCCCGAAGG CGTGCCCGCG TCGTGGCGGG CGGGGTCCGA GACGGGCGCC GCGCCGATCC TCGCGCGGGC CCGCGACTTG CGCAACGGGC CGGATCGCAG TGCGCGGATC GAGGGGGAGA GTGTCGTCCT GGCTCTGGCC GGTCTGCCCG ATTGGACGTT GCCGCAAACG CCGGATTGGC GCGAGGATCC GTTCCGCTCG GTGGCGTGGC GCCTCGCCTA TCAGTCGCTG GCTTGGCTGC TGCCGCTCGC CGCGCTTCCG GACGGAGCGG CCCGCGCCCG GACCATCGCC GCGGCGTGGT CGCGGGCCAA TCCGTGGAGC CAGCCGGCGG ACGGCCTGAG TCTGCACCCG GCGGCTCTGG CCCCCCGCGG CGACGTACTG GCGAGCCTGC TTACAGGCGG CGACGAGCCC ACCCGAGACG AGCCCGCCCG CGCCGAAATC GCCGCGGAGG CCGCACGGCA GGGCTTCGCG CTCGCCGAGA TCGTTGGGCA GAACACCCTG GCGCGGGCCC TGCACGGCAT CCAGGCGGCG GCGGCGCTGC TTGCCGTCGC ACGGGCGCTG CCCGGCTTCG CCTTCGCCCC GTACTGGGAT TCCCTGGCGC GCGACAGCCT CACCCACGGC TTCGATTCGC TGTTGTCGGA GGCGGGCGCG TTCACCGACG CCGCCCCGGT GCGGCGCCTC GATCTCCTGA GCCACGGACA GGCGATCGCC GAGGCGCTCG GCGAGACCGA GCCCGGCCCG ACGATCCGCC GCCGTGTCGA GGCGGCTTTG CTGGGCCTCG CCGGGCTGAT CGATCGTGGC GGACGGCTGC CACCCTTCGG CGATGCGCCC GCCGGCCTCG ACCATTCCGC CTGGATCGGC CGGCTGTCCT CGCGGGGCCA TTCTCAGGGT CACGACTTCG TCGCGGAGCG CGACGCGGCG CCCGTTGCCG AGGGCCGCAC CGCCGGCATG CTTGCGCTCC GCTACGATGG GATGGAGCAG GGCTGGGGCC ATTTCGCCCT CACCCATGCC GCGCAATCAC CGCACGGGCA TCGCGACTGC ACCTCCTTCA CCTTTGCCAC CGGCTCGCGG CGCTGGATCG TGGAGGGGGG TGGCGCCGAG GGCGTCGAGG TCGGCCCGGC GCGCCATTAC CTGCTCTCGC CCCGCGCCCA CAATGTCGCC ATCCCCGATG GCTGCGAGCC GGTCGCCGGT TCGGGGACGC TAAGCGCGCG CGTCACGCTC GACGATGCCG AGGCCCTGAC GCTCGCCACC ACCGTGCATG GGCCCGACTA CGCCCATGCG CGCCTCTTCC TCGTGCTCGA CGACCTGAGC GGGATGGCGG TGATCGACCG ATTCGTGCGG GTGGGGCCGT CGCTGTCGTT CGAGGGGCTG CTGCATCTGC CCCCCGATAC CCTCGTCGCG CTCTCCAGCC CCCGCCGGGC GCTCGCCCAG CAGGAGGGAC GGCGGCTCGA ATTCTTCGCG ATTCCGCTGA AGGGTCAGGC GGCCGGATTG GAGGTCGCTA TCGGTCGCAG CGACCGGCCG CACGCGATGC AGGGGTTTTG CGCGACGGGA TCGGGCGGGC TGAGGCCGGC GCCGGTGCTG CGCTACGCCT TCACCGGCCG CGACACGGTC TGCGGCGGTG TCGTCATCGC CGCGGATGCG GGGGCGGAGC AGCGGCTGGT GCGGCTGCTC GCCGACGACG CGGTGCGGCG TTGGGTGGAG GGATGA
|
Protein sequence | MDQIDEQAAA AVSDGEGAAP LLPAASRYAG GAGPGGREGL HRIAPNNRFA RVGLSFSDLS PGTWACLRFG LAFDADEAAA LAVDAVAVGF DFLTADGSSL DLDHVPGLAR SLLDPHVAWI PGPALLGAEP FRIAFRVPDQ ARGVSVTLRS WRNTGDVAVA DPVLRLGAPD PAPAPRRRPL SGEGLRFDYA LAEDIALVLR GQIYAARADE HAARVRLVYR DAAGAEIAPP YANAVSVPGL GAVINLSAQP QARRFTLTLR PPPGAAAVAL DFGAWEDAQT ASESELIGQP ELALEDDFRL ESLCDDDVLD APAFLARLAD RLGLPEGVPA SWRAGSETGA APILARARDL RNGPDRSARI EGESVVLALA GLPDWTLPQT PDWREDPFRS VAWRLAYQSL AWLLPLAALP DGAARARTIA AAWSRANPWS QPADGLSLHP AALAPRGDVL ASLLTGGDEP TRDEPARAEI AAEAARQGFA LAEIVGQNTL ARALHGIQAA AALLAVARAL PGFAFAPYWD SLARDSLTHG FDSLLSEAGA FTDAAPVRRL DLLSHGQAIA EALGETEPGP TIRRRVEAAL LGLAGLIDRG GRLPPFGDAP AGLDHSAWIG RLSSRGHSQG HDFVAERDAA PVAEGRTAGM LALRYDGMEQ GWGHFALTHA AQSPHGHRDC TSFTFATGSR RWIVEGGGAE GVEVGPARHY LLSPRAHNVA IPDGCEPVAG SGTLSARVTL DDAEALTLAT TVHGPDYAHA RLFLVLDDLS GMAVIDRFVR VGPSLSFEGL LHLPPDTLVA LSSPRRALAQ QEGRRLEFFA IPLKGQAAGL EVAIGRSDRP HAMQGFCATG SGGLRPAPVL RYAFTGRDTV CGGVVIAADA GAEQRLVRLL ADDAVRRWVE G
|
| |