Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4214 |
Symbol | |
ID | 5833282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4688763 |
End bp | 4690721 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641370005 |
Product | hypothetical protein |
Protein accession | YP_001641654 |
Protein GI | 163853611 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0784859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.378864 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG ATCCGGTCAC CACGCGGCTC CTGCGGCAGG CGCTGCCGGT GGTGACGACG CTCGCCTCGA TGCTGCTGGT CGTGACGCTC GCCAGCGCGC TGTATTTCGG CCGCGACATC CTCGTGCCGG TGGCGCTCGC CATCCTGCTG AGCTTCGTCC TCGTGCCCGC CGTGCGGGCC CTGCGGCGCG TCCGCGTGCC GCGGGTGGCG GCGGTGCTGC TGGTGGTGGT GCTCGCGTTC GGCCTGCTCG GCGCGATCGG CAGCCTGATC GCCCGCGAGG CGGCGCAGCT CGCGACCGAT CTGCCGCGCT ACTCGCTGAC CCTGCGCGAC AAGATCACGG CCCTGCGCGC CGCCACCGCC GAGCGCGGAA GCCTGTCGGA CACCTTCTCC GGCTTCTTCG ACATGGCCGA GGAGATCGGC AAGGAGTTGC AGCCGCCCGC CGCCAAGACC GAGTCCGAGA CCAACGCGCC GCTCGGCACG GCGGAGCGGC CGATGCAGGT CGAGATCCAC GCTCCCCGCT CCGGGCTGCT GACGACGCTC GGCAGCGTAG CGGGCGGGGT GCTGCACCCG CTGGCGACGC TCGGCCTGAT CCTCCTGTTC ACGATCTTCA TCCTGCTCCA GCGCGAGGAT CTGCGAAACC GCGCGATCCG GCTGGCGGGG TCGAGCGATC TGCGCCGCAC CACCGCGGCG ATCGACGACG CCACGAGCCG GCTCAGCCGG TTCTTCGTGG CGCAGCTCAT CCTCAACGTC GCCTTCGGCC TCGTGATCGG CGTGGGCCTG TGGTTCATCG GCGTGCCGAG CCCGATCCTG TTCGGGGTCA TCGCCGGCAT CTCGCGCTTC GTGCCCTATG TCGGCGCGGT GCTCTCGGCG GCGCTGCCGC TCGCCATCGC CGTGGCCGTC GATCCCGGCT GGTCGATGGC GATCCAAGTG GCGATCCTGT TCGTCGTGAT CGAGCCGATC GCGGGTCACG TGGTCGAACC GCTGCTCTAC GGGCATTCGA CCGGCATCTC GCCGATTGCG GTGATCCTCG CGGCGACGAT CTGGACCTTC CTGTGGGGGC CGATCGGCCT CCTGCTCGCC ACCCCGCTGA CGGTCTGCCT CGTGGTACTC GGCCGCCATG TCGAGCGGCT CTGGTTCCTC GACGTGATCC TCGGCGACCG CCCGGCGCTC GGCCCCCAGG AGATCTTCTA CCAGCGCATG CTGGCGGGCG ATCCGGCCGA GGCGGTCGAT CAGGGGCGGC TGTTCCTGAA GGAGCGGGCG CTCGTGACCT ATTACGACGA GGTCGTGCTT CCGGGCCTAC GCATGGCCCA GGAGGATGCG GCGCGCGGCA TGCTCGACCG GGAGCGGCAG GGCGAGGTCG GCGCCGGCTT CCGCACGGTG GTCGAGGCGC TGGGGCTGGC GCGCAAGCGC GGCATGCCGC GGCGCTCGCG CAAGCCCATC GGTGCCGAGG CCGAGGCCGC CTTCGCCGCG GTGGGGCCGG ACCGCCACAC CGCCGGCATC GTGCTCGGCC CGGACGACTT GGCTTCGGCC TGGCGCGGCG AGGCGCCGGT CCTGTGCGTG CCGAGCGGAG GCGCCTTCGA CGAGGCCGCG ACGCTGATGC TCGCTCAGAC ACTCTCCCGC CACGGGCTCG GAGCGCGGGT GGCGCCGGGC GACGCGTTGC GGAACGGTCT GCCCGAGGGC GAGCGCCCGG CGATGATCTG CTTCTCCTAT CTCGACCCGA TCAGCCTGTC GCAGATCCGC CTGACCATCC GCCGCGCCCG CAAGGCCGCG CCGGGGGTGA CGATCCTCGT CGGCTTCTGG CGCGAGCGCG ACCCGGCCTC CCTCGGCCGT CTGCGCCGGG CGATCTCCGC CGACCTGCTC GTGACTTCGC TGAGCGACGC CCTCGACGCC GCATTGGCGC GGGCGCGGGC CGGGACAGCC GCTCCGGCCT CGCCCCGGCT TCGGCAGGCG GCGGAGTAG
|
Protein sequence | MTTDPVTTRL LRQALPVVTT LASMLLVVTL ASALYFGRDI LVPVALAILL SFVLVPAVRA LRRVRVPRVA AVLLVVVLAF GLLGAIGSLI AREAAQLATD LPRYSLTLRD KITALRAATA ERGSLSDTFS GFFDMAEEIG KELQPPAAKT ESETNAPLGT AERPMQVEIH APRSGLLTTL GSVAGGVLHP LATLGLILLF TIFILLQRED LRNRAIRLAG SSDLRRTTAA IDDATSRLSR FFVAQLILNV AFGLVIGVGL WFIGVPSPIL FGVIAGISRF VPYVGAVLSA ALPLAIAVAV DPGWSMAIQV AILFVVIEPI AGHVVEPLLY GHSTGISPIA VILAATIWTF LWGPIGLLLA TPLTVCLVVL GRHVERLWFL DVILGDRPAL GPQEIFYQRM LAGDPAEAVD QGRLFLKERA LVTYYDEVVL PGLRMAQEDA ARGMLDRERQ GEVGAGFRTV VEALGLARKR GMPRRSRKPI GAEAEAAFAA VGPDRHTAGI VLGPDDLASA WRGEAPVLCV PSGGAFDEAA TLMLAQTLSR HGLGARVAPG DALRNGLPEG ERPAMICFSY LDPISLSQIR LTIRRARKAA PGVTILVGFW RERDPASLGR LRRAISADLL VTSLSDALDA ALARARAGTA APASPRLRQA AE
|
| |