Gene Information |
Locus tag | Mext_4272 |
Symbol | |
ID | 5834328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4754476 |
End bp | 4756074 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641370063 |
Product | extracellular solute-binding protein |
Protein accession | YP_001641712 |
Protein GI | 163853669 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
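The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(4754476, 4756074)
print(length)                     # 1599
print(protein_length_aa(length))  # 532
```

Both results agree with the Gene Length (1599 bp) and Protein Length (532 aa) fields.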
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0380913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.650765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCC CCCGCCGCAC CCTCCTGCAG ACCGGCGCGG CGCTCGCCGC CGGGCTCGCC CTCCCCGCTC CCGCGCGGGC GGCCTCCCCC GTCTATCGCC GCGGCAACGA CGCCGACCCG GAGACGCTCG ATCCGCACAA AACCTCGACG GTGGCCGAGG CGCATCTCCT GCGCGACCTG TTCGAGGGGC TGCTGACCTA CGACAACCGC GGCACGATCA TTCCCGGCAT GGCCGAGCGT TGGACCGTCT CCGATGACCG CCTCACCTAC CGCTTCACCC TGCGGCCGGA CGGGCGCTGG TCGAACGGCG ATGCCGTGAC GGCCGACGAC TTCCTGTTCT CCCTGCGCCG CATCCTCGAT CTGAAGACGG CGGCGAAATA CGCCGAGGTG CTGTTCCCGA TCCGGGGGGC GGCCGCCGTC AATGCGGGTG AGCAACCGCC GGAGACGCTG GGGGTGACGG CCCCCGATCC CCGCACCCTG GAAATCGGGC TCGCCGAGCC GGTGCCCTAC CTCCTCGAAC TCCTGACGCA CCAGACCTCG CTGCCGGTCC ACCGCCCCTC GCTGGAGCGC TGGGGCGACG CCTTCGCGCG GCCCGGCAAC CTCGTCTCGA ACGGCCCCTA CGCCCTGGTC GATTGGGTGC CGAACGACCG CATCACCCTG ACGAAAAACC CGCATTACCG CGACGCCGCC GCGATCCCGA TCGAGCGGGT GGACGTCATC CCGACTCCCG ACCTCGCCGC GGCGGTGCGG CGCTATGCGG CCGGCGAGAT CGATTCCCTC TCGGACCTGC CCGCCGACCA GATCGCTTCG CTCAAGAGCC GCTTCGACCG CCAAGTGCAG CTCGGACCGG GGCTCGGCCT GCTCGCCATC GCCTTCAACC TGCGAAAGAA ACCCTTCGAC GACGCGCGGG TGCGCCGGGC CCTGTCGCTG GCCATCGACC GGGAATTTCT GGCCGAGATC GTCTGGGGGC AGACCATGGC CCCGGCCTAT TCCTTCTGCC CGCCCGGCCT CGACAACGCC CTGCCGCCCC CGCTCCTGCC GGGGCGCGAG GATGGGCCGA TCGACCGCGA GGAGGAGGCG TTGCGGCTGC TGGCAGAAGC CGGCTACGGG CCGGGCAACC CGCTGACGGT CGAGTATCGC TTCAACGTCA CCGACAACAA CCGCAACACG GCGATCGCGC TCGCGGATGC GTGGCGCGGC ATCGGCGTCG TGACCCGCTT CGTCTCCACC GACGCCAAGA CCCACTTCGC GTATCTCCGC GACGGCGGCC CCTTCGACCT CGCCCGGATG TCCTGGGTCG CCGACTATTC CGATCCGCAG AATTTTCTCT TTTTGCTCCG CACCGGCAAT GACGGGTTCA ATGCCGGGCG CTGGTCGAAC GCGCGCTTTG ACGAACTGCT GACGCGGGCG GCGCAGGAGC GCGACGTGCC GGCCCGCGCG CGGATGCTGT TCGACGCCGA AACCCTCGTG CTCGACGAAC TGCCCTGGGT GCCGCTGCTG CATTACCGCT CGAAGGCGCT CGTCTCGCCG CGGCTGCACG GGATGCACCC GAACATCCGC AACGTCGCCC CCACCCGCTA TCTCCGGCTC GATCCATGA
|
Protein sequence | MSLPRRTLLQ TGAALAAGLA LPAPARAASP VYRRGNDADP ETLDPHKTST VAEAHLLRDL FEGLLTYDNR GTIIPGMAER WTVSDDRLTY RFTLRPDGRW SNGDAVTADD FLFSLRRILD LKTAAKYAEV LFPIRGAAAV NAGEQPPETL GVTAPDPRTL EIGLAEPVPY LLELLTHQTS LPVHRPSLER WGDAFARPGN LVSNGPYALV DWVPNDRITL TKNPHYRDAA AIPIERVDVI PTPDLAAAVR RYAAGEIDSL SDLPADQIAS LKSRFDRQVQ LGPGLGLLAI AFNLRKKPFD DARVRRALSL AIDREFLAEI VWGQTMAPAY SFCPPGLDNA LPPPLLPGRE DGPIDREEEA LRLLAEAGYG PGNPLTVEYR FNVTDNNRNT AIALADAWRG IGVVTRFVST DAKTHFAYLR DGGPFDLARM SWVADYSDPQ NFLFLLRTGN DGFNAGRWSN ARFDELLTRA AQERDVPARA RMLFDAETLV LDELPWVPLL HYRSKALVSP RLHGMHPNIR NVAPTRYLRL DP
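The gene and protein sequences above are related by translation table 11, which can be illustrated with a short script. This is a minimal sketch using only the Python standard library, not the database's own tooling; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as starts; this gene begins with ATG, so alternative start codons are not handled.

```python
from itertools import product

# Amino-acid assignments for translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAGCCTCC CCCGCCGCAC CCTCCTGCAG"
print(translate(prefix))  # MSLPRRTLLQ
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.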
|
| |