Gene Information |
Locus tag | Mext_0343 |
Symbol | |
ID | 5832886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 388520 |
End bp | 390394 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641366128 |
Product | extracellular solute-binding protein |
Protein accession | YP_001637838 |
Protein GI | 163849795 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
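The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, but both agree with the values it lists. The function name `cds_lengths` is hypothetical and only serves this illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record for locus tag Mext_0343.
gene_len, protein_len = cds_lengths(388520, 390394)
print(gene_len, protein_len)
```

Under these assumptions the result matches the record's Gene Length (1875 bp) and Protein Length (624 aa).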
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG GCGCGACCCG CCGAAGCGTC GTCCTCGGCA CCGGGGTGAT AGCGCTTGCG GGCGCCCTGC CCCGTTCGGC CCGTGCCGAG GATGTCCACA AGGCTCATGG CGGGATTCAT GGCCTATCGA GCTTCGGCGA GCTGAAATAC GCGCCCGACT TCCCGAACTT CGACTACGTG AACCCGATGG CGCCGCGCGG CGGGCGCTTC TCGACAACTC TGGTCCAGAC CTTCGGCAAT CAGGCGTTCG ACACCTTCGA CACGCTCAAC CCCTACGTCT TCCGCGGCAA CGGTGCGGCC GGGATCAATC TCACCTTCGA CAGCCTGATG GTGCGCGCCC TCGACGAGCC GGACGCGCTC TACGGCCTCG TCGCCCGCTC GGTCGAGATC AGCCCCGACG GCCTGACCTA TCGCTTCGCC CTGCGTCCCC AGGCGCATTT CCACGACGGC TCACCCCTGA CCGCGCGGGA CGCGGCCTTC TCCCTCACCA TCCTCAAGGA GAAGGGGCAC CCGACGATCG CCCAGGTGAT CCGCGACGTC GCGGAGGCGA CGGCGGAGGG CGACGAGACC CTCGTCGTCC GCTTCGCCCC CGGCCGCAGC CGCGACCTGC CGCTGATCGT CGCCGGACTG CCGATCTTCT CGGCGAAGTT CTTCGAAGGG CGCGACTTCG AGGCCCAGAC TCTCAAGCCC CTGCTCGGCT CCGGCCCGTA TCAGGTCGGG CGGATCGATA TCGGCCGCTT CATCGAACTG GAGCGTGTGA CCGATTACTG GGCGGCGGAT CTTCCGGTGA TGGTCGGGCA AAACAACTTC GACCAGCTCC GCTACGAGTA TTTCCGCGAT CGGCAGGTCG CCTTCGAGGC GTTCAAAGGC GGCGCCTACA CCTATCGCCA GGAATTCACC TCGCGGATCT GGGCAACGGG CTACGACTTC CCCGCCGCGC GCGAGGGCCG CGTCAAGCGC GAGACCCTGC CCGACACCTC GCCCGCCGCC ATCCAGGGCT GGTTCTTCAA CACCCGCCGC GAGGTGTTCA AGGATCCGCG CGTGCGCGAG GCGATCGGCC TGTGCTTCGA CTTCCCCTGG ACCAACCGCA CAGCGATGTT CGGCTCCTAC GAGCGCACCG TCTCGTTCTT CCAGAAGACC GACCTGATGG CGACGGGCAA GCCCTCCGCG GAGGAGCTGG CCCTGCTCGA ACCCTTCAGC GGGCAGGTGC CGGCCGAGGT GTTCGGCGAG GCCTGGACGC CGCCGGTTCC GGACGGCTCG GGCCAGGACC GGGCGCTGCT CGCCCGCGCG GTGGCCCTGC TTAAGGAGGC GGGCTGCACC CGCGAGGGCG GCGCCTTGCG GCTGCCGAGC GGCAAGCCGA TCGAGTTCGA ATTCCTCGAT TCGGATTCCG TCTGGGAGCC GATCGTCCAG CCCTTCATCC GCAATCTCGG GTTGATCGGC ATCAAGGCGC GCCAGCGGGC GGTCGATGCC GCGCAGTATC AGGCGCGGGT GCGCGACTTC GACTTCGACA TCACCGCCCG CGCCGCCTCG GGCGACGCGA CGCCGGGGCC GGAGCTGCGC GAGGCCTATG GCTCCCGCGC GGCGGCGATC CCCGGCTCCA ACAACCTCGC CGGCATCACC GATCCGGTGA TCGACGCGCT GCTTGACCGC ATTGCCAACG CGGATTCGCG CGCGAGCCTC ACCGTGGCCT GCCGCGCCCT CGACCGGGTG ATGCGGGCCG GCCGCTACTG GATCCCGATG TGGTACTCGC CCGAGTACCG CCTCGCCCTG TGGGACATGT ACGGCCGCCC GGCGAAGCTG 
CCGACCTATG GGCTCGGCGT GCCGGGCCTG TGGTGGTACG ACGAGGCCAA GGCGCGCCGG ATCGGCCGGG GCTGA
|
Protein sequence | MSAGATRRSV VLGTGVIALA GALPRSARAE DVHKAHGGIH GLSSFGELKY APDFPNFDYV NPMAPRGGRF STTLVQTFGN QAFDTFDTLN PYVFRGNGAA GINLTFDSLM VRALDEPDAL YGLVARSVEI SPDGLTYRFA LRPQAHFHDG SPLTARDAAF SLTILKEKGH PTIAQVIRDV AEATAEGDET LVVRFAPGRS RDLPLIVAGL PIFSAKFFEG RDFEAQTLKP LLGSGPYQVG RIDIGRFIEL ERVTDYWAAD LPVMVGQNNF DQLRYEYFRD RQVAFEAFKG GAYTYRQEFT SRIWATGYDF PAAREGRVKR ETLPDTSPAA IQGWFFNTRR EVFKDPRVRE AIGLCFDFPW TNRTAMFGSY ERTVSFFQKT DLMATGKPSA EELALLEPFS GQVPAEVFGE AWTPPVPDGS GQDRALLARA VALLKEAGCT REGGALRLPS GKPIEFEFLD SDSVWEPIVQ PFIRNLGLIG IKARQRAVDA AQYQARVRDF DFDITARAAS GDATPGPELR EAYGSRAAAI PGSNNLAGIT DPVIDALLDR IANADSRASL TVACRALDRV MRAGRYWIPM WYSPEYRLAL WDMYGRPAKL PTYGLGVPGL WWYDEAKARR IGRG
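The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, with a GC-fraction helper that could be compared against the record's GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short fragment used here is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
fragment = "ATGAGCGCGG GCGCGACCCG"
print(len(clean_sequence(fragment)), gc_fraction(fragment))
```

Applying `gc_fraction` to the complete gene sequence, pasted as a single string, gives a value that can be checked against the reported GC content of 69%.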
|
| |