Gene Information |
Locus tag | Mext_0342 |
Symbol | |
ID | 5832530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 386610 |
End bp | 388523 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641366127 |
Product | extracellular solute-binding protein |
Protein accession | YP_001637837 |
Protein GI | 163849794 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
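
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function and constant names are illustrative only.

```python
# Values copied from the record above.
START_BP = 386610
END_BP = 388523
GENE_LENGTH_BP = 1914
PROTEIN_LENGTH_AA = 637


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values with the fields listed in the record.
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.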
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGAAA ACGCCGTAAT TGCCCGCCGC AGCTTCCTCG CCGGAGCGCT CGTCCTCCTG GGACTGCGAG GCCCCGTCTC CGCTCAGGAG CCCGCCGAAT CCGCCTCGAA GGTCTCGGCG CAGGAGGCAC CCGGCAATTG GCGCCACGCG CTGACCCTGC TCGGCGAGCC GAAATACGGC CCGGACTTCC AGCATTTCGA CTATGCCGAC CCGAAGGCGC CGCTCGGCGG CCTGGTGCGG CTCGGGTCGC AGGGCGGCTT CGACAACCTC AACTTTATCG TCAACGGCCT GAAGGGCGAT CTCGAAGGCG GGATCACCCA GATCTACGAC ACGCTGATGG AGGATTCCGG CGACGAGCCG TTCTCCTCCT ACGGCCTCCT CGCCGAGGCC GTGCGGATCG CCGAGGACGG CCAGTCGGTG ACCTATCGCC TGCGGCCGAA CGCGCGCTGG CACGACGGCA AGCCGGTCAC CGTCGAGGAC GTGATCTGGT CCTTCGATAC GCTGAAGGCC AACAGCCCAT TCTATGCCGC CTACTACCAC ACCGTGGCCA AGGCCGAGCC GGGGGCCGAG CGCGAGGTCG TCTTCCGCTT CAGCGAGGCC GGCAACCGCG AACTGCCGCA GGTGCTGGGA CAGATGCAGG TGCTGCCCAA GCACTGGTGG ACCGGCACCG ACAAGAACGG CAAGCCGCGC AGCCCTACCG AGACGACGCT CGAGATCCCG CTCGGCTCCG GCCCGTATCG GTTGACCAAG GTCGATGCCG GCCGCTCCGC CGTCTACGAG CGCGTGGCCG ATTATTGGGG CAAGGATCTG CCGGTGAATC GCGGCCGCAA CAATTTCGGC ACGGTCCGCT TTGAGTACTT CCGCGATGGC TCCGTGCTGC TGGAGGCGCT GAAGGGCGAC CTCTACGACT TCCGCACCGA GAACATCGCC CGCAACTGGG CCACCGCCTA CGACGACTTC CCGGCGGTGA AGGAAGGGCG CCTCATCAAG GAGGAGTTCC CCGGCCGCGG CACCGGCATC ATGCAGGCCT TCGTGTTCAA CCTGCGCCGG GACAAGTTCA AGGACGAGCG CGTGCGCCGC GCCTTCAACC TCGCTCTGAA CTTCGAGGAG ATGAACCGGC AGCTCTTCTA CGGTCTCTAC AGCCGGATCG ATTCCTACTT CTACGGCTCG GACCTCGCCT CGTCCGGCCT GCCCGACGGG ATGGAGAAGG TGATCTTAGA GAGCGTGAAG GACAAGCTGC CGGGCAGCGT CTTCACCGAG ACCTACACCA ACCCCGTCAA CGACACGCCC GAAGCTGCGC GGGCCAACCT GCGCAAGGCG GTCGGCCTGC TGCGCGAGGC CGGCTACGAG CTGAAGGGCG GCAAGATGGT CGCCAAGGCG ACCGGCGAGC CGCTGACGGT CGAGTTCCTC GAATTCCAGA ACGTGTTCGA GCGGGTGATC CTGCCCTACG CGGCGCAGCT CAAGCTCATC GGCATCGAAT CCTCGATCCG GGTGATCGAT CAGGCCCAGT ACCAGAACCG CCTGCGCAGC TTCGACTTCG ACATCACCAC CTCGAACTGG CCGGAATCGC TCTCGCCCGG CAACGAGCAG CGCGAATTCT GGGGCTCGGC CGCCGCCGAC AAGCCGGGCT CGCGCAACAT CGCCGGGATC AAGGATGCGG GCATCGATGC GCTGATTGAG AAGGTGATCT TCGCGCAGGA CCGCGAGACG CTGGTCGCCG CCACGCACGC CCTCGACCGG GCGCTCCTCG CCCACAACTT CGTGGTGCCG CAATGGTCCT CGGCCGCGAC GCGGACGCTG 
CGTTGGAACC GCTTCGGCCG CCCGGCGGTG CTGCCGAAAT ACGGCTCTTC CGGCTTCCCG ACGACGTGGT GGTACGACGA GGCGCTCGCC GCCAAGACGG GGGCGCCGCG ATGA
|
Protein sequence | MTENAVIARR SFLAGALVLL GLRGPVSAQE PAESASKVSA QEAPGNWRHA LTLLGEPKYG PDFQHFDYAD PKAPLGGLVR LGSQGGFDNL NFIVNGLKGD LEGGITQIYD TLMEDSGDEP FSSYGLLAEA VRIAEDGQSV TYRLRPNARW HDGKPVTVED VIWSFDTLKA NSPFYAAYYH TVAKAEPGAE REVVFRFSEA GNRELPQVLG QMQVLPKHWW TGTDKNGKPR SPTETTLEIP LGSGPYRLTK VDAGRSAVYE RVADYWGKDL PVNRGRNNFG TVRFEYFRDG SVLLEALKGD LYDFRTENIA RNWATAYDDF PAVKEGRLIK EEFPGRGTGI MQAFVFNLRR DKFKDERVRR AFNLALNFEE MNRQLFYGLY SRIDSYFYGS DLASSGLPDG MEKVILESVK DKLPGSVFTE TYTNPVNDTP EAARANLRKA VGLLREAGYE LKGGKMVAKA TGEPLTVEFL EFQNVFERVI LPYAAQLKLI GIESSIRVID QAQYQNRLRS FDFDITTSNW PESLSPGNEQ REFWGSAAAD KPGSRNIAGI KDAGIDALIE KVIFAQDRET LVAATHALDR ALLAHNFVVP QWSSAATRTL RWNRFGRPAV LPKYGSSGFP TTWWYDEALA AKTGAPR
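
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, which shares the standard codon assignments but accepts GTG (among others) as an initiation codon that is read as methionine. The following is a minimal sketch that translates only the first 30 nucleotides of the gene sequence. It assumes the first codon of the annotated CDS is the initiator. The helper name `translate_cds` is hypothetical, and the start-codon set is limited to a few commonly listed table 11 initiators rather than the full list.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an accepted first codon as M and stopping at '*'."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence listed in the record.
first_30_nt = "GTGACGGAAA ACGCCGTAAT TGCCCGCCGC"
print(translate_cds(first_30_nt))
```

The result can be compared with the first ten residues of the protein sequence in the record, MTENAVIARR.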
|
| |