Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3756 |
Symbol | |
ID | 5833256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4163581 |
End bp | 4165203 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641369546 |
Product | extracellular solute-binding protein |
Protein accession | YP_001641201 |
Protein GI | 163853158 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0607711 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGTTTT TGACCACTTT GATGGATGCG TGGGAACTCG GGAGCCTCAG CATGGCCGGT ACAATGGACC GCAGACGGTT TCTTCAGGCG AGCGCCGCCG CTGTGGGTTT TGCCCAGATC AATCCCGACT TCCTGGTCTC CTCGGCCTTC GCCCAATCGG GCAAGCCGCT GGTCTTCCTC TCGGCCGAGA ACATCACCGG CAACTGGGAC CCGACCGCCC ACACCACGCT CTCGCAGAAG AACATCGAGG GCTTCGTGAT GGGCTTCCTG ACCCGCACGC CGATGACCCT CGATGACCCC GGCAAGGTGG TCTACGAGCT CGCCACCGAC ATCCGGTTGC TCGATCCGCA CCGCCTGCAG ATCAAGCTGC GCAAGGGCGT GCAGTTTCAC GACGGCAAGC CGTTCGGGCC CGAGGACGTC AAAGCGACCT TCGAGTACGG CGCGGGCAAG GACCGGCCGG CGCAGTGGTA TCCCGGCCCG ACCGAGACGC TGACGATCAC CACGCCCGAC GACGAGACCG TGATCGTCGA CACCTCGAAG GGCGGCTACC CCGCCCACCT CTTCATCTTC CTGGCCTCGT TCCTCCCGAT CCTCTCGGCC AAGGACGTGG CCGAGGGGCC GGGCGGCGCC CTCACCCGGC GCCTGAACGG CACCGGCCCG TTCCGCTTCG TCGAGCAGCG CGGCAACGAC ACCGTGCTCA AGGCCCATGA CGGCTATTTC CGCGGCAAGC CGGGGATTCC CGGCATCAAC TTCACCTTCA CCGGCGATTC GACCACGCGA ATGCTGTCGC TGATGAACGG CCAGGCCTCG ATCGTCGAGC GGCTCGAACC CGAGCAGGTC GAGACGGTCA AGAACAACCC AAAGATCGCG ATCAACGAGG TCGTCTCGGT CGAGAACAAG TATCTCTGGT TCCGCTGCTC CAAGCCGCCC TTCAACGACG TGCGGGTGCG CATGGCGGCC TGTCACTCGA TCGACCGGGC GATGCTCCTG GAGATCCTCG GCGCGGCGGG CCACGCCTCG GCCAATTTCA TCTCGCCGGT GAAGTTCGGC TACGTCGATC TGAAGAACTA CCCGGCCTAC GACCCGGCCA AGGCCCAGGC GCTGCTGGCC GAGGCGGGCT TCCCCAAGGG CAAGGGGCTG CCGCCGCTCG AATACATCAC CTCGGTCGGA TTCTACCCGA AGACGAAAGA ATACGGCGAG GTCATCACCG CGATGCTCAA TGAGCAGGGC TTTCCGGTGA GCCTCACGGT GCTGGAGCCG GCGGCTTGGA ACGAGCGGCT CTATCACCGC CCCGGCGGCG GGCCCGGCCA CATGGTCGAT TGCGGCTGGT CCACCGCCTC GCCCGAGCCG GATCTGGTGC TGCGCACCCA CTTCCACTCC TCCTCGCATC GCATCACCGG CATCGAGGAT GCGCAGATCG ATGCGAGCCT CGACAAGGAG CGCGCGGCGC CGACGCTGGA GGAGCGCAAG GCCATCCTGC AGAACGAGAC GATGCCGCTC CTGGCCGCCA AGATGCCGGC GCTGTCGCTG TTCACCTCGG TGATGATCCA CGCGATGCAG CAGGAGCTGA AGGGCCTCTA CATCTACCCG GACGGCTCGA TCGACGCCTC GAAAACCGCC TGA
|
Protein sequence | MWFLTTLMDA WELGSLSMAG TMDRRRFLQA SAAAVGFAQI NPDFLVSSAF AQSGKPLVFL SAENITGNWD PTAHTTLSQK NIEGFVMGFL TRTPMTLDDP GKVVYELATD IRLLDPHRLQ IKLRKGVQFH DGKPFGPEDV KATFEYGAGK DRPAQWYPGP TETLTITTPD DETVIVDTSK GGYPAHLFIF LASFLPILSA KDVAEGPGGA LTRRLNGTGP FRFVEQRGND TVLKAHDGYF RGKPGIPGIN FTFTGDSTTR MLSLMNGQAS IVERLEPEQV ETVKNNPKIA INEVVSVENK YLWFRCSKPP FNDVRVRMAA CHSIDRAMLL EILGAAGHAS ANFISPVKFG YVDLKNYPAY DPAKAQALLA EAGFPKGKGL PPLEYITSVG FYPKTKEYGE VITAMLNEQG FPVSLTVLEP AAWNERLYHR PGGGPGHMVD CGWSTASPEP DLVLRTHFHS SSHRITGIED AQIDASLDKE RAAPTLEERK AILQNETMPL LAAKMPALSL FTSVMIHAMQ QELKGLYIYP DGSIDASKTA
|
| |