Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4149 |
Symbol | |
ID | 5832504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4615910 |
End bp | 4616800 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641369939 |
Product | extracellular solute-binding protein |
Protein accession | YP_001641589 |
Protein GI | 163853546 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.514926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTCG TGAACGGACG GCGCCGTACA GCCGCATCGG TCGTCGCTCT GACCGCCTTC GCCGCCCTGT GCGCACCCGC CCAGGCGCAG GACACGAAAG CCACCTCGAA AGCCGCCGAG GCCGCCAAAC CCGATGCCGG GACCTTGCGC GTCTGCGCCG CCGAGCAGCC GCCGCTCTCG ATGAAGGACG GCTCGGGGCT CGAGAACCGC ATCGCGACGA CGGTGGCCGA GGCCATGGGC CGCAAAGCCC AGTTCGTCTG GCTCGGAAAG CCCGCGATCT ACCTCGTGCG CGACGGGCTG GAGAAGAAGA CCTGCGACGT GGTGATCGGG CTCGATGCCG ACGACGCCCG CGTGCTGACC AGCAAGCCCT ATTACCGCTC GGGCTACGTC TTCCTCACCC GCGCCGACAA GGATCTCGAC GTCAAGTCCT GGTCCGATCC GCGCCTGAAG GACGTCAGCC ACATGGTGGT CGGCTTCGGC ACGCCCGGCG AGGCGATGCT CAAGGATATC GGCCGCTACG AGGAGGACAT GGCCTACCTC TACTCGCTGG TGAACTTTCG CGCGCCGCGG AATCAATACA CCCAGATCGA TCCGGCCCGG ATGGTGAGCG AGGTCGCCAC CGGCAAGGCC GAGGTCGGCG TGGCCTTCGG GCCCGACGTC GCCCGCTACG TGCGCGATTC CTCGACCAAG CTGCGCATGA CCCCCGTGCC CGACGACACG CAGGCCAGCG ACGGCCGGAA GATGCCGCAG AGCTTCGACC AGGCGATGGG CGTGCGCAAG GACGACACCG CCCTGAAGGC GGAGATCGAC GCCGCCCTGG AGAAGGCCAA GCCGAAGATC GAGGCGATCC TGAAGGAAGA AGGCGTGCCC GTGCTGCCCG TCTCCAACTG A
|
Protein sequence | MSLVNGRRRT AASVVALTAF AALCAPAQAQ DTKATSKAAE AAKPDAGTLR VCAAEQPPLS MKDGSGLENR IATTVAEAMG RKAQFVWLGK PAIYLVRDGL EKKTCDVVIG LDADDARVLT SKPYYRSGYV FLTRADKDLD VKSWSDPRLK DVSHMVVGFG TPGEAMLKDI GRYEEDMAYL YSLVNFRAPR NQYTQIDPAR MVSEVATGKA EVGVAFGPDV ARYVRDSSTK LRMTPVPDDT QASDGRKMPQ SFDQAMGVRK DDTALKAEID AALEKAKPKI EAILKEEGVP VLPVSN
|
| |