Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1172 |
Symbol | |
ID | 5832448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1291817 |
End bp | 1292758 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641366965 |
Product | aliphatic sulfonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001638645 |
Protein GI | 163850602 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0629619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0181854 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCT GGCTCGTCCT GATCGCCCTG GCGCTCGCCA GCCTCGCGCC CGCCCGCGCG GAGGAGGTCT TGCGCGTCGG CGACCAGCGC GGCAACGCCC GCGCCCTGAT GGAGGCCACG GGCGTGCTCG ACGGCCTCAC CTACCGGCTG GAATGGAGCG AGTTTCCGGC GGCCGCCCCG CTGCTGGAAG CGCTGAATGC CGGCGTCATC GACGCGGGCG GCGTGGGCGA TGGCCCCTTC ACCTTCGCGG CCGCCGCGGG GGTTCCGGTC AAGGCCTTTC TGGCCTTTCG CAACCGGCAG GACGGGCTCG CCATCCTCGT GCGGCCCGAT TCCGCCATCC GCAGCGTGGC GGATCTCCAG GGCAAGCGGA TCGCCACCAA CCGCGGCTCG ATCGGCCACC AGGTCGTCCT CGCCGCCCTC GAAGAAGCAG GGCTGCCCGC CGACAGCGTG CAGTTTCGCT TCCTGCCGCC GGCCGACGCC AAGTTGGCGC TGACTTCCGG CGCGGTCGAT GCGTGGTCGA CCTGGGAGCC CTACACCTCC GCGGCCGAAC TCGCCGGCCT CGTCCGGGTG CTCCGCGACG GCAACGGCAT TACCCCGGGC CTGAGCTACG CGGTGGCGAG CGACGCCGCG CTGAAATCCA AGCGCGCCCT GCTCGCCGAC TACGCCGCCC GCCTCGCCAG GGCCCGAGCC CGGGCGCTGA CCGATCCGGC GCCCTATGCT GCCGCGTGGT CGCGGCTGAT CGGCCTGCCC GAGGCGGTGC CTTTGCGCTG GTTCGGGCGC GCGCGCTATC GCACCGTGCC GATCGATGAC ACCGTGATCG CCGACGAGCA GCGCATCATC GACCTCTACG TGCGGGCCGG ACTGATCCCG GCGGCGCGGG CCCCGCGCGC CGAGGCGATC CTCGATACCG GGTTTTCGGA CGCGCTTGCC GCCGTGCGAT GA
|
Protein sequence | MARWLVLIAL ALASLAPARA EEVLRVGDQR GNARALMEAT GVLDGLTYRL EWSEFPAAAP LLEALNAGVI DAGGVGDGPF TFAAAAGVPV KAFLAFRNRQ DGLAILVRPD SAIRSVADLQ GKRIATNRGS IGHQVVLAAL EEAGLPADSV QFRFLPPADA KLALTSGAVD AWSTWEPYTS AAELAGLVRV LRDGNGITPG LSYAVASDAA LKSKRALLAD YAARLARARA RALTDPAPYA AAWSRLIGLP EAVPLRWFGR ARYRTVPIDD TVIADEQRII DLYVRAGLIP AARAPRAEAI LDTGFSDALA AVR
|
| |