Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1169 |
Symbol | |
ID | 5832419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1287933 |
End bp | 1289018 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641366962 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_001638642 |
Protein GI | 163850599 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0939824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0445222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG AGCCCCCGCG CCCCGGCCTC CTGCCGTCGC GCCGGTTCCT GTTGCGCGCA GGCGCTGCCG CAGCGGCTCT CCCGCTTGGG TTCGGGGTGG GCGGCGTGCG GGCCTGGGGC CCCGGCCCGG CCGTACCGTT CGATCCCGGC CCGATCTGCC GCCCCGCCGC CGCGGAGGGG CCCGCCGGTC CCCTGAAGCC GATCAAGCTC GCCTGGAACG CCACCGCGAT CTGCACCGCC GCGGCGCCGC TGGCCAAGGA GCGCGGCATC TTCGCCGCCC ACGGCCTCGA CGTGGAGTTC GTGAATTTCG GCGGCTCGAC CGAGGCTCTG TTGGAGGCCA TTGCCACGGG CAAGGCGGAT GCCGGCATCG GCATGGCGTT GCGCTGGCTC AAGCCTCTGG AACAGGGCTT CGACGTGAAG ATCACCGCCG GCCTGCACGG CGGCTGTCTC CGGCTGCTCG GCGCGAAATC CGCCGGCATC ACCGACGTCG CGGCGCTGAA GGGCAAAACG ATCGCGATCA GCGATCACGC GAGCCCGGCC AAGAACTTCT TCGCCCTGCT GCTCGCGCAG GCCGGCATCG ATCCGGAGAC CGGCGTCGAG TGGCGGCAAT ACCCGGCCGA CCTCCTCAAC CTTGCGGTCG AGAAGGGCGA GGCGCAGGCG CTGGCCGATT CCGATCCGCG CACGTGGATC TGGCTGAAGG ATCCGAAATT CACGGAAGTC GCGACCAACC TCTCGGGGGC TTACGCCGAT CGCACCTGCT GCGTGGTCGC CGTGCGCGGC AGCCTGATCC GCAATGATCG CGCCGCCGCC GCCGCGCTCA CCCGCGCCGT GCTGGAGGCC GGTCACCGCG TCCACGAGAA CCCGAAGGAC GCCGCACGCA TCTTTTCCGG CTACGGCGGC AAGGGTTCGG TCGAGGATCT TGCCGCGATG CTGCGCAGCC AGCACCACGG CGACCGCCCG GTCGGCACCG ACCTGAAACG CCAGCTCGTG CTTTACGGCG ACGAACTCAA ACAGGTGAAC GTCCTCAAGC GCACCACCGA CACGGCTAAG TTCGCCGAGC GCGTCTATGC CGACGTGCTG AGCTGA
|
Protein sequence | MTDEPPRPGL LPSRRFLLRA GAAAAALPLG FGVGGVRAWG PGPAVPFDPG PICRPAAAEG PAGPLKPIKL AWNATAICTA AAPLAKERGI FAAHGLDVEF VNFGGSTEAL LEAIATGKAD AGIGMALRWL KPLEQGFDVK ITAGLHGGCL RLLGAKSAGI TDVAALKGKT IAISDHASPA KNFFALLLAQ AGIDPETGVE WRQYPADLLN LAVEKGEAQA LADSDPRTWI WLKDPKFTEV ATNLSGAYAD RTCCVVAVRG SLIRNDRAAA AALTRAVLEA GHRVHENPKD AARIFSGYGG KGSVEDLAAM LRSQHHGDRP VGTDLKRQLV LYGDELKQVN VLKRTTDTAK FAERVYADVL S
|
| |