Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1806 |
Symbol | |
ID | 5832170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2027993 |
End bp | 2028988 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641367605 |
Product | putative ABC transporter periplasmic solute-binding protein |
Protein accession | YP_001639276 |
Protein GI | 163851233 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.592526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.965356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTCAC GGCGCGGAAT GCTCGCGACG GCGGCTCTGT CGCTGGCCGC GACGCTGCCG ATGGCGCGCC GCGCGCAGGC GGCGGGCGAG ACGTTCCGGC TCGGCGTTCT GCCTTTCGGC ACCGCCTCCT GGGAGGCTGC CGTTATCAAG GCGCGGGGCT TTGATACGGC CAATGGCTTC ACCCTCGATA TCGTCAAGCT GGCCGGCAAC GATGCCGCCC GTATCGCCTT CCTCGGCGGT CAGGTCGATG CCATCGTCGG CGACCTGATC TTCGCCGCCC GCCTCGGCAA CGAGGGGCGG GGCGTGCGCT TCTCGCCCTA TTCCACCACC GAAGGGGCGC TGATGGTGCC CGCCGGAAGC CCGATCACGG ATTTGAAGGG GCTCGCGGGC AAGCGGCTCG GGGTGGCGGG CGGCGCGCTC GACAAGAACT GGATCCTGTT GAGGGCGCAG GCGCGCGAGA CGGCCGGGCT CGAGCTCGAG AACGTCGCGC AGATCGCCTA CGGCGCGCCA CCGCTGCTGG CGCAGAAGCT GGAGACCGGC GAGCTCGACG CGGCTCTGCT CTACTGGCAG TTCTGCGCCC GCCTCGAAGC CAAAGGCTTC AAGCGGCTGA TTTCGGCCGA CGACGTCATG CGGGCCTTCG GCGCCAAGGG CGCGGTCTCG CTGATCGGCT ATCTCTACGA GGGCCACACC GTGGCCGACC GGGGCGAGGT GGTGCGCGGC TTCGCCCGCG CCTCGGCCGC TGCCAAGGAC GCGCTGGCGA ACGAGCCGGC CCTGTGGGAG ACGGTCCGTC CGCTGATGGC GGCGGAGGAC GACGCCACCT TCGCCACGCT CAAGCGCGAT TTCCTCGCCG GAATCCCGCG CCGGCCGATC GCCGCCGAGC GCGCCGACGG CGAGCGCATC TACGCGGCGC TGGACCGGCT CGCAGGCGCG CAGCTCCTCG GCGTGGGCAA GAGCCTGCCG CCGGACCTCT ATCTCGACGC CTCGGGCAAC GGCTGA
|
Protein sequence | MLSRRGMLAT AALSLAATLP MARRAQAAGE TFRLGVLPFG TASWEAAVIK ARGFDTANGF TLDIVKLAGN DAARIAFLGG QVDAIVGDLI FAARLGNEGR GVRFSPYSTT EGALMVPAGS PITDLKGLAG KRLGVAGGAL DKNWILLRAQ ARETAGLELE NVAQIAYGAP PLLAQKLETG ELDAALLYWQ FCARLEAKGF KRLISADDVM RAFGAKGAVS LIGYLYEGHT VADRGEVVRG FARASAAAKD ALANEPALWE TVRPLMAAED DATFATLKRD FLAGIPRRPI AAERADGERI YAALDRLAGA QLLGVGKSLP PDLYLDASGN G
|
| |