Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2847 |
Symbol | |
ID | 7093010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3128963 |
End bp | 3129994 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643466158 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_002363127 |
Protein GI | 217978980 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.108269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATTT CATGGATTAA CATCGGCGCG GTCGCGTCCA TCGCGGTCGC GGCGGCCCTG CTCATAACAA GGAATGGCGA GGGACACGCG GCGCACTACC TGCTGAATGT CTCCTATGAC CCAACCCGCG AACTCTATCA GAGCATCAAC CCGCGCTTTG TCGAAAATTA TGAGAAACAA ACCGGTTCAA GCGTCGCGAT TACCCAATCG CACGCCGGGT CGTCCTATCA GGCAAGACGG GTGTCCTCGG GCGAACTCAA GGCCGACGTC GTCACGCTCG GGCTGCCCTC GGACGTTGAG GCCTTGCACA ACAAGGGCAT GATTGCGGAT GGCTGGGAGA AGCGTTTGCC CCATGACTCC CGGCCATATA ATTCGACGAT CGTCTTTGTC GTCCGCAAGG GAAATCCCCA CGCCATACAT GACTGGCCCG ATCTGATTAA GGAGGGCGTC GAGATCATCG TTCCTGATCC CAAGACGTCG GGCAATGGCA AGCTGGCGGC GCTTGCCGCC TGGGGCGCCG TCGTCACCCG GGGCGGCAGC GAGAGCGAAG CGAAGGCCTT TTTGAAAGAG CTCTATGCGC ATACGCCATT CCTCGATCCG GCGGCGCGGT CGACGGGCGT CGCTTTTGCG ATCGAGAAAA AAGGCGACGT TCACCTCGCC TGGGAAAATG AGGCCCTGCG CGAAACCAAG GACTCCAAGG GAGCCCTCGA AATCGTCTAT CCGCCGGTCA GCATCCGGGC CGAGCCATCG GTCGCTTGGG TCGACAGCAA TGTCGAGAAG CACGGCAGCG CTCCGCTGGC GCGCGCCTAT CTGGAGTTCC TGTTCACCGA CGAGGGGCAG GAGATTATCG CGCGCGAAGG CTATCGCCCG CAAAATCAGC AAATCCTCGA AAAACATGCC GACCGGCTGC CGAAGATCAA TCTGTTTTCG ATCACCGCGA TCGCGCAGGA CTGGTCCGAC GCGCAGCGCC GATTCTTTGC CGACAACGGC ATTATCGACG CCGTCTACGC GCCCAAACCG CGCAGCGACT AG
|
Protein sequence | MRISWINIGA VASIAVAAAL LITRNGEGHA AHYLLNVSYD PTRELYQSIN PRFVENYEKQ TGSSVAITQS HAGSSYQARR VSSGELKADV VTLGLPSDVE ALHNKGMIAD GWEKRLPHDS RPYNSTIVFV VRKGNPHAIH DWPDLIKEGV EIIVPDPKTS GNGKLAALAA WGAVVTRGGS ESEAKAFLKE LYAHTPFLDP AARSTGVAFA IEKKGDVHLA WENEALRETK DSKGALEIVY PPVSIRAEPS VAWVDSNVEK HGSAPLARAY LEFLFTDEGQ EIIAREGYRP QNQQILEKHA DRLPKINLFS ITAIAQDWSD AQRRFFADNG IIDAVYAPKP RSD
|
| |