Gene Msil_2246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2246 
Symbol 
ID7091368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2432175 
End bp2433224 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content64% 
IMG OID643465567 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_002362542 
Protein GI217978395 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGGG CCAATCCCCT TCGCCTCGCC GCTATCGGCG TCCTGGTCGC CCTCGCGGCG 
CCCGCCGCTC ATTTCCTTGG CGGTTCTTCC TTCGTGCAGC CGGCGCAGGC CGCGCAGGAA
CTTCTGAACG TCTCCTATGA TCCGACCCGC GAGCTTTACA GGGCGATCAA CGAAGCCTTC
GCCGCCGATT GGAAGGCGAA GACCGGCGAG GCGATCGAAG TGCGCTCGTC CCATGCCGGC
TCCGGCGCGC AGGCGCGCGC GGTGATCGAC GGCCTGCCCG CCGATGTGGT CACACTGGCG
CTCGCCGCCG ATATTGACGC CATCGCCGCC AAGAGCGGCA AGCTGCCCGC CGATTGGCAA
AAGCGCCTGC CGCATAATTC CACGCCCTAC ACCTCGACGA TCGTGCTCTT GGTCCGGAAG
GGCAATCCGA AACAGATCAA GGATTGGGAC GATCTGGTGA AGCCGGGCAT CTCGGTCATT
ACGCCCAACC CGAAGACGTC GGGCGGCGCG CGCTGGAATT TCCTCGCCGC GTGGGGCTAC
GCGAATAAGA AATTCGGCGG CGACGAAGCC AAGGTCCGCG ATTTCATCCG CGCGCTCTAC
AAAAATACGC CGGTGCTCGA TACCGGCGCG CGCGGCTCGA CGATCAGCTT CGCCCAGCGC
GGCCAGGGCG ACGTGCTGAT CTCGTGGGAG AATGACGCCT TCCTCGCCTC GGAAGAATTC
GGCAAGGACC AGTTCGACAT CATCGTCCCC TCGATTTCGA TCCTGGCGGA GCCTCCGGTC
GCCCTGGTCG ACGGCAATGT GGACGCCAAG AAGACCCGCA AGGTCGCCGA GGCCTATCTC
GACTTCCTCT ATACGCCGAA GGCGCAGGCG CTGATCGCCA AGAACTATTA TCATCCCGTG
TCGCCCGAGG CGGCCGATCC CAAGGATCTG GCGCGCCTCG CCAAAATTCC GCTGGTCACG
ATCGACGGTG ATTTTGGCGG CTGGAAGGCG GCTCAGGCGC GCTTCTTCGC CGACGGCGGC
GTGTTTGATC AGATCTACGC CGGGCAATAA
 
Protein sequence
MSRANPLRLA AIGVLVALAA PAAHFLGGSS FVQPAQAAQE LLNVSYDPTR ELYRAINEAF 
AADWKAKTGE AIEVRSSHAG SGAQARAVID GLPADVVTLA LAADIDAIAA KSGKLPADWQ
KRLPHNSTPY TSTIVLLVRK GNPKQIKDWD DLVKPGISVI TPNPKTSGGA RWNFLAAWGY
ANKKFGGDEA KVRDFIRALY KNTPVLDTGA RGSTISFAQR GQGDVLISWE NDAFLASEEF
GKDQFDIIVP SISILAEPPV ALVDGNVDAK KTRKVAEAYL DFLYTPKAQA LIAKNYYHPV
SPEAADPKDL ARLAKIPLVT IDGDFGGWKA AQARFFADGG VFDQIYAGQ