Gene Msil_2847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2847 
Symbol 
ID7093010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3128963 
End bp3129994 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content60% 
IMG OID643466158 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_002363127 
Protein GI217978980 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.108269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGATTT CATGGATTAA CATCGGCGCG GTCGCGTCCA TCGCGGTCGC GGCGGCCCTG 
CTCATAACAA GGAATGGCGA GGGACACGCG GCGCACTACC TGCTGAATGT CTCCTATGAC
CCAACCCGCG AACTCTATCA GAGCATCAAC CCGCGCTTTG TCGAAAATTA TGAGAAACAA
ACCGGTTCAA GCGTCGCGAT TACCCAATCG CACGCCGGGT CGTCCTATCA GGCAAGACGG
GTGTCCTCGG GCGAACTCAA GGCCGACGTC GTCACGCTCG GGCTGCCCTC GGACGTTGAG
GCCTTGCACA ACAAGGGCAT GATTGCGGAT GGCTGGGAGA AGCGTTTGCC CCATGACTCC
CGGCCATATA ATTCGACGAT CGTCTTTGTC GTCCGCAAGG GAAATCCCCA CGCCATACAT
GACTGGCCCG ATCTGATTAA GGAGGGCGTC GAGATCATCG TTCCTGATCC CAAGACGTCG
GGCAATGGCA AGCTGGCGGC GCTTGCCGCC TGGGGCGCCG TCGTCACCCG GGGCGGCAGC
GAGAGCGAAG CGAAGGCCTT TTTGAAAGAG CTCTATGCGC ATACGCCATT CCTCGATCCG
GCGGCGCGGT CGACGGGCGT CGCTTTTGCG ATCGAGAAAA AAGGCGACGT TCACCTCGCC
TGGGAAAATG AGGCCCTGCG CGAAACCAAG GACTCCAAGG GAGCCCTCGA AATCGTCTAT
CCGCCGGTCA GCATCCGGGC CGAGCCATCG GTCGCTTGGG TCGACAGCAA TGTCGAGAAG
CACGGCAGCG CTCCGCTGGC GCGCGCCTAT CTGGAGTTCC TGTTCACCGA CGAGGGGCAG
GAGATTATCG CGCGCGAAGG CTATCGCCCG CAAAATCAGC AAATCCTCGA AAAACATGCC
GACCGGCTGC CGAAGATCAA TCTGTTTTCG ATCACCGCGA TCGCGCAGGA CTGGTCCGAC
GCGCAGCGCC GATTCTTTGC CGACAACGGC ATTATCGACG CCGTCTACGC GCCCAAACCG
CGCAGCGACT AG
 
Protein sequence
MRISWINIGA VASIAVAAAL LITRNGEGHA AHYLLNVSYD PTRELYQSIN PRFVENYEKQ 
TGSSVAITQS HAGSSYQARR VSSGELKADV VTLGLPSDVE ALHNKGMIAD GWEKRLPHDS
RPYNSTIVFV VRKGNPHAIH DWPDLIKEGV EIIVPDPKTS GNGKLAALAA WGAVVTRGGS
ESEAKAFLKE LYAHTPFLDP AARSTGVAFA IEKKGDVHLA WENEALRETK DSKGALEIVY
PPVSIRAEPS VAWVDSNVEK HGSAPLARAY LEFLFTDEGQ EIIAREGYRP QNQQILEKHA
DRLPKINLFS ITAIAQDWSD AQRRFFADNG IIDAVYAPKP RSD