Gene Msil_3330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3330 
Symbol 
ID7090826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3662933 
End bp3663979 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID643466637 
Productselenide, water dikinase 
Protein accessionYP_002363598 
Protein GI217979451 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0709] Selenophosphate synthase 
TIGRFAM ID[TIGR00476] selenium donor protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.456858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATG CTCCCGTGCG CCTCACCGAT CTCGCCCATG GCGGCGGCTG CGGCTGTAAA 
CTGGCGCCCT CCGTTCTGCA GCAGCTCCTC GCCAATCAAC CTCAGGCTGC GCCTTATGCG
CAGCTTCTGG TCGGCACGGA GACCGGCGAC GACGCGGCAG TCTGGCAGAT CGACGAGGAG
CGCTGCGTCA TCGCCACGAC CGACTTCTTC ATGCCGATGG TCGACGATCC GCGGGATTTC
GGCCGCATCG CGGCGGCCAA TGCGCTGTCG GATATTTACG CCATGGGCGG AACGCCGATC
ATGGCGCTGG CCATTCTCGG AATGCCGCTC GGAAAGCTGC CGATTGAAAC CGTGCGCGCG
ATCCTTGCGG GCGGCGCCTC GATCTGCGCC GAGGCCAGCA TTCCTGTCGC GGGGGGCCAT
TCGATCGACT CGCCGGAGCC GATCTACGGA CTTGCGGTCG TCGGGCTTTG CGCCGTCAGC
GATATCCGCC GCAATTCGGG CGCGCGCCCC GGCGACGCGC TGATCCTGAC CAAGGGCATC
GGCGTCGGCG TCTATTCGGC GGCGTTCAAG AAGCAGGCGC TGAGTAATGC GGCCTATGAA
GAGATGATGG CCTCGACGAC GCTGTTGAAC CGCGTCGGCC ACAAGCTGGC GAAGGACGAC
GACGTCCACG CCATGACGGA TGTGACCGGC TTCGGCCTGC TTGGCCATGG CGTCGAGCTC
GCGCGCGGCG GAGGCGTCGC GCTCGACATC GATTTCGCCC GCATCCCTTT CCTGAAGGAG
GCTCAGGAGC TGGCCGAGGC CGGCCTAATT ACCGGAGCCT CCGGACGCAA CTGGGCGAGC
TATGGCGACG CCGTTGTGCT GCCGGCTGAA ACGCCGGACT GGCGGCGCGC GCTGCTGACC
GATCCGCAGA CCTCCGGCGG GCTCCTCATC GCCTGCGCTC CGGAGCGCGC CGAAGCGATC
CGCGGGACCA TCGAGGCTGC GGGCTTTCCT CGCGCGACGA TCATCGGCGC CGTCGCCGCG
GGCGAGCCAG CCGTCCGGAT CGGCTGA
 
Protein sequence
MLDAPVRLTD LAHGGGCGCK LAPSVLQQLL ANQPQAAPYA QLLVGTETGD DAAVWQIDEE 
RCVIATTDFF MPMVDDPRDF GRIAAANALS DIYAMGGTPI MALAILGMPL GKLPIETVRA
ILAGGASICA EASIPVAGGH SIDSPEPIYG LAVVGLCAVS DIRRNSGARP GDALILTKGI
GVGVYSAAFK KQALSNAAYE EMMASTTLLN RVGHKLAKDD DVHAMTDVTG FGLLGHGVEL
ARGGGVALDI DFARIPFLKE AQELAEAGLI TGASGRNWAS YGDAVVLPAE TPDWRRALLT
DPQTSGGLLI ACAPERAEAI RGTIEAAGFP RATIIGAVAA GEPAVRIG