Gene Msil_3921 details


Gene Information

Locus tag: Msil_3921
Symbol:
ID: 7092618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 4301122
End bp: 4304379
Gene Length: 3258 bp
Protein Length: 1085 aa
Translation table: 11
GC content: 63%
IMG OID: 643467206
Product: trehalose synthase
Protein accession: YP_002364164
Protein GI: 217980017
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
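
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(4301122, 4304379)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 3258 1085
```

Both results agree with the Gene Length and Protein Length fields of the record.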


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATC GCAGCGATGT GCATTGGTAT CGCGACGCCA TCATTTACCA GCTTCACGTC 
AAATCCTTCT TCGATTGCAA CAATGACGGC ATCGGCGATT TCAAGGGAGT GACGCAGAAA
CTCGACTATG TGAAGGATCT TGGCGCGACC GCGATCTGGC TGATGCCATT CTACCCCTCG
CCGCTGCGCG ACGACGGTTA CGACATTTCC AATTATCGCG ACATCAACCC AGCCTATGGG
TCGCTGCGCG ACTTCAAGAC GTTCGTTCGC GAAGCGCATG ACCGCGGCCT TCGGGTCATC
ATCGAACTCG TCGTCAACCA CACCTCCGAT CAGCATCCCT GGTTCCAACG CGCCCGCGCT
GCGAAGCCCG GCTCGGCCGC GCGAAATTTC TATGTCTGGG CCGACGATGA CAAAGCCTAC
AAGGCCGTGC CGATCATCTT TCTCGATATC GAGAAATCGA ACTGGACCTA TGACGAGGCC
GCCAAAGCCT TTTACTGGCA TCGCTTCTAT GCGCATCAGC CGGACCTCAA TTACGATAAT
CCGCGCGTGC TCGAGGCGGT GCTCGACGTC ATGCGTTTCT GGCTCGACAT GGGCGTCGAC
GGGCTGCGGC TCGACGCCAT TCCCTATCTG GTCGAGCGCG AAGGAACGCT TTGCGAAAAC
CTGCCTGAGA CGCATGCGAT CATCAAAAAG ATCAGGGCTG CGGTCGACGC CGATTATCCC
GACCGCATGC TTCTGGCCGA GGCCAATGTC TGGCCGGAGG AGGCGGCCGG CTATTTCGGC
GACGGCGACG AATGCCACAT GGCGTTTCAT TTTCCGCTGA TGCCGCGCAT CTACATGGCG
CTGGCCCAGG AGGACCGCCA TCCGATCACC GACATTATGC GCCAGACGCC GGAGCTGCCG
GAGGGATCGC AATGGGCGAT CTTCCTGCGC AACCATGATG AGATGACGCT CGCCATGGTA
ACCGACAAGG AGCGCGATTA TCTGTGGTCC TTCTATGCCG CCGACCAGCG CGCGCGAATC
AATCTCGGCA TTCGGCGCCG CCTTGCGCCG CTGCTCGAGA ACGACCGGCG CAAGATCGAA
TTGTTGAATT CGCTGTTGTT CTCGATGCCT GGCGCGCCCG TTGTCTATTA CGGCGACGAG
ATCGGCATGG GCGACAATAT CTACCTTGGC GATCGCGACG GCGTCAGAAC GCCGATGCAA
TGGTCGGTCG ACCGCAGCGG CGGCTTCAGC CGCGCCGATC CGGCAAGGCT GTTCCTGCCG
GCGATTCAGG ACCCGATCTA CGGGTTCAGC GCGGTCAATG TCGAAGCGCA GCTCGCCAGC
CCGTCGAGTC TTTTGACATG GACGCGGCGA ATGATCGCCG TGCGTCGCTC GACGCTTGCG
TTCGGACGCG GCGCGCTCCG CTTTCTCTAT CCCGCCAATC GCAAGGTGCT CGCCTATTTG
CGCGAGCTCC CGTCCGAAAC AATTCTCTGC GTCGTCAATG TTTCGCGCGC GCCGCAGGCG
GTCGAACTCG ATCTCAGCGA ATTTCGCGGA TCGGCCCCGG TCGAGATGAC CGCCGGGAGT
CAGTTCCCGG CGATCGGCGC CGCGCCCTAT GTGCTCACCC TGCCGTCCTA CGGCTTCTTC
TGGTTCAGGC TCGAACCTCT GCCGCAGGAG CGCCCGCTGC GCGACTCCAT GCCGGAGCTG
TTCACGCTTG TTGCGGTGGG CAAGCTCGAG ACCATTTTTT CAGGCCGCGA GTTGATCGCC
TTCGAGCAAA ATGTCGCGCC GCAATATCTT GCGACGCGGC GCTGGTTCGA GGCGCCAACA
TCGACGCCGC CGCGCGTCGC CGTCAAGGAT TTCGCGCTCC TCAGCGAGGC GGGCGACACG
CGGCGCTTTG TGCTGGCTCT GCTCGAAGTC GAGGCGTTGG ATGCGTCGTC CGGCGTCTAT
TTCGCGCCCT TCGTCGCCGA GCGCGAGAGT GAATTCACGC CGCCGCCGAG CGCTGCGGTC
GCCAAATTGC GGCGCGGAGC GGAGATGGGC CTGCTCTATG ACGCCGACGC CTGCGCGCCT
TTCGGGGCTG CGATGCTCGA CGCCTTCCGG AGCGGCGCGG CGATCGGCGC CGCCAAAGGC
GGCAAGGTCA TCTTCTTGCC GGCGGAGCCG GACAGTTCCG ACCTCGCGAT CGAGGCTCTG
GAGTGCGAGC TGCTTGCCGC CTTGCCAAGC CATTCGACGC TCGTGCTCGA CAAGCAGATC
ACCCTGAAAA TCTTCCGCCG GCTCCAGAGC GGAGATCGGC CGGACATCGA GGTCAGGCGC
TTTCTGACCG AGGTCGCCCG CTTTCCCAAC ATGCCGGCCT TGCTCGGGGC GGTTAACTAC
AGCGACGCCG CCGGCGCGCG CTTGACGCTC GCGACATTCG AGACGTTCGT GCGCTGTCAG
GGCGACGCCT GGACCTGGAC GCTCGAAGCG TTGAAGCGCA TCCTTGAAAC GCTCGCTATG
GCGCCGGCCG CGACCGATCA GGGCGAGCCT CCAGCGCCGC TCAGCTTTTC CACCTATACG
CCTCACATGC AGCGACTTGG GCTGCGCACC GCGCAAATGC ACCAGGCGCT GGCTACGCCG
ACCGACGATC CGGCGATCGC GTCCGAGCCG CTTGCCGAAG CCGACGTGCG CCATAGCGTC
CTTCTGCTGC GCGAGGCGGC CGCCCGCGGC TTCGAGCGCC TTGAGGAATC CGCCGAGGCG
GAGGCGGCGA ACGCCGAGAT CGACCGGCTA CTCGACCGGC GCGAGGAATG CGAAAGCCTG
TTCGCGCTGC TCGATGCGAA GCCGCAGGGG GCGATCAAGA TCCGCATCCA CGGCGATTAT
GACTTGCGCC GCGCGCTCGT TGTGAAGGAT GACGTCATCA TCGTCGGGTT TCGCGGCGCC
GATCGGATGG CCAAGAATCC GCGCGAGAAA AATTCGCCGC TGCGCGATGT CGCGACCATG
CTGCGCTCCT TTGCGCAAGT GGCCGCTGCG GCCGAGCGGG CGATCGCGAC CCTTGTGCCC
GATCCCGTCA TGGCGGCGAC CCGGCTCAGC GAGCAGGTTG TGGAATTTTC TGAAATCTTC
GTCGAGGCCT ATTTCAACGC CACGCGCGGC GGAGCCGTGG CGATCGCCGA TCAGGGCACG
AGGCGGCGTC TTCTTATTCT CTATATGCTC GCTGCCGCTT TCGAGGAAAT CAGCGGCGAG
GATCCGTCAG CCGAGACGAT CGACGTCGCG GCGAAGGGAC TGAACGCGAT CCTCGATCGC
GCCGCGCGGC TGCTTTGA
 
Protein sequence
MIDRSDVHWY RDAIIYQLHV KSFFDCNNDG IGDFKGVTQK LDYVKDLGAT AIWLMPFYPS 
PLRDDGYDIS NYRDINPAYG SLRDFKTFVR EAHDRGLRVI IELVVNHTSD QHPWFQRARA
AKPGSAARNF YVWADDDKAY KAVPIIFLDI EKSNWTYDEA AKAFYWHRFY AHQPDLNYDN
PRVLEAVLDV MRFWLDMGVD GLRLDAIPYL VEREGTLCEN LPETHAIIKK IRAAVDADYP
DRMLLAEANV WPEEAAGYFG DGDECHMAFH FPLMPRIYMA LAQEDRHPIT DIMRQTPELP
EGSQWAIFLR NHDEMTLAMV TDKERDYLWS FYAADQRARI NLGIRRRLAP LLENDRRKIE
LLNSLLFSMP GAPVVYYGDE IGMGDNIYLG DRDGVRTPMQ WSVDRSGGFS RADPARLFLP
AIQDPIYGFS AVNVEAQLAS PSSLLTWTRR MIAVRRSTLA FGRGALRFLY PANRKVLAYL
RELPSETILC VVNVSRAPQA VELDLSEFRG SAPVEMTAGS QFPAIGAAPY VLTLPSYGFF
WFRLEPLPQE RPLRDSMPEL FTLVAVGKLE TIFSGRELIA FEQNVAPQYL ATRRWFEAPT
STPPRVAVKD FALLSEAGDT RRFVLALLEV EALDASSGVY FAPFVAERES EFTPPPSAAV
AKLRRGAEMG LLYDADACAP FGAAMLDAFR SGAAIGAAKG GKVIFLPAEP DSSDLAIEAL
ECELLAALPS HSTLVLDKQI TLKIFRRLQS GDRPDIEVRR FLTEVARFPN MPALLGAVNY
SDAAGARLTL ATFETFVRCQ GDAWTWTLEA LKRILETLAM APAATDQGEP PAPLSFSTYT
PHMQRLGLRT AQMHQALATP TDDPAIASEP LAEADVRHSV LLLREAAARG FERLEESAEA
EAANAEIDRL LDRREECESL FALLDAKPQG AIKIRIHGDY DLRRALVVKD DVIIVGFRGA
DRMAKNPREK NSPLRDVATM LRSFAQVAAA AERAIATLVP DPVMAATRLS EQVVEFSEIF
VEAYFNATRG GAVAIADQGT RRRLLILYML AAAFEEISGE DPSAETIDVA AKGLNAILDR
AARLL
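
As an illustration of how the protein sequence relates to the gene sequence, the following sketch translates the first and last lines of the gene sequence with translation table 11. It assumes the sequence is given 5' to 3' in the coding frame; the `translate` helper is hypothetical. The amino-acid assignments of table 11 are the same as those of the standard code (the tables differ in permitted start codons), so the compact standard lookup is used here.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 18 bases of the gene sequence in this record.
first_blocks = "ATGATCGATC GCAGCGATGT GCATTGGTAT"
last_blocks = "GCCGCGCGGC TGCTTTGA"

print(translate(first_blocks))  # MIDRSDVHWY
print(translate(last_blocks))   # AARLL
```

The two outputs match the first block and the final residues of the protein sequence listed above.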