Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3921 |
Symbol | |
ID | 7092618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 4301122 |
End bp | 4304379 |
Gene Length | 3258 bp |
Protein Length | 1085 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643467206 |
Product | trehalose synthase |
Protein accession | YP_002364164 |
Protein GI | 217980017 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATC GCAGCGATGT GCATTGGTAT CGCGACGCCA TCATTTACCA GCTTCACGTC AAATCCTTCT TCGATTGCAA CAATGACGGC ATCGGCGATT TCAAGGGAGT GACGCAGAAA CTCGACTATG TGAAGGATCT TGGCGCGACC GCGATCTGGC TGATGCCATT CTACCCCTCG CCGCTGCGCG ACGACGGTTA CGACATTTCC AATTATCGCG ACATCAACCC AGCCTATGGG TCGCTGCGCG ACTTCAAGAC GTTCGTTCGC GAAGCGCATG ACCGCGGCCT TCGGGTCATC ATCGAACTCG TCGTCAACCA CACCTCCGAT CAGCATCCCT GGTTCCAACG CGCCCGCGCT GCGAAGCCCG GCTCGGCCGC GCGAAATTTC TATGTCTGGG CCGACGATGA CAAAGCCTAC AAGGCCGTGC CGATCATCTT TCTCGATATC GAGAAATCGA ACTGGACCTA TGACGAGGCC GCCAAAGCCT TTTACTGGCA TCGCTTCTAT GCGCATCAGC CGGACCTCAA TTACGATAAT CCGCGCGTGC TCGAGGCGGT GCTCGACGTC ATGCGTTTCT GGCTCGACAT GGGCGTCGAC GGGCTGCGGC TCGACGCCAT TCCCTATCTG GTCGAGCGCG AAGGAACGCT TTGCGAAAAC CTGCCTGAGA CGCATGCGAT CATCAAAAAG ATCAGGGCTG CGGTCGACGC CGATTATCCC GACCGCATGC TTCTGGCCGA GGCCAATGTC TGGCCGGAGG AGGCGGCCGG CTATTTCGGC GACGGCGACG AATGCCACAT GGCGTTTCAT TTTCCGCTGA TGCCGCGCAT CTACATGGCG CTGGCCCAGG AGGACCGCCA TCCGATCACC GACATTATGC GCCAGACGCC GGAGCTGCCG GAGGGATCGC AATGGGCGAT CTTCCTGCGC AACCATGATG AGATGACGCT CGCCATGGTA ACCGACAAGG AGCGCGATTA TCTGTGGTCC TTCTATGCCG CCGACCAGCG CGCGCGAATC AATCTCGGCA TTCGGCGCCG CCTTGCGCCG CTGCTCGAGA ACGACCGGCG CAAGATCGAA TTGTTGAATT CGCTGTTGTT CTCGATGCCT GGCGCGCCCG TTGTCTATTA CGGCGACGAG ATCGGCATGG GCGACAATAT CTACCTTGGC GATCGCGACG GCGTCAGAAC GCCGATGCAA TGGTCGGTCG ACCGCAGCGG CGGCTTCAGC CGCGCCGATC CGGCAAGGCT GTTCCTGCCG GCGATTCAGG ACCCGATCTA CGGGTTCAGC GCGGTCAATG TCGAAGCGCA GCTCGCCAGC CCGTCGAGTC TTTTGACATG GACGCGGCGA ATGATCGCCG TGCGTCGCTC GACGCTTGCG TTCGGACGCG GCGCGCTCCG CTTTCTCTAT CCCGCCAATC GCAAGGTGCT CGCCTATTTG CGCGAGCTCC CGTCCGAAAC AATTCTCTGC GTCGTCAATG TTTCGCGCGC GCCGCAGGCG GTCGAACTCG ATCTCAGCGA ATTTCGCGGA TCGGCCCCGG TCGAGATGAC CGCCGGGAGT CAGTTCCCGG CGATCGGCGC CGCGCCCTAT GTGCTCACCC TGCCGTCCTA CGGCTTCTTC TGGTTCAGGC TCGAACCTCT GCCGCAGGAG CGCCCGCTGC GCGACTCCAT GCCGGAGCTG TTCACGCTTG TTGCGGTGGG CAAGCTCGAG ACCATTTTTT CAGGCCGCGA GTTGATCGCC TTCGAGCAAA ATGTCGCGCC GCAATATCTT GCGACGCGGC GCTGGTTCGA GGCGCCAACA TCGACGCCGC CGCGCGTCGC CGTCAAGGAT TTCGCGCTCC TCAGCGAGGC GGGCGACACG CGGCGCTTTG TGCTGGCTCT GCTCGAAGTC GAGGCGTTGG ATGCGTCGTC CGGCGTCTAT TTCGCGCCCT TCGTCGCCGA GCGCGAGAGT GAATTCACGC CGCCGCCGAG CGCTGCGGTC GCCAAATTGC GGCGCGGAGC GGAGATGGGC CTGCTCTATG ACGCCGACGC CTGCGCGCCT TTCGGGGCTG CGATGCTCGA CGCCTTCCGG AGCGGCGCGG CGATCGGCGC CGCCAAAGGC GGCAAGGTCA TCTTCTTGCC GGCGGAGCCG GACAGTTCCG ACCTCGCGAT CGAGGCTCTG GAGTGCGAGC TGCTTGCCGC CTTGCCAAGC CATTCGACGC TCGTGCTCGA CAAGCAGATC ACCCTGAAAA TCTTCCGCCG GCTCCAGAGC GGAGATCGGC CGGACATCGA GGTCAGGCGC TTTCTGACCG AGGTCGCCCG CTTTCCCAAC ATGCCGGCCT TGCTCGGGGC GGTTAACTAC AGCGACGCCG CCGGCGCGCG CTTGACGCTC GCGACATTCG AGACGTTCGT GCGCTGTCAG GGCGACGCCT GGACCTGGAC GCTCGAAGCG TTGAAGCGCA TCCTTGAAAC GCTCGCTATG GCGCCGGCCG CGACCGATCA GGGCGAGCCT CCAGCGCCGC TCAGCTTTTC CACCTATACG CCTCACATGC AGCGACTTGG GCTGCGCACC GCGCAAATGC ACCAGGCGCT GGCTACGCCG ACCGACGATC CGGCGATCGC GTCCGAGCCG CTTGCCGAAG CCGACGTGCG CCATAGCGTC CTTCTGCTGC GCGAGGCGGC CGCCCGCGGC TTCGAGCGCC TTGAGGAATC CGCCGAGGCG GAGGCGGCGA ACGCCGAGAT CGACCGGCTA CTCGACCGGC GCGAGGAATG CGAAAGCCTG TTCGCGCTGC TCGATGCGAA GCCGCAGGGG GCGATCAAGA TCCGCATCCA CGGCGATTAT GACTTGCGCC GCGCGCTCGT TGTGAAGGAT GACGTCATCA TCGTCGGGTT TCGCGGCGCC GATCGGATGG CCAAGAATCC GCGCGAGAAA AATTCGCCGC TGCGCGATGT CGCGACCATG CTGCGCTCCT TTGCGCAAGT GGCCGCTGCG GCCGAGCGGG CGATCGCGAC CCTTGTGCCC GATCCCGTCA TGGCGGCGAC CCGGCTCAGC GAGCAGGTTG TGGAATTTTC TGAAATCTTC GTCGAGGCCT ATTTCAACGC CACGCGCGGC GGAGCCGTGG CGATCGCCGA TCAGGGCACG AGGCGGCGTC TTCTTATTCT CTATATGCTC GCTGCCGCTT TCGAGGAAAT CAGCGGCGAG GATCCGTCAG CCGAGACGAT CGACGTCGCG GCGAAGGGAC TGAACGCGAT CCTCGATCGC GCCGCGCGGC TGCTTTGA
|
Protein sequence | MIDRSDVHWY RDAIIYQLHV KSFFDCNNDG IGDFKGVTQK LDYVKDLGAT AIWLMPFYPS PLRDDGYDIS NYRDINPAYG SLRDFKTFVR EAHDRGLRVI IELVVNHTSD QHPWFQRARA AKPGSAARNF YVWADDDKAY KAVPIIFLDI EKSNWTYDEA AKAFYWHRFY AHQPDLNYDN PRVLEAVLDV MRFWLDMGVD GLRLDAIPYL VEREGTLCEN LPETHAIIKK IRAAVDADYP DRMLLAEANV WPEEAAGYFG DGDECHMAFH FPLMPRIYMA LAQEDRHPIT DIMRQTPELP EGSQWAIFLR NHDEMTLAMV TDKERDYLWS FYAADQRARI NLGIRRRLAP LLENDRRKIE LLNSLLFSMP GAPVVYYGDE IGMGDNIYLG DRDGVRTPMQ WSVDRSGGFS RADPARLFLP AIQDPIYGFS AVNVEAQLAS PSSLLTWTRR MIAVRRSTLA FGRGALRFLY PANRKVLAYL RELPSETILC VVNVSRAPQA VELDLSEFRG SAPVEMTAGS QFPAIGAAPY VLTLPSYGFF WFRLEPLPQE RPLRDSMPEL FTLVAVGKLE TIFSGRELIA FEQNVAPQYL ATRRWFEAPT STPPRVAVKD FALLSEAGDT RRFVLALLEV EALDASSGVY FAPFVAERES EFTPPPSAAV AKLRRGAEMG LLYDADACAP FGAAMLDAFR SGAAIGAAKG GKVIFLPAEP DSSDLAIEAL ECELLAALPS HSTLVLDKQI TLKIFRRLQS GDRPDIEVRR FLTEVARFPN MPALLGAVNY SDAAGARLTL ATFETFVRCQ GDAWTWTLEA LKRILETLAM APAATDQGEP PAPLSFSTYT PHMQRLGLRT AQMHQALATP TDDPAIASEP LAEADVRHSV LLLREAAARG FERLEESAEA EAANAEIDRL LDRREECESL FALLDAKPQG AIKIRIHGDY DLRRALVVKD DVIIVGFRGA DRMAKNPREK NSPLRDVATM LRSFAQVAAA AERAIATLVP DPVMAATRLS EQVVEFSEIF VEAYFNATRG GAVAIADQGT RRRLLILYML AAAFEEISGE DPSAETIDVA AKGLNAILDR AARLL
|
| |