Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2713 |
Symbol | |
ID | 5831164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3039671 |
End bp | 3042937 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641368513 |
Product | trehalose synthase |
Protein accession | YP_001640175 |
Protein GI | 163852132 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.352676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATC GCAGCGATCC GCAGTGGTAC CGTGACGCCA TCATCTACCA GATCCACGTC AAGTCGTTCT TCGACTCGTC GAATGACGGG ATCGGCGATT TCGAGGGACT GACGCAGAAG CTCGATTATG TCCGCGACCT CGGGGTCACG GCGATCTGGC TGATGCCGTT CTATCCCTCG CCGCTGCGCG ACGACGGCTA CGACATCGCC GACTACCGCG ATGTGAACCC GTCCTACGGG ACGATGGAAG ATTTCCGCCG CTTCGTGGCT GCCGCCCATG AGCGGGGGCT GCGCGTCATC ACCGAGCTGG TCATCAACCA CACCAGCGAT CAGCACCCCT GGTTCCAGCG CGCCCGCGAG GCGCCGGCCG GGTCGCCTGA GCGCGATTTC TATGTGTGGT CGGATACCGA CGAGAAGTAT TCCGATACGC GCATCATCTT CCTGGACACG GAAGTGTCGA ACTGGACCTG GGACCCGGTC GCCAAGCAGT ACTTCTGGCA CCGCTTCTAC AGCCACCAGC CCGACCTGAA CTTCGACAAC CCGGCCGTGC TGGAGGCCGT GATCGAGGTC ATGCGCTACT GGCTCGACAT GGGCGTCGAC GGGCTGCGGC TCGACGCGAT CCCCTACCTG ATCGAGCGCG ACGGCACGAA CTGCGAGAAC CTCTCCGAGA CCCACGACGT CATCAAGGCG ATCCGTGCGG CGCTCGACGC CGAGTATCCC GACCGGATGC TGCTGGCCGA GGCCAACCAG TGGCCGGAGG AGACGGCGCA GTATTTCGGC GAGGGCGACG AATGCCACAT GGCATTCCAC TTCCCGCTGA TGCCGCGGAT GTACATGGCG ATCGCCCGGG AGGACCGGCA CCCGATCACC GACATCATGC GCCAGACGCC CGAGATCCCC GAGGGCTGCC AGTGGGCGAT CTTCCTGCGC AACCATGACG AGCTGACGCT CGAAATGGTG ACGGCGGAAG AGCGCGACTA CCTCTGGTCG TTCTACGCCG CCGAGCGGCG CGCCCGCATC AATCTCGGCA TCCGCCGCCG CCTCGCGCCG CTGCTGGAGA ACGACCGGCG CAAGATCGAA CTGATGAAGT CCCTCGTCCT GTCGATGCCT GGCACGCCGG TGCTCTACTA CGGCGACGAG ATCGGCATGG GCGACAACAT CTATCTCGGC GATCGCGACG GTGTGCGCAC ACCGATGCAG TGGTCGCCGG ACCGCAACGG CGGCTTCTCG CGGGCCACCC CGCAAAAGCT GTTCCTGCCG TCGATCCAGG ACCCGATCTA CGGCTACGAC GCGATCAACG TCGAGGCGCA GATCCAAGCG CAGACCTCTC TGCTGAACTG GACGCGGCGC ATGATTGCGA TCCGCAACAA CTCGGTCGCG CTGGGGCGCG GCGCGATCCA GTTCCTGTAT CCGGCGAACC GCAAGGTGCT CGCCTGGATT CGCGAGCATG AGAACGAGCG CATCCTCTGC GTCGCCAACC TCTCGCGGGC ACCGCAGGCA GTGCAGCTCG ATTTGTCGGA CCTGCGTGGC GCGATCCCGA TCGAGCTGAC CGGCGGCACG GCCTTCCCGG CGATCGGCGA ACTGCCCTAC CTGCTGACCC TGCCGGCCTA CGGTTTCTAC TGGTTCGCCC TCTCGGTCGC GAATACCGGA GAGATCGGCC CGCAGCCGCA GAGCCCGGAA CTGTTCACCC TGGTGCTCAC CGGCGGCATC GAGACCCTGA TCAAGGGCCG CGAGCGCACC GCGTTCGAGC GCACCGTGGC GCCGCCCTTC ATCGCCTCGC GCCGCTGGTT CGGCGCCAAG GGCACGCGCA TCAAGTCGGT GCAGGTCGAT GACAGCGCCA TCGTCAAGGA CGGTTCCGGC GAGGGCAAGT TCCTGCTCCC GCGGGTGGCG GTGACCCTCG CCAACGGCGA GCGCCAGGAC TACTTCGTGC CGCTCGCCGT CGAGGAGGGC CGCGAGGACG AGACTCTGCT GGATCACGCG GTCGCCCGCG TGCGCCGCGG GCCCCGCACC GGCCTGCTCT ACGAGGCGGC CGCCGCCGCG CCGTTCGCCG TCGCCCTGAT CGAGGCGATG CGTGACGGTG TTACGATCCC GTCCGAGCGT GGGAGCCTCG TCTTCTCGAC GACCTCGGCC TACGATCCGG AGGTGCCGTT CGAGGCGGCC GACGTGCGCC GCCTCTCGGC CGAGCAGAGC AACACCTCGA TCGCCATCGG CGCGCGGATG ATGCTCAAGC TGCTGCGCCG CCTCCAGCCT GGCACGCATC CCGAGATCGA AATCGGCCGG TTCCTCACCG AGCAGGCCCA CTTCGCCAAC ACGCCGGCTT TGCTCGGCGT GGTCGAGCAT GTGGCCGCGG ACGGCACCCG CACCGCTCTG GCCCTGCTCC AGAAGTTCGT CCTCAATCAG GGCGACGCCT GGACGCTGAT GCTGGAAGGC CTGCGGCGCG ACTTCGAGAC CGTGGTGCTG GCCCCTGAGA GCGAGGCGCC GGCGCCGGAG GATGCCTTCA ATGCGCATCT GCGCTGGGCG GAGCTGCTCG GCCAGCGCAC GGCCGAACTG CACCGGGCCT TCGCCATCGA CACCGACGAT CCGGCCTTCG CAGTCGAGCC GTTCGGTGAG GCGGATCTCG CTTCGCTCGC GGATGATGCC CGCCATCAGG CCGCGCGGGC GTTCAAGGGG CTCGACGCGA TCACCGCGTG GTCGCCGGGC TCGGCCAGCG AGGCGCTCGC GTCCCGCCGT AGCGAGGTCG AGGCGCTCAT CACCGATTTG GCCTCCGGCC CGCTGCGGGG TGCCACCAAG ACTCGCATCC ACGGCGATTA CCATCTCGGT CAGGTGCTCG CCTCCGAGGG CGATCTCATC ATCGTCGATT TCGAGGGCGA GCCCTCGCGT CCGGTGGAGC AGCGCCGGGC CAAATCGACG CCGATGCGTG ACGTGGCGGG CATGCTCCGC TCCTTCGCCT ACGGCGCCGA AACGGTGGTG CGTGAGATCA CGGCCCGGTT CGGCGACAGC GAGGAACGGG CGCGCAACGC GGCGATCGCG TGGCGCGGCA TGATCGATGC AGCGTTCCTC GACGGCTACG GGCAAGCCGT GGCCGGCAGC CGGGCGGCGG TGGAGGATGC GGAGACGCAA TCCCGTCTGC TGCGTCTGAG CCTGCTGACC AAGGCGCTCT ACGAGGTCGA TTACGAAGTG AACAACCGCC CCGACTGGAT CGAAATCCCG GCACGTGGTG TTCTCAACAT ACTGGATGAA GCCAAGCGCG ACCGCGCTTC GGTGTGA
|
Protein sequence | MIDRSDPQWY RDAIIYQIHV KSFFDSSNDG IGDFEGLTQK LDYVRDLGVT AIWLMPFYPS PLRDDGYDIA DYRDVNPSYG TMEDFRRFVA AAHERGLRVI TELVINHTSD QHPWFQRARE APAGSPERDF YVWSDTDEKY SDTRIIFLDT EVSNWTWDPV AKQYFWHRFY SHQPDLNFDN PAVLEAVIEV MRYWLDMGVD GLRLDAIPYL IERDGTNCEN LSETHDVIKA IRAALDAEYP DRMLLAEANQ WPEETAQYFG EGDECHMAFH FPLMPRMYMA IAREDRHPIT DIMRQTPEIP EGCQWAIFLR NHDELTLEMV TAEERDYLWS FYAAERRARI NLGIRRRLAP LLENDRRKIE LMKSLVLSMP GTPVLYYGDE IGMGDNIYLG DRDGVRTPMQ WSPDRNGGFS RATPQKLFLP SIQDPIYGYD AINVEAQIQA QTSLLNWTRR MIAIRNNSVA LGRGAIQFLY PANRKVLAWI REHENERILC VANLSRAPQA VQLDLSDLRG AIPIELTGGT AFPAIGELPY LLTLPAYGFY WFALSVANTG EIGPQPQSPE LFTLVLTGGI ETLIKGRERT AFERTVAPPF IASRRWFGAK GTRIKSVQVD DSAIVKDGSG EGKFLLPRVA VTLANGERQD YFVPLAVEEG REDETLLDHA VARVRRGPRT GLLYEAAAAA PFAVALIEAM RDGVTIPSER GSLVFSTTSA YDPEVPFEAA DVRRLSAEQS NTSIAIGARM MLKLLRRLQP GTHPEIEIGR FLTEQAHFAN TPALLGVVEH VAADGTRTAL ALLQKFVLNQ GDAWTLMLEG LRRDFETVVL APESEAPAPE DAFNAHLRWA ELLGQRTAEL HRAFAIDTDD PAFAVEPFGE ADLASLADDA RHQAARAFKG LDAITAWSPG SASEALASRR SEVEALITDL ASGPLRGATK TRIHGDYHLG QVLASEGDLI IVDFEGEPSR PVEQRRAKST PMRDVAGMLR SFAYGAETVV REITARFGDS EERARNAAIA WRGMIDAAFL DGYGQAVAGS RAAVEDAETQ SRLLRLSLLT KALYEVDYEV NNRPDWIEIP ARGVLNILDE AKRDRASV
|
| |