Gene Information |
Locus tag | M446_6807 |
Symbol | |
ID | 6134384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 7491977 |
End bp | 7495243 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641646888 |
Product | trehalose synthase |
Protein accession | YP_001773486 |
Protein GI | 170744831 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases; [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase; [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
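The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. Both assumptions agree with the numbers in this record but are not stated by it.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(7491977, 7495243)
print(length, protein_length(length))  # 3267 1088
```

The result matches the Gene Length (3267 bp) and Protein Length (1088 aa) fields.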
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.267695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.5618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATC GCAGCGATCC GCAATGGTAC CGTGACGCCA TCATCTACCA GGTCCACGTC AAATCCTTCT TCGACGCCAA CAACGACGGC ATCGGCGACT TCGACGGCCT GACCGCCAAG CTCGACTACA TCCGCGACCT CGGGGTGACG GCGATCTGGG TGATGCCGTT CTACCCCTCG CCGCTGCGGG ACGACGGCTA CGACATCGCC GACTACAAGG GCATCAACCC GTCCTACGGC ACGATGCGCG ATTTCCGGCG CTTCGTCCGC GAGGCGCACG AGCGCGGCCT GCGCGTCATC ACCGAGCTCG TCATCAACCA CACCTCGGAC CAGCACCCCT GGTTCCAGCG CGCCCGCAGC GCCCCCAAGG GCTCCAAGTG GCGCGACTTC TACGTCTGGT CCGACACGGA CGAGAAGTAC CGCGACACGC GCATCATCTT CCTCGACACC GAGGCCTCGA ACTGGACCTG GGACCCGGTC GCCAAGGCCT ATTACTGGCA CCGCTTCTAC TCGCACCAGC CGGATCTGAA CTTCGACAAT CCGCGCGTGC TGGAGGCGGT GATCGAGGTG ATGCGCTACT GGCTCGACAT GGGGGTGGAC GGCCTGCGCC TCGACGCGAT CCCCTACCTG ATCGAGCGCG AGGACACGAA CTGCGAGAAC CTCTCCGAGA CGCACGACGT CATCAAGAAG ATCCGCGCCG CCCTCGACGC CGGCTACCCG GACCGCATGC TGCTCGCCGA GGCCAACCAG TGGCCCGAGG AGACCGCGCA GTACTTCGGC GACGGGGACG AGTGCCACAT GGCGTTCCAC TTTCCCCTGA TGCCGCGGAT GTACATGGCG ATCGCGCAGG AGGACCGGCA CCCGATCACC GATATCATGC GGCAGACGCC GGAGATCCCG GAGGGCTGCC AATGGGCGAT CTTCCTGCGC AACCACGACG AGCTGACGCT CGAGATGGTC ACCGACAAGG AGCGGGACTA CCTCTGGAGC TTCTACGCCG CCGACCGCCG CGCCCGCATC AACCTCGGCA TCCGCCGCCG CCTCGCGCCC CTGCTGGAGA ACGATCGGCG CAAGATCGAG TTGATGAAGT TCCTGCTGCT GTCGATGCCC GGGACCCCGG TGCTCTATTA CGGCGACGAG ATCGGGATGG GCGACAACAT CTACCTGGGC GACCGCGACG GGGTGCGCAC CCCGATGCAA TGGTCGCCCG ACCGGAACGG CGGCTTCTCC CGCGCCGACC CGGCGCGCCT GTTCCTGCCC ACCATCCAGG ACCCGATCTA CGGCTTCGAC GCCGTCAACG TCGAGGCCCA TAGCCGCGCC CAGACCAGCC TGCTCAACTG GACGCGGCGG ATGATCGCCA TCCGCAACAA CCACCGCTCG CTCGGCCGCG GCACGCTGCG CTTCCTCTAC CCGGACAACC GCAAGGTGCT GGCCTGGCTG CGTGAGTTCG ACGACGAGAA GGTGCTCTGC GTCGCCAACC TTTCGCGGGC GCCGCAGGCG GTGCAGCTCG ACCTCTCGGA GCTGCGCACG GCGGTGCCGG TGGAGCTCAC GGGCGGCACC TCCTTCCCGC CGATCGGCGA CCTGCCCTAC CTGCTGACGC TGCCGGCCTA CGGCTTCTAC TGGTTCAAGC TGGCGCAGGG CCACGCCGAG GCGGCGCCGC GCCAGGAGGC GCCCGAACTC TTCACCCTCG TGCTCACCGG GGGCGTCGAG ACCCTGATCC AGGGCCGCGA GCGGCAGGCC TTCGAGCGCA CGGTGGCGCC GCGCTTCATC GGCTCGCGGC GCTGGTTCGG GGCCAAGGGC 
TCGCGCATCC GGCAGGTGCA GGTCGTCGAC AGCGCCGTCC TGCCCGCCCG CTCGGGCCGG AGCGGCTACC TGCTGCCGCG CCTGTCCGTG TCGCTCTCGA GCGGCGAGCG CCAGGAATAC TTCACGCCGC TCGCGGTGGA CGAGGGCCGG GAGGACGAGG CGCTCCTCGA CCACGCCGTC GCGCGGGTGC GGCGCGGGCC GCGGATGGGC CTCCTGTACG GGGCGGCCTC CTCGCCGGAC TTCGCCCTCG CGGTCGTCGA CGGGATGCGG GAGGGCCGCG ACCTGCCCTC CGAGGAGGGG CGGCTCGAAT TCCGCGCCAC CTCGCTCTTC GACCCGGATC TCGACCTCGA CCCGGCCGAC ATCCGCCGGC TCTCGGCCGA GCAGAGCAAC ACCTCGATCG CCTTCGGCTC GCGGCTGATG CTGAAGCTCC TGCGCCGCCT CCAGACCGGG ACCCATCCCG AGGTCGAGGT CGGCCGCTTC CTCACCGAGG TGACGGGGTT CCGCAACACG CCGGCGCTCC TCGGCACGGT CGAGCATGTC GGCCGGGACG GCACCCGCAC GGCGCTCGCC CTGCTGCAGG CCTTCGTGCG CAACCAGGGC GACGCGTGGG CGCTGATGCG CGAGTACCTG CGCCGCGACC TCGACGCGAT CGTGCTCGTG CCCGAGAGCG AGGCGCAGGC GCCCGAGGAG GTCTTCGGCA CCCACCTGCG CTGGGCGAGC CTGCTCGGCC AGCGCACCGC CGAGATGCAC CGCGCCTTCG CGATGGAGAC CGACGACCCG GCCTTCGCGG CCGAGCCCTT CACGGCGGAG GACCTCGCCG CCCTCGTCGC CGATACCCGC CGCCAGGGCG AGAAGGCGAT GCGCGGCGTC GCGGGCATCC CCGCCACGGC CTCCGCGAGC GCCCGCGAGG CGGCCGCGGC GATCCTCGCC GCCCGCGAGG AGATCGAGGC GCTGATCACC CGGCTCGGGC GCCTCGATCC GGTCGGGGCC CACAAGACCC GCATCCACGG CGATTACCAT CTCGGCCAGG TGCTCGCCTC CCAGGACGAC CTCATCATCG TCGACTTCGA GGGCGAGCCG TCCCGGCCGG TCGAGGAGCG GCGGGCCAAG GCGACGCCGC TGCGCGACGT CGCCGGCGTG CTGCGCTCCT TCGCGTATGG CGGCGAGACG GTGACCCGGG AGATCGTCTC CCGCTTCGCC GAGGCCGAGG ACCGCACGGT CGCGGCGGTC GCGGCCTGGC GCGGCCTGAT CGAGGGCGCC TTCCTGGAGG CCTACGGGCA CACCGTGCGC GGCAGCCGGG CCGCCGTCGC GGACGACGTC ACTCACGAGC GCCTGCTGCG CCTCTGCCTG CTGAACAAGG CGCTCTACGA GATCGACTAC GAGGCCAACA ACCGCCCGGA CTGGATCGAG ATTCCGGCCC GCGGCGTGCT CTCCCTGCTG GACCAGATGA GAAAGGTGCC CGAATGA
|
Protein sequence | MIDRSDPQWY RDAIIYQVHV KSFFDANNDG IGDFDGLTAK LDYIRDLGVT AIWVMPFYPS PLRDDGYDIA DYKGINPSYG TMRDFRRFVR EAHERGLRVI TELVINHTSD QHPWFQRARS APKGSKWRDF YVWSDTDEKY RDTRIIFLDT EASNWTWDPV AKAYYWHRFY SHQPDLNFDN PRVLEAVIEV MRYWLDMGVD GLRLDAIPYL IEREDTNCEN LSETHDVIKK IRAALDAGYP DRMLLAEANQ WPEETAQYFG DGDECHMAFH FPLMPRMYMA IAQEDRHPIT DIMRQTPEIP EGCQWAIFLR NHDELTLEMV TDKERDYLWS FYAADRRARI NLGIRRRLAP LLENDRRKIE LMKFLLLSMP GTPVLYYGDE IGMGDNIYLG DRDGVRTPMQ WSPDRNGGFS RADPARLFLP TIQDPIYGFD AVNVEAHSRA QTSLLNWTRR MIAIRNNHRS LGRGTLRFLY PDNRKVLAWL REFDDEKVLC VANLSRAPQA VQLDLSELRT AVPVELTGGT SFPPIGDLPY LLTLPAYGFY WFKLAQGHAE AAPRQEAPEL FTLVLTGGVE TLIQGRERQA FERTVAPRFI GSRRWFGAKG SRIRQVQVVD SAVLPARSGR SGYLLPRLSV SLSSGERQEY FTPLAVDEGR EDEALLDHAV ARVRRGPRMG LLYGAASSPD FALAVVDGMR EGRDLPSEEG RLEFRATSLF DPDLDLDPAD IRRLSAEQSN TSIAFGSRLM LKLLRRLQTG THPEVEVGRF LTEVTGFRNT PALLGTVEHV GRDGTRTALA LLQAFVRNQG DAWALMREYL RRDLDAIVLV PESEAQAPEE VFGTHLRWAS LLGQRTAEMH RAFAMETDDP AFAAEPFTAE DLAALVADTR RQGEKAMRGV AGIPATASAS AREAAAAILA AREEIEALIT RLGRLDPVGA HKTRIHGDYH LGQVLASQDD LIIVDFEGEP SRPVEERRAK ATPLRDVAGV LRSFAYGGET VTREIVSRFA EAEDRTVAAV AAWRGLIEGA FLEAYGHTVR GSRAAVADDV THERLLRLCL LNKALYEIDY EANNRPDWIE IPARGVLSLL DQMRKVPE
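Both sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the standard assignments are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
start = clean_sequence("ATGATCGATC GCAGCGATCC GCAATGGTAC")
print(translate(start))  # MIDRSDPQWY
```

The printed residues match the first block of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare with the GC content field.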