Gene M446_6807 details

Gene Information

Locus tag: M446_6807
Symbol:
ID: 6134384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 7491977
End bp: 7495243
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 70%
IMG OID: 641646888
Product: trehalose synthase
Protein accession: YP_001773486
Protein GI: 170744831
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
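
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(7491977, 7495243)
print(length)                     # 3267
print(protein_length_aa(length))  # 1088
```

Both results agree with the Gene Length and Protein Length fields of the record.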


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.267695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.5618
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATC GCAGCGATCC GCAATGGTAC CGTGACGCCA TCATCTACCA GGTCCACGTC 
AAATCCTTCT TCGACGCCAA CAACGACGGC ATCGGCGACT TCGACGGCCT GACCGCCAAG
CTCGACTACA TCCGCGACCT CGGGGTGACG GCGATCTGGG TGATGCCGTT CTACCCCTCG
CCGCTGCGGG ACGACGGCTA CGACATCGCC GACTACAAGG GCATCAACCC GTCCTACGGC
ACGATGCGCG ATTTCCGGCG CTTCGTCCGC GAGGCGCACG AGCGCGGCCT GCGCGTCATC
ACCGAGCTCG TCATCAACCA CACCTCGGAC CAGCACCCCT GGTTCCAGCG CGCCCGCAGC
GCCCCCAAGG GCTCCAAGTG GCGCGACTTC TACGTCTGGT CCGACACGGA CGAGAAGTAC
CGCGACACGC GCATCATCTT CCTCGACACC GAGGCCTCGA ACTGGACCTG GGACCCGGTC
GCCAAGGCCT ATTACTGGCA CCGCTTCTAC TCGCACCAGC CGGATCTGAA CTTCGACAAT
CCGCGCGTGC TGGAGGCGGT GATCGAGGTG ATGCGCTACT GGCTCGACAT GGGGGTGGAC
GGCCTGCGCC TCGACGCGAT CCCCTACCTG ATCGAGCGCG AGGACACGAA CTGCGAGAAC
CTCTCCGAGA CGCACGACGT CATCAAGAAG ATCCGCGCCG CCCTCGACGC CGGCTACCCG
GACCGCATGC TGCTCGCCGA GGCCAACCAG TGGCCCGAGG AGACCGCGCA GTACTTCGGC
GACGGGGACG AGTGCCACAT GGCGTTCCAC TTTCCCCTGA TGCCGCGGAT GTACATGGCG
ATCGCGCAGG AGGACCGGCA CCCGATCACC GATATCATGC GGCAGACGCC GGAGATCCCG
GAGGGCTGCC AATGGGCGAT CTTCCTGCGC AACCACGACG AGCTGACGCT CGAGATGGTC
ACCGACAAGG AGCGGGACTA CCTCTGGAGC TTCTACGCCG CCGACCGCCG CGCCCGCATC
AACCTCGGCA TCCGCCGCCG CCTCGCGCCC CTGCTGGAGA ACGATCGGCG CAAGATCGAG
TTGATGAAGT TCCTGCTGCT GTCGATGCCC GGGACCCCGG TGCTCTATTA CGGCGACGAG
ATCGGGATGG GCGACAACAT CTACCTGGGC GACCGCGACG GGGTGCGCAC CCCGATGCAA
TGGTCGCCCG ACCGGAACGG CGGCTTCTCC CGCGCCGACC CGGCGCGCCT GTTCCTGCCC
ACCATCCAGG ACCCGATCTA CGGCTTCGAC GCCGTCAACG TCGAGGCCCA TAGCCGCGCC
CAGACCAGCC TGCTCAACTG GACGCGGCGG ATGATCGCCA TCCGCAACAA CCACCGCTCG
CTCGGCCGCG GCACGCTGCG CTTCCTCTAC CCGGACAACC GCAAGGTGCT GGCCTGGCTG
CGTGAGTTCG ACGACGAGAA GGTGCTCTGC GTCGCCAACC TTTCGCGGGC GCCGCAGGCG
GTGCAGCTCG ACCTCTCGGA GCTGCGCACG GCGGTGCCGG TGGAGCTCAC GGGCGGCACC
TCCTTCCCGC CGATCGGCGA CCTGCCCTAC CTGCTGACGC TGCCGGCCTA CGGCTTCTAC
TGGTTCAAGC TGGCGCAGGG CCACGCCGAG GCGGCGCCGC GCCAGGAGGC GCCCGAACTC
TTCACCCTCG TGCTCACCGG GGGCGTCGAG ACCCTGATCC AGGGCCGCGA GCGGCAGGCC
TTCGAGCGCA CGGTGGCGCC GCGCTTCATC GGCTCGCGGC GCTGGTTCGG GGCCAAGGGC
TCGCGCATCC GGCAGGTGCA GGTCGTCGAC AGCGCCGTCC TGCCCGCCCG CTCGGGCCGG
AGCGGCTACC TGCTGCCGCG CCTGTCCGTG TCGCTCTCGA GCGGCGAGCG CCAGGAATAC
TTCACGCCGC TCGCGGTGGA CGAGGGCCGG GAGGACGAGG CGCTCCTCGA CCACGCCGTC
GCGCGGGTGC GGCGCGGGCC GCGGATGGGC CTCCTGTACG GGGCGGCCTC CTCGCCGGAC
TTCGCCCTCG CGGTCGTCGA CGGGATGCGG GAGGGCCGCG ACCTGCCCTC CGAGGAGGGG
CGGCTCGAAT TCCGCGCCAC CTCGCTCTTC GACCCGGATC TCGACCTCGA CCCGGCCGAC
ATCCGCCGGC TCTCGGCCGA GCAGAGCAAC ACCTCGATCG CCTTCGGCTC GCGGCTGATG
CTGAAGCTCC TGCGCCGCCT CCAGACCGGG ACCCATCCCG AGGTCGAGGT CGGCCGCTTC
CTCACCGAGG TGACGGGGTT CCGCAACACG CCGGCGCTCC TCGGCACGGT CGAGCATGTC
GGCCGGGACG GCACCCGCAC GGCGCTCGCC CTGCTGCAGG CCTTCGTGCG CAACCAGGGC
GACGCGTGGG CGCTGATGCG CGAGTACCTG CGCCGCGACC TCGACGCGAT CGTGCTCGTG
CCCGAGAGCG AGGCGCAGGC GCCCGAGGAG GTCTTCGGCA CCCACCTGCG CTGGGCGAGC
CTGCTCGGCC AGCGCACCGC CGAGATGCAC CGCGCCTTCG CGATGGAGAC CGACGACCCG
GCCTTCGCGG CCGAGCCCTT CACGGCGGAG GACCTCGCCG CCCTCGTCGC CGATACCCGC
CGCCAGGGCG AGAAGGCGAT GCGCGGCGTC GCGGGCATCC CCGCCACGGC CTCCGCGAGC
GCCCGCGAGG CGGCCGCGGC GATCCTCGCC GCCCGCGAGG AGATCGAGGC GCTGATCACC
CGGCTCGGGC GCCTCGATCC GGTCGGGGCC CACAAGACCC GCATCCACGG CGATTACCAT
CTCGGCCAGG TGCTCGCCTC CCAGGACGAC CTCATCATCG TCGACTTCGA GGGCGAGCCG
TCCCGGCCGG TCGAGGAGCG GCGGGCCAAG GCGACGCCGC TGCGCGACGT CGCCGGCGTG
CTGCGCTCCT TCGCGTATGG CGGCGAGACG GTGACCCGGG AGATCGTCTC CCGCTTCGCC
GAGGCCGAGG ACCGCACGGT CGCGGCGGTC GCGGCCTGGC GCGGCCTGAT CGAGGGCGCC
TTCCTGGAGG CCTACGGGCA CACCGTGCGC GGCAGCCGGG CCGCCGTCGC GGACGACGTC
ACTCACGAGC GCCTGCTGCG CCTCTGCCTG CTGAACAAGG CGCTCTACGA GATCGACTAC
GAGGCCAACA ACCGCCCGGA CTGGATCGAG ATTCCGGCCC GCGGCGTGCT CTCCCTGCTG
GACCAGATGA GAAAGGTGCC CGAATGA
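
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the helper names below being hypothetical, a GC percentage like the one in the record could be computed as follows. The snippet only demonstrates the method on the final line of the sequence; it does not reproduce the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Final line of the gene sequence above, used here only as a short input.
last_line = "GACCAGATGA GAAAGGTGCC CGAATGA"
seq = clean_sequence(last_line)
print(len(seq))  # 27
print(round(gc_percent(seq), 1))
```

Passing the full 3267 bp sequence through `clean_sequence` and `gc_percent` would give the figure to compare against the GC content field.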
 
Protein sequence
MIDRSDPQWY RDAIIYQVHV KSFFDANNDG IGDFDGLTAK LDYIRDLGVT AIWVMPFYPS 
PLRDDGYDIA DYKGINPSYG TMRDFRRFVR EAHERGLRVI TELVINHTSD QHPWFQRARS
APKGSKWRDF YVWSDTDEKY RDTRIIFLDT EASNWTWDPV AKAYYWHRFY SHQPDLNFDN
PRVLEAVIEV MRYWLDMGVD GLRLDAIPYL IEREDTNCEN LSETHDVIKK IRAALDAGYP
DRMLLAEANQ WPEETAQYFG DGDECHMAFH FPLMPRMYMA IAQEDRHPIT DIMRQTPEIP
EGCQWAIFLR NHDELTLEMV TDKERDYLWS FYAADRRARI NLGIRRRLAP LLENDRRKIE
LMKFLLLSMP GTPVLYYGDE IGMGDNIYLG DRDGVRTPMQ WSPDRNGGFS RADPARLFLP
TIQDPIYGFD AVNVEAHSRA QTSLLNWTRR MIAIRNNHRS LGRGTLRFLY PDNRKVLAWL
REFDDEKVLC VANLSRAPQA VQLDLSELRT AVPVELTGGT SFPPIGDLPY LLTLPAYGFY
WFKLAQGHAE AAPRQEAPEL FTLVLTGGVE TLIQGRERQA FERTVAPRFI GSRRWFGAKG
SRIRQVQVVD SAVLPARSGR SGYLLPRLSV SLSSGERQEY FTPLAVDEGR EDEALLDHAV
ARVRRGPRMG LLYGAASSPD FALAVVDGMR EGRDLPSEEG RLEFRATSLF DPDLDLDPAD
IRRLSAEQSN TSIAFGSRLM LKLLRRLQTG THPEVEVGRF LTEVTGFRNT PALLGTVEHV
GRDGTRTALA LLQAFVRNQG DAWALMREYL RRDLDAIVLV PESEAQAPEE VFGTHLRWAS
LLGQRTAEMH RAFAMETDDP AFAAEPFTAE DLAALVADTR RQGEKAMRGV AGIPATASAS
AREAAAAILA AREEIEALIT RLGRLDPVGA HKTRIHGDYH LGQVLASQDD LIIVDFEGEP
SRPVEERRAK ATPLRDVAGV LRSFAYGGET VTREIVSRFA EAEDRTVAAV AAWRGLIEGA
FLEAYGHTVR GSRAAVADDV THERLLRLCL LNKALYEIDY EANNRPDWIE IPARGVLSLL
DQMRKVPE
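
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is an illustration under stated assumptions: the amino acid assignments are those of the standard code (which table 11 shares), alternative start codons are not handled because this gene begins with ATG, and the names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna_text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
first_line = "ATGATCGATC GCAGCGATCC GCAATGGTAC CGTGACGCCA TCATCTACCA GGTCCACGTC"
last_line = "GACCAGATGA GAAAGGTGCC CGAATGA"
print(translate_cds(first_line))  # MIDRSDPQWYRDAIIYQVHV
print(translate_cds(last_line))   # DQMRKVPE
```

The two outputs match the first twenty residues and the final eight residues of the protein sequence listed above.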