Gene Mext_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1550 
Symbol 
ID5832489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1731040 
End bp1732464 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content67% 
IMG OID641367348 
ProductAlpha,alpha-trehalose-phosphate synthase (UDP-forming) 
Protein accessionYP_001639020 
Protein GI163850977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGTC TGATCATCGT CTCCAACCGT GTCGCCGTAC CCGCCGAGGG TAAGGATGCG 
GTCTCCGCAG GGGGACTCGC CGTCGCGGTC AAGGAAGCTT TCTCCTCCTA CGAGGGGTTG
TGGTTCGGTT GGAGCGGCAA CATCCGCGAC AACCCGAGCA CCGAGCCGGA ACTGATCGAC
CGCGGGTCGA TCCAGTACGC CGTCCTCGAC CTGTCGCCGC AAGACCATCG CGAGTACTAC
GCCGGCTTTG CCAACCGGGC GCTCTGGCCG ATCATGCATT ACCGGATCGG GCTGGGGACG
TTCTCCCGCT CGGATTATGC CGGCTACCAG CGCGTCAACC AGACCTTCGC CCAGGCGCTC
GCCAAATTGG TCGAGCCGGA CGACCTGATC TGGGTGCACG ACTACCACCT GCTGCCGCTG
GCGAGCGAGC TGCGCGGCCA GGGCATCGCC AACCCGATCG GCTACTTCCA CCACATCCCG
TGGCCCGCCG CCGACGTGTT CAACACCCTG CCCGCCAGCA ACGAGCTGCT GCGCGCCATG
GCCGATTACG ACCTAATCGG ATTGCAGACC GATTCGGACG TGCAGAACCT CTCGCGCAAC
TTCATCGACA CGATGCGGGC GATCCCGCTC GGCGGCGGCT CGATGATGGT GGACGGGCGG
CGCACGCGAA TCCGCTCCTT CCCCATCGGC ATCGATGTCG CCAGCTTCAA GGAGGCCGCC
GACAAGGCCG GCTCCAACAA GGTGGTGCGG GAGACCATGG CGGGCCTGCG CACCCGCAAG
CTGCTCATCG GCGTCGATCG GCTCGACTAC TCGAAGGGCG TGCCCGAGCG CATGGAGGCG
GTGGACCGCT TCTTCGCCTC GAATCCGGAT CAGCGCGGCA ACGTCGTCTA CATCCAGATC
ACGCCGAAAT CCCGCAGCGA GGTGCCGGAA TACGAACAGC TCTCGCGCGA GGTGAACGAG
AAGGTCGGCG ACATTAACGG CATGCTCGGC GAGCCGGCCT GGACGCCGAT CCAGTACGTC
ACCAAGGCCT ATCCCCGCCC GGTCCTCGCC GGTCTCTACC GGGCCGCCCG CGTCGGCCTC
GTCACGCCGA TGCGCGACGG CATGAACCTT GTGGCCAAGG AATACGTCGT CGCCCAGAGC
GAGGAGGATC CCGGCGTCCT CGTCCTCTCG AAATTCGCGG GTGCGGCCCG GCAGTTGCCC
GAGGCGCTGC TGGTGAACCC CTACGACCGC TTCGAGGTCG CCGAGGCGAT ACGGCAGGCG
CTCTACATGC CCCGCGGCGA GCGCCTGGAG CGCTGGAAGC CGATGGCGGA CCGCATGCGG
CGCGAAGACG TGGATTGGTG GGCCCGCTGC TTCATGGTGG AGCTGGAGAC CTTCCGCACC
GTCGAGCGCG AGCCGCCGAG CACGACGGCG GCGGCGGCGG AGTAG
 
Protein sequence
MARLIIVSNR VAVPAEGKDA VSAGGLAVAV KEAFSSYEGL WFGWSGNIRD NPSTEPELID 
RGSIQYAVLD LSPQDHREYY AGFANRALWP IMHYRIGLGT FSRSDYAGYQ RVNQTFAQAL
AKLVEPDDLI WVHDYHLLPL ASELRGQGIA NPIGYFHHIP WPAADVFNTL PASNELLRAM
ADYDLIGLQT DSDVQNLSRN FIDTMRAIPL GGGSMMVDGR RTRIRSFPIG IDVASFKEAA
DKAGSNKVVR ETMAGLRTRK LLIGVDRLDY SKGVPERMEA VDRFFASNPD QRGNVVYIQI
TPKSRSEVPE YEQLSREVNE KVGDINGMLG EPAWTPIQYV TKAYPRPVLA GLYRAARVGL
VTPMRDGMNL VAKEYVVAQS EEDPGVLVLS KFAGAARQLP EALLVNPYDR FEVAEAIRQA
LYMPRGERLE RWKPMADRMR REDVDWWARC FMVELETFRT VEREPPSTTA AAAE