Gene Mpe_A3790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3790 
Symbol 
ID4785959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp4007497 
End bp4008888 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content69% 
IMG OID640092373 
ProductAlpha,alpha-trehalose-phosphate synthase (UDP-forming) 
Protein accessionYP_001022978 
Protein GI124268974 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000285165 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGTCTC GGTTGGTGGT GGTGTCCAAT CGGATCGGCG ACCCGCGCAA GGCGGCCGCC 
GGCGGGCTGG CCGTCGCATT GGGCGATGCG CTCTCCGAGA GCGGGGGCCT GTGGTTCGGT
TGGAGCGGCA AGGTGGTGGC CGGCGGCGCT CGCGGCGAGG GCGATCTGCA CCTGCAGCCG
TCGGGGAAGG TCACGCTCGC GACGCTCGAT CTCGGCAGCG ACGACCACGC GGCCTATTAC
GCCGGCTACT CGAACCGGGT GCTGTGGCCC GTGTTCCACT ACCGGCTCGA TCTGGCGCAG
TTCGACGACG GGTATTTCGA AGGCTATCAG CGCGTCAACC GGCTGTTCGC CCGCAAGCTG
AGCACGCTGC TGAAGCCGGA CGACATCATC TGGGTGCACG ACTACCACCT GATCCCGCTG
GCGGCCGAGT TGCGCGCTCT CGGCTGCCGT CAGCGCATCG GCTTCTTCCT GCACATCCCG
ATGCCGCCGC CGCTGGTGAT GGCGGCAATC CCGGCGCACG ACATGCTGAT GCGCTCGCTG
TTCGCCTATG ACCTGATCGG GCTGCAGAGC GAGGCCGACG TGGCCCACTT CGCCCGCTAT
GTCGAGATGG AGGCTGGGGC CGAGCGCCTG GGCCGTGACC AGTACCGCGC CTTCGGCCAA
CAGGTTTGTG CGCGGGCCTT CCCGATCGGC ATCGACGTCG ACGAGTTCCA GGCCCTGGCG
CAGACGCCGG AGTCGATCGA GACGCGCGAG ACACTGCGCA GCCAGTACCC GCTGCGTCAG
CTGTTGATCG GGGTCGACCG GCTCGATTAC TCCAAGGGCC TGCCGCAGCG CCTGCGCGCT
TTCCATCGGC TGCTGGCCGA CTACCCCGAG AATCGCAACA GCGCGACCTT GATCCAGGTG
GCCACACCGA CGCGCGAGGG CGTGGAGAGC TACGAGGACA TCCGCCGCGA GCTCGAGGGT
CTGTCGGGGC AGATCAACGG CGAGTACGGC GAACTCGACT GGATGCCGGT GCGCTACATC
CACCGCACGC TGGCGCGGCG ACGGCTGCCG GGGCTGTACC GCGCCGGGCG CGTGGCCCTG
GTCACGCCGC TGCGCGACGG CATGAACCTA GTGGCCAAGG AGTTCGTCGC GGCGCAGGAC
GCTGCCGACC CCGGCGTGCT GGTGCTGTCC CGCTTTGCCG GTGCGGCCGA GCAGATGCGC
GCGGCCCTGC TGGTCAATCC ATACGACATC GGCGCCACCG CGGGTGCGGT GCAGCGCGCG
CTGCGGATGC CGCTGTCCGA GCGGGTGGAG CGGCATCAGG CCTTGCTCGC AGGTGTTCGC
GAGCACGACG TGCACCGTTG GCGACGCGAA TTCCTGCAGG CCTTGCAGGC CGCCGAACGT
GCGCCTGGGT GA
 
Protein sequence
MTSRLVVVSN RIGDPRKAAA GGLAVALGDA LSESGGLWFG WSGKVVAGGA RGEGDLHLQP 
SGKVTLATLD LGSDDHAAYY AGYSNRVLWP VFHYRLDLAQ FDDGYFEGYQ RVNRLFARKL
STLLKPDDII WVHDYHLIPL AAELRALGCR QRIGFFLHIP MPPPLVMAAI PAHDMLMRSL
FAYDLIGLQS EADVAHFARY VEMEAGAERL GRDQYRAFGQ QVCARAFPIG IDVDEFQALA
QTPESIETRE TLRSQYPLRQ LLIGVDRLDY SKGLPQRLRA FHRLLADYPE NRNSATLIQV
ATPTREGVES YEDIRRELEG LSGQINGEYG ELDWMPVRYI HRTLARRRLP GLYRAGRVAL
VTPLRDGMNL VAKEFVAAQD AADPGVLVLS RFAGAAEQMR AALLVNPYDI GATAGAVQRA
LRMPLSERVE RHQALLAGVR EHDVHRWRRE FLQALQAAER APG