Gene Mpe_A0609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0609 
Symbol 
ID4785176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp640725 
End bp642653 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content71% 
IMG OID640089168 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001019806 
Protein GI124265802 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGA CGCCTTCCGT GTGGCACTGG CTCGACGCCC GGCTGAGCCG CGTGCGCCCG 
CATCGCGAGC GGCTGGCGCT GCTGATCGAC GCCGCGGTCA TCGCCGTCTG CTGGCAGTTC
ACCTACCTGT TCCGGCTCGG CTTCGAGCGC TGGTTCAGCG CGCGGCCCGT CTACGACGGC
TGGGTGCTGC TGGGTATCGT GAGCCTCTAC GTGGCGGTGT TCCTGGTGCT GCGCGTTCCG
CGCGGCATGT GGCGCTTCTC CGGCTTCGGC GAGATCAAGC GCCTGACGAT CGCCTGCACG
CTGGCGGGCG GGCTGGCCGC CGCCGCGGTG ATGGGGGCCG AGCTGCGCGC CATCCCGCGC
GCGGTGCTGG CGCTGCACCC CATCGTCGCG CTGATGGGCC TGGCCAGCGT GCGCATCGCC
TACCGCATGC TCTACGAGCA CCTGCGCGCG CGCATCTCCG GCAGCGCCCG CGAGACGCGC
CGGGCGCTGG TGCTCGGCGC GGGCGACGCG GCGCGCCTGC TGCTCGCCGG CCTGCAGCAC
CAGGGCTGGG TGGTGGCGGG CCTGCTCGAC GACCACCCGG CCAAGCAACG CGCCCGCATC
GGCGGCGTGC CGGTGATCGG CCCGCTCGCG AGCGTGGTCG AGCACGTGCG GCTGCTCGAC
ATCAGCCACG TCATCATCGC CATGCCCTCG CTGCGCGGCG CGGCGCGCCG TCGCGTGATC
GATCTTGCAG CCGAGACCGG CCTGCCGGTG CTCACCGTGC CCTCCGCCGA GGAACTGCTG
GAAGGCGCCG CCGTCAGCCG GGTGCGCGAC ATCGAGCCGG AAGACCTGCT GGGCCGTGAG
CCGGTGGTGC TCGACGAAGC CGGCATCTCC GAGTGCCTGA AGGGCAAGTG CGTGATGATC
ACCGGCGCGG GCGGCAGCAT CGGCAGCGAG CTGTGCCGGC AGGTGGCGCG CTACGGGCCG
TCGATGCTGG TGCTGTACGA GCTGAGCGAG TTCAACCTCT ACACCATCGA GCAGTCGCTG
AGCGACAGCT TCCCCGCGCT GCCGCTGGTG CGCCTGATCG GTGACGTGAA GAACGCCGCG
CACCTGCGGC AGGTGATGGC GCGCTGGCGG CCGCAGATCG TGTTCCATGC CGCGGCCTAC
AAGCACGTGC CGCTGATGGA GGAGGAGCAC AACGCCTGGG CGGCGCTGCA GAACAACACG
CTGGGCACTT GGCTGGCGGC CAGCGAGGCG GCGCGAGCCG GGGCGGAGCG CTTCGTGCTC
ATCAGCACCG ACAAGGCGGT GAACCCGACC AACGTGATGG GTGCGACCAA GCGCGCGGCC
GAGATGATGA TCTCGCACCT GGCCTCGCAG GGCCATGCGA CGCGCTTCAT GGCCGTGCGC
TTCGGCAATG TGCTCGGCTC CAGCGGCAGC GTGATCCCCA AGTTCAAGGA ACAGATCGCC
AAGGGCGGCC CGGTGACGGT GACCCATGCC GAGATCACGC GCTATTTCAT GACCATCCCG
GAGGCCGCGA GGCTGGTGGT GCAGGCGGCC GCGATCGGCG AGACCGGGCA GGTGTACGTG
CTGGACATGG GCGAGCCGGT GCGCATCGTC GACCTGGCGC GCGACCTGAT CCGCCTGGCG
GGCCACACGG TCGAGGAGAT CGGCATCGTG TACAGCGGCC TGCGAGCCGG TGAGAAGCTC
TACGAGGAAC TGCTGGCCGA CGCCGACCAC ACGTTGCCCA CGCCGATCGC GCGGTTGCTG
ATCGCGCGCA TCGAGGCCGA CGTGTCGCGT GTGTCGGTGC TGGTCGACCG GGCGCTGCAT
CCCGACAGCG CGACGGCCGA CGAGGTGCGG CGGCAGCTGA TGTGCGCAGT GCCGGAGTAC
CGGGCTGTCG GCGAGACCAT CGCGGCCGCT TCCTCGGGTG ACGTGCCTTT TCCTGTGGAC
CCGGAATAG
 
Protein sequence
MTPTPSVWHW LDARLSRVRP HRERLALLID AAVIAVCWQF TYLFRLGFER WFSARPVYDG 
WVLLGIVSLY VAVFLVLRVP RGMWRFSGFG EIKRLTIACT LAGGLAAAAV MGAELRAIPR
AVLALHPIVA LMGLASVRIA YRMLYEHLRA RISGSARETR RALVLGAGDA ARLLLAGLQH
QGWVVAGLLD DHPAKQRARI GGVPVIGPLA SVVEHVRLLD ISHVIIAMPS LRGAARRRVI
DLAAETGLPV LTVPSAEELL EGAAVSRVRD IEPEDLLGRE PVVLDEAGIS ECLKGKCVMI
TGAGGSIGSE LCRQVARYGP SMLVLYELSE FNLYTIEQSL SDSFPALPLV RLIGDVKNAA
HLRQVMARWR PQIVFHAAAY KHVPLMEEEH NAWAALQNNT LGTWLAASEA ARAGAERFVL
ISTDKAVNPT NVMGATKRAA EMMISHLASQ GHATRFMAVR FGNVLGSSGS VIPKFKEQIA
KGGPVTVTHA EITRYFMTIP EAARLVVQAA AIGETGQVYV LDMGEPVRIV DLARDLIRLA
GHTVEEIGIV YSGLRAGEKL YEELLADADH TLPTPIARLL IARIEADVSR VSVLVDRALH
PDSATADEVR RQLMCAVPEY RAVGETIAAA SSGDVPFPVD PE