Gene Mpe_A3365 details

Gene Information

Locus tag: Mpe_A3365
Symbol:
ID: 4786407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3575486
End bp: 3576604
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 65%
IMG OID: 640091939
Product: hypothetical protein
Protein accession: YP_001022553
Protein GI: 124268549
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
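
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete and ends in a stop codon that is not counted in the protein length. The function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 3575486
END_BP = 3576604
REPORTED_GENE_LENGTH = 1119      # bp
REPORTED_PROTEIN_LENGTH = 372    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the untranslated stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the reported 1119 bp and 372 aa agree with the coordinates: 373 codons, of which the last is the stop codon.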


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.283378
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA AGACCGCCCC GCAACGTCGT CGCTTCCTCA AGAAGGCTTC CGCCGGTGCC 
GTCGGCGCCA CCGCGATGGC CGCCCCGATG GTGTCGGTCG CGCAGACCAC CGCGCTGCGC
TTCCAGAGCA CCTGGCCCTC GAAGGACATC TTCCACGAGT ACGCGCAGGA CTTCGCCACC
AAGGTCAACA ACATGGCCGG TGGCCGGCTG AAGATCGAGG TGCTGCCGGC GGGTTCGGTG
GTGCCGGCCT TCCAGCTGCT GGAAGCTGTC AACAAGGGCA CGCTCGACGG CGGTCACGGC
GTCGTGGCCT ACCACTACGG CAAGAACTCG GCGCTGGCGC TGTGGGGTTC GGGCCCGTCC
TTCGGCATGG ACCCCAACAT GCTGCTGGCC TGGCACAACT ACGGCGGCGG CAAGGAACTG
CTGGCCGAGA TCTACAAGAG CCTGAACATG GACGTCGTGT CCTACCTGTA TGGCCCGATG
CCGACGCAGC CCTTCGGCTG GTTCAAGAAG CCGATCGGCA AGCTCGAGGA CATCAAGGGC
ACCAAGTTCC GCACCGTCGG CCTGGCGGTG GACATGTACA CCGACATGGG CGCCGCGGTG
AACCCGCTGC CGGGTGGCGA GATCGTGCCG GCGCTGGACC GCGGCCTGAT CGACGGTGCC
GAGTTCAACA ACGCCAGCTC CGACCGCCTG CTCGGCTTCC CCGACGTGGT GAAGAACTGC
ATGCTGCAGA GCTTCCACCA GAGCGGCGAG CAGTTCGAGA TCCTGTTCAA CAAGGGCAAG
CTCGACGCGC TGCCGGCCGA GCTGAAGGCG ATCGTCGACT ACGGCGTGCA GGCCGCCAGC
GCCGACATGA GCTGGAAGGC CGCGCACCGC AATTCGCTCG ACTACGGCGA GCTGAAGAAG
GCCGGCGTGA AGTTCTACAA GACGCCCGAC GCGATCCTGC GCGCGCAGCT CGCTGCCTGG
GACAAGATCA TCGCCAAGAA GGGCGGCGAG AACCCGCTGT TCCAGAAGGT GATCGATTCG
CAGAAAGCCT TCGCCGCGCG CGCCGGTCAA TGGTGGAACG ATTACACGGT TGACTTCAAG
ATGGCCTACA ACCATTATTT CGGCGCCAAG AAGGCCTGA
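
The GC content field could be recomputed from the gene sequence along the following lines. This is a sketch only: `gc_percent` is a hypothetical helper, it assumes the sequence contains only A, C, G, and T plus the whitespace used to group bases in tens, and it is shown here on the first row of the sequence rather than on all 1119 bases.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases in blocks of 10).
first_row = "ATGACCCAGA AGACCGCCCC GCAACGTCGT CGCTTCCTCA AGAAGGCTTC CGCCGGTGCC"
print(f"GC content of the first 60 bases: {gc_percent(first_row):.1f}%")
```

Passing the full gene sequence instead of `first_row` gives the figure to compare against the rounded value in the record.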
 
Protein sequence
MTQKTAPQRR RFLKKASAGA VGATAMAAPM VSVAQTTALR FQSTWPSKDI FHEYAQDFAT 
KVNNMAGGRL KIEVLPAGSV VPAFQLLEAV NKGTLDGGHG VVAYHYGKNS ALALWGSGPS
FGMDPNMLLA WHNYGGGKEL LAEIYKSLNM DVVSYLYGPM PTQPFGWFKK PIGKLEDIKG
TKFRTVGLAV DMYTDMGAAV NPLPGGEIVP ALDRGLIDGA EFNNASSDRL LGFPDVVKNC
MLQSFHQSGE QFEILFNKGK LDALPAELKA IVDYGVQAAS ADMSWKAAHR NSLDYGELKK
AGVKFYKTPD AILRAQLAAW DKIIAKKGGE NPLFQKVIDS QKAFAARAGQ WWNDYTVDFK
MAYNHYFGAK KA
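
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first and last codons only. It assumes the gene sequence is given 5' to 3' on the coding strand, builds the codon table from the standard NCBI amino-acid string (table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons), and uses a hypothetical `translate` helper that does not model alternative start codons.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G at each position).
# The amino-acid assignments are identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 60 bases and last 9 bases of the gene sequence above.
first_row = "ATGACCCAGA AGACCGCCCC GCAACGTCGT CGCTTCCTCA AGAAGGCTTC CGCCGGTGCC"
last_codons = "AAGGCCTGA"

print(translate(first_row))    # MTQKTAPQRRRFLKKASAGA
print(translate(last_codons))  # KA
```

The two outputs match the first 20 and the last 2 residues of the protein sequence listed above.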