Gene Mpe_A2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2394 
Symbol 
ID4784290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2557291 
End bp2560179 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content68% 
IMG OID640090964 
Producthypothetical protein 
Protein accessionYP_001021584 
Protein GI124267580 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTGGT CGCTGCCGTG GTCACGCAAG CCCGACGCAT CGCCTGCTGA CGCCGTAGAT 
GCGGCCGATG ATGCCTGGGC ACGGCACGTA ACGGCGCTGG CGGTACAGGG TGTCGCCGAG
CCGGGCAGCG CGCTCGGCCG GGGCCGGCGC AGACCGGCCA CCCAGGCCGA CCACGATGCG
CTCTATGGCG TCGCGCCGTC GTTCGCGGAC TTGCTGCCCT GGGTCGAGTA CCTGCCCGAC
ACCAAGTGCA TGTTGCTGGA AGACGGCCAG TCGGTGGCGG CCTTCTTCGA GCTGGCGCCG
GTCGGCACCG AGGGCCGCGA GATGGCCTGG CTGTGGCAGG CGCGCGATGC GCTGGAGAAC
GCCCTGCAGG ATTCCTTCGA CGAGTTGGAC GACAACCCCT GGGTGGTGCA GCTCTACGCC
CAGGACGAGG CCGACTGGGA CAACTATCGG CGCTCCCTGG CGAACTATCT GCAGCCGCGT
GCACAGGGCA GCGCGTTCAG CGACTTCTAC CTGCGCTTCT TCGCCCATCA CCTGCGGGCC
ATCGCCAAGC CGGGTGGCCT GTTCGAGGAC ACCACGGTGA CGCGCCTGCC GTGGCGCGGC
CAAGTCCGGC GCGTGCGCAT GGTGGTCTAC CGCCGCACGT CCGCGGCCCA GACCTCGCGG
CGCGGCCAGT CGCCCGAGCA GGCGCTGACC ACGATCTGCG ACCGCCTCGC CGGCGGGCTG
GCCAATGCCG GCGTGAAAGC CCGGCGTCTC GGCGCGGCGG ACATCCATGC CTGGCTCCTG
CGTTGGTTCA ACCCGAATCC GACCTTGCTC GGCGCCACTG CCGAAGATCG GGAACGCTTC
TATGCGCTGA GCCGCTACCC GGAAGAGCGG GAGGAGGGCG AGATCGAACT CGCCAGCGGC
ACCGATTTCG CGCAGCGCCT GTTCTTCGGC CAGCCCCGCT CGGACGTGCC CAACGGCCTG
TGGTTCTTCG ACGGCATGCC GCATCGGGTG ATCGTGATGG ATCGCCTGCG CACGCCGCCC
GTGACGGGCC ATCTGACGGG CGAGACGCGC AAAGGCGGCG ATGCCATGAA CGCGCTGTTC
GACCAGATGC CCGAGGACAC GGTGATGTGC CTGACGCTGG TCGCCACACC CCAGGACGTG
CTGGAGGCGC ACCTCAACCA CCTCGCCAGG AAGGCCGTCG GCGAGACCCT GGCCTCGGAG
CAGACCCGGC AGGACGTGCA GCAGGCGCGC GGGCTGATCG GCAGCGCGCA CAAGCTCTAC
CGTGGCGCGC TGGCGTTCTA CCTGCGCGGC CGCGACCTGG CCCAGCTCGA TGCGCGCGGC
CTGCAGCTCG TCAACGTGAT GCTCAACGCC GGCCTGCAAC CGGTACGCGA AGAGGACGAG
GTGGCGCCGC TGAACAGCTA TCTACGCTGG CTGCCGTGCG TGTTCGATCC GGCGGCCGAC
AAGCGCCAGT GGTACACCCA GCTCATGTTC GCGCAACATG CGGCGAACCT GGCGCCGGTC
TGGGGCCGCA GTCAGGGCAC GGGGCATCCG GGCATCACGT TCTTCAACCG CGGCGGCGGC
CCGATCACCT TCGATCCGTT GAACCGCCTC GACCGGCAGA TGAACGCGCA TCTATTCCTG
TTCGGCCCCA CGGGTTCGGG CAAGAGCGCG ACGCTCAACA ACATCCTGAA CCAGGTGACG
GCGATCTACC GGCCGCGCCT GTTCATCGTC GAGGCGGGCA ACAGCTTCGG CCTGTTCGGC
GACTTCGCGG CACGGCTGGG CCTCACCGTG CATCGGGTGA AGCTCGCGCC GGGCGCGGGC
GTCAGTCTGG CTCCGTTCGC CGACGCCTGG CGCCTGGTCG ATACGCCGAG CCAGGTACAG
ACGCTGGACG CCGATGCGCT CGACGAAGAC CAGACCGATG CCGGCATGGC CGTGGAAGGC
GACGAGCAGC GCGACGTGCT CGGCGAGCTG GAGATCACTG CACGGCTGAT GATTACCGGC
GGCGAGGACA AGGAAGAAGC GCGCATGACG CGCGCCGACC GCAGCCTGAT CCGCCAGTGC
ATTCTCGATG CGGCCCAGCA TTGCGTGGCG GACAGACGCA CGGTGCTCAC GCGCGATGTG
CGCGACGCGC TGCGCGAGCG CGCCCGCGAC GCCACGCTGC CGGAGATGCG GCGCGCACGG
CTGCTGGAGA TGGCCGACGC CATGGATATG TTCTGCCAGG ACGTGGACGG CGAGATGTTC
GACCGGTCCG GCACGCCGTG GCCCGAGGCG GACATCACCA TCGTGGACCT GGCCACCTTC
GCGCGCGAGG GCTACAACGC CCAACTCTCG ATTGCCTACA TCTCGCTCAT CAACACCGTC
AACAACATCG CCGAGCGCGA CCAGTTCCTG GGCCGTCCGA TCATCAACGT GACGGACGAA
GGCCACATCA TCACGAAGAA CCCGCTGCTC GCGCCCTACG TGGTCAAGAT CACCAAGATG
TGGCGCAAGC TCGGCGCCTG GTTCTGGCTC GCCACGCAGA ACCTCGACGA CTTGCCGAAG
GCGGCCGAGC CCATGCTCAA CATGATCGAG TGGTGGATCT GCCTGTCGAT GCCACCCGAT
GAAGTGGAGA AGATCGCGCG CTTCCGCGAA CTCAACGCTT CGCAGAAGGC GCTGATGCTC
TCGGCGCGCA AGGAGGCCGG CAAGTTCAGC GAGGGCGTCA TCCTGTCCAA GTCGATGGAG
GTGCTGTTCC GCGCCGTGCC GCCCAGCCTC TACCTGGCGA TGGCGATGAC CGAGCCCGAG
GAGAAGGCCG AACGCTTCCA GTTGATGCAG CAGCACGGCA TCAGCGAACT GGATGCCGCC
TTCCGCGTGG CCGAGAAGAT CGACCGCGCG CGGGGCATCG AACCTCTGAC GCTGGACACG
CTGGCCTGA
 
Protein sequence
MAWSLPWSRK PDASPADAVD AADDAWARHV TALAVQGVAE PGSALGRGRR RPATQADHDA 
LYGVAPSFAD LLPWVEYLPD TKCMLLEDGQ SVAAFFELAP VGTEGREMAW LWQARDALEN
ALQDSFDELD DNPWVVQLYA QDEADWDNYR RSLANYLQPR AQGSAFSDFY LRFFAHHLRA
IAKPGGLFED TTVTRLPWRG QVRRVRMVVY RRTSAAQTSR RGQSPEQALT TICDRLAGGL
ANAGVKARRL GAADIHAWLL RWFNPNPTLL GATAEDRERF YALSRYPEER EEGEIELASG
TDFAQRLFFG QPRSDVPNGL WFFDGMPHRV IVMDRLRTPP VTGHLTGETR KGGDAMNALF
DQMPEDTVMC LTLVATPQDV LEAHLNHLAR KAVGETLASE QTRQDVQQAR GLIGSAHKLY
RGALAFYLRG RDLAQLDARG LQLVNVMLNA GLQPVREEDE VAPLNSYLRW LPCVFDPAAD
KRQWYTQLMF AQHAANLAPV WGRSQGTGHP GITFFNRGGG PITFDPLNRL DRQMNAHLFL
FGPTGSGKSA TLNNILNQVT AIYRPRLFIV EAGNSFGLFG DFAARLGLTV HRVKLAPGAG
VSLAPFADAW RLVDTPSQVQ TLDADALDED QTDAGMAVEG DEQRDVLGEL EITARLMITG
GEDKEEARMT RADRSLIRQC ILDAAQHCVA DRRTVLTRDV RDALRERARD ATLPEMRRAR
LLEMADAMDM FCQDVDGEMF DRSGTPWPEA DITIVDLATF AREGYNAQLS IAYISLINTV
NNIAERDQFL GRPIINVTDE GHIITKNPLL APYVVKITKM WRKLGAWFWL ATQNLDDLPK
AAEPMLNMIE WWICLSMPPD EVEKIARFRE LNASQKALML SARKEAGKFS EGVILSKSME
VLFRAVPPSL YLAMAMTEPE EKAERFQLMQ QHGISELDAA FRVAEKIDRA RGIEPLTLDT
LA