Gene Mpe_A3136 details


Gene Information

Locus tag: Mpe_A3136
Symbol: gspD
ID: 4786649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3334966
End bp: 3337176
Gene Length: 2211 bp
Protein Length: 736 aa
Translation table: 11
GC content: 69%
IMG OID: 640091707
Product: general secretion pathway protein D
Protein accession: YP_001022324
Protein GI: 124268320
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1450] Type II secretory pathway, component PulD
TIGRFAM ID: [TIGR02517] general secretion pathway protein D
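
The start, end, and length values listed above are mutually consistent if the coordinates are 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. The page does not state these conventions, so they are assumptions in the minimal sketch below; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3334966
end_bp = 3337176

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2211 736
```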


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.81817
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.003504
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCATGA GCCTGACCAC TTCACCGAAG CCCGCCCGCC GCGCCACGGC GCTGCGCGCG 
CTGGCTGCGG CCTGTGCGGT CAGCGTCGCG CTCGGCAGCC TCGCGCCGCC GGCCCTGGCG
CAGAAGACCA AGACGCGCGA GCCGGTGACG CTGAACTTCG TCAACGCCGA GATCGAGGGC
GTGGCGCGGG CCATCGGCGC CATCCTGGAG CGGCAGATCG TGATCGATCC GCGCGTGAAG
GGCCAGATCA CGCTGTACAG CGAGCAGCCC CTGTCGCCGC GCGAGGCCTA CCTGAACTTC
CTGGCTGCGC TGCGCGGCCT GGGCTTCACG GTGGTCGAGG TCGCCGGCCT GCTGAAGGTG
GTGCCCGAGG CGGATGCCAA GCTGCAGACC GGCACCGTGT CGGTCGGCAA CGTCACGCGC
CAGGGCGATC AGATCATCAC GCAGATCTTC CGGCTGAACC ACGAGAACGC GAACAACCTG
GTGGCCGTGC TGCGTCCGCT GATCAGCCCG AACAACACCA TCAACGCCAA TCCCGGCAAC
AACTCGCTGG TGATCACCGA CTACGCCGAC AACCTGCAGC GCATCGCCAA GGTGATCGCG
ACGGTGGACG TGCCGGCGGG GACCGATGTC GAGGTGATCC CGCTGCAGCA TGCGGTCGCC
TCCGACATCG CGGTGCTGGT GCAGCGCCTG TCGGACGCGT CAGCGGCAAC GGCGGGCGCC
CCGGCGACCG CGGCTAGCGG CGGCGCTCTG TCGGTGATCG CCGACGCCCG CACCAACTCG
CTGCTGGTGC GCGCCGCCAA CCCGGCCAAG CTGGCCCAGG TGCGCTCGCT GGTCGGCAAG
CTCGACCAGC CCGGCGCCGC AGGCGCCGGC GGCAGCAACA TCTACGTGGT CTACCTGAAG
AACGCCGATG CGACGCGTCT GGCCCAAGTG CTGCGCGCGG CCTTCACCAG CAATACCAGC
AGCAGCAGCA GCGGCTCGGT CGGCGGCGCG GCGAGTCCGG TCACGAACCT GTCCAACCAG
GCGAATGCCC AGCTCGGCAA CCAGACCTCG GGCACGGGTG CTGCACCGCA GACGACCAAC
CCCGTCAGCG GCGCTGCCCA ACCGTCGACC GGCGGCTTCA TCCAGGCCGA TCCGGCCACC
AATGCGCTGA TCATCACCGC GGCCGAGCCG CTGTACCGCC AGCTGCGCTC GGTGATCGAC
CAGCTCGATG CGCGCCGCGC CCAGGTCTAC GTCGAGACGA TGCTGGTGGA GGTCAACGCC
ACCAAGGCGG CAGACGCCGG CATCCAGTGG CAAGGCCTGA TCGGCAAGGA CGGCGACAAG
ACCGGCCTGA TCGGTGGCAC CAATTTCGGG ACCGGCGGGG ACAACATCAT CAACCTGTCG
ATCGCCGCGG CCCAGGGCGG CACCGCCGTC AGCGGCAACC TGCCCTCGGC GGGTCTGAAT
CTCGGCGTCA TCCGCCGCTT CGCGGGTGTC TACACGCTGG GTGCGCTGGC GCGCTTCCTG
GAGACCAACG CCGACGGCAA CATCCTCTCG ACGCCGAACC TCGTCACGCT GGACAACGAG
GAAGCCAAGA TCGTCGTCGG CCAGAACGTG CCTTTCGTCA CCGGCTCGTT CACGAGCACG
GGGACCGGCA GCGGTGCGAC CAATCCGTTC CAGACCATCG AACGCAGGGA TGTCGGCATC
ACGCTGCGCG TGAAGCCCCA GATCGGCGAG AACGGCACCG TGCGCATGAC GATCTACCAG
GAAGCGTCGA GCGTCGTGAA TCAGAACGTC TCGCAAGGCG TGGCCGACGC CACCGCGGGG
CTGGTGACCA ACAAGCGCTC GATCGAGTCG ACGGTGGTGG TGAACGACGG CGACATCCTG
GTCATGGGCG GCCTGATGCA GGACCAGTTC CAGGACAACT CCAGCCGCGT GCCGGGCCTG
GGCCGGCTGC CCGGCATCGG CGCGCTGTTT CGCAGCGACA ACCGCACCCG CACGAAGAGC
AACCTGATGG TCTTCCTTCG CCCGCAGGTG ATGCGCAGCG CCGAGAGCAG CAATGCGCTG
TCGCTCGACC GCTACGATCT GATCCGGGCC CAGCAGCAGA ACAGCCAGCC CGAATCGCGC
GTGCTGCTGC CGAACTTCGA CACGCCGGTG CTGCCACCGC TGCGTCCGCC GCCGGGTGAG
CCGATGCAAC CCGCGCCCAC GCCGCCGGCC TCGCCGCCGC TGCAGAACTG A
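
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a sequence dump like the one above. The sketch below is one plausible way to do so, assuming GC content means the fraction of G and C bases over all bases; the page does not say how its value was derived, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(formatted_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    # The sequence is printed in blocks of 10 bases, so strip all whitespace first.
    bases = "".join(formatted_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Toy input, not the gene itself: 4 of 8 bases are G or C.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Passing the full gene sequence as a single string, line breaks included, would give the fraction to compare against the reported percentage.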
 
Protein sequence
MIMSLTTSPK PARRATALRA LAAACAVSVA LGSLAPPALA QKTKTREPVT LNFVNAEIEG 
VARAIGAILE RQIVIDPRVK GQITLYSEQP LSPREAYLNF LAALRGLGFT VVEVAGLLKV
VPEADAKLQT GTVSVGNVTR QGDQIITQIF RLNHENANNL VAVLRPLISP NNTINANPGN
NSLVITDYAD NLQRIAKVIA TVDVPAGTDV EVIPLQHAVA SDIAVLVQRL SDASAATAGA
PATAASGGAL SVIADARTNS LLVRAANPAK LAQVRSLVGK LDQPGAAGAG GSNIYVVYLK
NADATRLAQV LRAAFTSNTS SSSSGSVGGA ASPVTNLSNQ ANAQLGNQTS GTGAAPQTTN
PVSGAAQPST GGFIQADPAT NALIITAAEP LYRQLRSVID QLDARRAQVY VETMLVEVNA
TKAADAGIQW QGLIGKDGDK TGLIGGTNFG TGGDNIINLS IAAAQGGTAV SGNLPSAGLN
LGVIRRFAGV YTLGALARFL ETNADGNILS TPNLVTLDNE EAKIVVGQNV PFVTGSFTST
GTGSGATNPF QTIERRDVGI TLRVKPQIGE NGTVRMTIYQ EASSVVNQNV SQGVADATAG
LVTNKRSIES TVVVNDGDIL VMGGLMQDQF QDNSSRVPGL GRLPGIGALF RSDNRTRTKS
NLMVFLRPQV MRSAESSNAL SLDRYDLIRA QQQNSQPESR VLLPNFDTPV LPPLRPPPGE
PMQPAPTPPA SPPLQN
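
The protein sequence can be checked against the gene sequence by translating the latter with translation table 11, as named in the Gene Information fields. The sketch below is a simplified illustration rather than the database's own procedure: it builds the codon table from the NCBI amino-acid string for table 11, stops at the first stop codon, and does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(formatted_sequence: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(formatted_sequence.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGATCATGA GCCTGACCAC TTCACCGAAG"))  # MIMSLTTSPK
```

The printed residues match the first ten residues of the protein sequence, and the final codons of the gene (CTG CAG AAC TGA) translate to LQN followed by a stop, matching its last three residues.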