Gene Mpe_A3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3137 
SymbolgspE 
ID4786650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3337187 
End bp3338611 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content68% 
IMG OID640091708 
Productpili biogenesis ATPase 
Protein accessionYP_001022325 
Protein GI124268321 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT
[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.629616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0029573 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCACCC GCCACCCGCT GCCCTACGCC TACGCGAAGG CCCACTCGCT GCTGCTCGAG 
GACAACGGCA GCCAGCTCGT GCTGTGGGCC GGCGACGCCG TCTCGCCGTC CGTGTTGAGC
GAGGTGCTGC GCCTCTACGC CGTCGACGCC CTCGAGCGCG AAGCGCCGGC CGCACTGTCG
CAACGCATCG CCAGCGTCTA TGCCGGCGGC GAGTCGAGCG CCGCTGCGGT GATCGGCGAA
GTCGAGAGTG GCGTCGACCT GTCGCGCCTG ATGCAGGACC TGCCGGCGGT GGAAGACCTG
CTCGAGGCCG CCAACGATGC ACCCATCATC CGCATGCTCA ACGCGCTGCT GACGCAGGCC
GCGAAGGACG GCGCGAGCGA CATCCACATC GAGCCCTACG AGCGCAGCTC GGCAGTGCGC
TTCCGCGTCG ACGGCACGCT GCGCGAGGTG GTGCAGCCGA ACAAGGCGCT GCACGCCGCG
CTGATCTCGC GCCTGAAGAT CATGGCCGAG CTCGACATCT CCGAGAAGCG CCTGCCGCAG
GACGGCCGCA TCTCGCTGCG CATCGGGGGC CGGGCAATCG ACGTGCGCGT GTCCACGCTG
CCCAGCGCGC ACGGCGAACG TGCGGTGCTG CGGCTGCTCG ACAAGGGCGA CAGCACGCGC
TTCACGCTCG AATCGCTGGG CATGAGCGGC GAGACGCTCA CCAAGTTCAA GCGCCTCACG
GCACAGCCTC ACGGCATCGT GCTCGTCACG GGACCGACCG GCTCGGGCAA GACCACCACG
CTGTACGCCG GCCTCGGCCA GGTCGACACG CAGACGACCA ACGTGCTGAC GGTCGAGGAC
CCGATCGAGT ACGAGCTGCC GGGGATCGGG CAGACGCAGG TCAATCCGAA GATCGACCTG
ACCTTCGCCA AATCGCTGCG CGCGATCCTG CGCCAGGACC CAGACGTCAT CATGATCGGC
GAGATCCGCG ACTTCGAGAC CGCGCAGATC GCGATCCAGG CCTCGCTGAC CGGCCACCTG
GTACTCGCCA CGCTGCACAC CAACGACGCG CCCAGCGCCG TCACGCGACT GACCGACATG
GGCGTCGAGC CCTTCCTGCT CAGCAGCTCG CTGCTCGGTG TGCTGGCGCA GCGGCTGGTG
CGCAAGCTCT GCCCGGACTG CAAGAAGCAG GACGACAAGG GGCGCTGGCA CCCGGTCGGC
TGTCCCACCT GCGGCAGCAC CGGCTATAAG GGTCGCACCG GCGTCTACGA ATTGATGGTC
GCCGACAGCG CCCTGCAGAG CCTGATCCAC AGCCGTGCCG CCGAGTCGCA ATTGTTCGTC
GCTGCGGAAC GGGGCGGCAT GAAGACGATG CGCGAAGACG GCGAGCGGTT GGTGGAGAGC
GGGGTGACAT CGCTGGAAGA GCTGTTGAGG GTGACGAGGG AATGA
 
Protein sequence
MATRHPLPYA YAKAHSLLLE DNGSQLVLWA GDAVSPSVLS EVLRLYAVDA LEREAPAALS 
QRIASVYAGG ESSAAAVIGE VESGVDLSRL MQDLPAVEDL LEAANDAPII RMLNALLTQA
AKDGASDIHI EPYERSSAVR FRVDGTLREV VQPNKALHAA LISRLKIMAE LDISEKRLPQ
DGRISLRIGG RAIDVRVSTL PSAHGERAVL RLLDKGDSTR FTLESLGMSG ETLTKFKRLT
AQPHGIVLVT GPTGSGKTTT LYAGLGQVDT QTTNVLTVED PIEYELPGIG QTQVNPKIDL
TFAKSLRAIL RQDPDVIMIG EIRDFETAQI AIQASLTGHL VLATLHTNDA PSAVTRLTDM
GVEPFLLSSS LLGVLAQRLV RKLCPDCKKQ DDKGRWHPVG CPTCGSTGYK GRTGVYELMV
ADSALQSLIH SRAAESQLFV AAERGGMKTM REDGERLVES GVTSLEELLR VTRE