Gene Mpe_A0509 details


Gene Information

Locus tag: Mpe_A0509
Symbol: pilB
ID: 4787084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 547297
End bp: 549015
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 63%
IMG OID: 640089067
Product: pilus biogenesis ATPase
Protein accession: YP_001019706
Protein GI: 124265702
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID: [TIGR02538] type IV-A pilus assembly ATPase PilB
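
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(547297, 549015)
print(length_bp)                  # 1719
print(protein_length(length_bp))  # 572
```

Both values agree with the Gene Length and Protein Length fields of this record.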


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0385141
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATACCC TCGCCGATGC CCCGCAGAGT TCGCTGTCCG GCGTCGCGCG CGTGCTCGTG 
CACGCGGGCA AGCTCAACGT CAAGACCGCC GAAGATCTGG TCCGCAGCGC CAAGGAGCAG
AAGCGCAGTT TCATCTCGGC GGTCCTGAGC GCTGGCGCCG TCAACCCTTC GGATCTTGCC
CATACGCTGT CCAGCGTGCT GGCGATGCCC TTGCTCGATC TGGCGGCGGT CGATCCGCAG
CGCCTGCCTC GCAACGTCAT CGATGCCAAG CTGGCCACCC AGTATCAGGT ATTGGTGCTG
GGCAAGCGCG GCAACCGCCT CTTCATCGCC GGGGCCGACC CCACCGACCA GGAGGCCGCT
GAACGCATCA AGTTCGCAAC GCAGCTGACG CCCGAGTGGG TGATCGTCGA GCACGACAAG
CTGTCCCGGC TCGTCGATGC AGCTACGACA ACGGCCAGCG AGTCTCTCGA GTCGCTGGTC
GCCGGTGATT TCGAGTTCGA CGTCACCGAG GAAGACACCA GCGCATCCGA CGCCGCCGAG
GTCACCACCG ACGTCGAGGA CGCACCGGTC GTTCGCTTCC TTCAGAAGAT GCTGATCGAT
GCGATCAACG CACGAGCCTC CGATCTGCAC TTCGAGCCGT TCGAATACAA CTACCGGGTC
CGCTTCCGCG TCGATGGCGA ACTGCGCGAG ATCACGCAAC CGCCGATTGC CATCAAGGAC
AAGCTCGCCT CCCGCATCAA GGTGATTTCC CGGCTCGACA TTGCCGAGAA GCGCGTGCCG
CAGGACGGCC GCATGAAGCT CAAGTTCGGC AGCAAGGCGA TTGATTTCCG GGTCAGCACC
CTGCCCACGC TGTTCGCCGA GAAGATCGTG ATCCGTATCC TCGACCCGTC GAGTGCCAAG
CTCGGCATCG AGGCGCTCGG CTACGAGAAG GTCGAGAAAG ACCGCTTGCT GGAAGCCATC
AAGCGCCCCT ATGGCATGGT GCTGGTCACC GGTCCCACAG GCTCCGGCAA GACGGTGTCT
CTGTACACCT GCCTGAACAT CCTGAACCAA CCCGGCGTCA ATATCTCCAC GGTGGAGGAC
CCGGCCGAAA TCAACCTCCC GGGCGTCAAC CAGGTCAACG TCAACGACAA GGCAGGACTG
AACTTCTCGG TCGCCTTGAA GGCCTTCCTG CGGCAGGATC CGGACGTCAT CATGGTCGGC
GAAATTCGCG ATCTCGACAC GGCGGACATC GCCATCAAGG CGGCGCAGAC CGGGCACATG
GTGATGAGCA CGCTGCATAC CAACGACGCG CCGACGACGC TGACCCGCCT GATGAACATG
GGCGTCGCGC CGTTCAACAT CGCCTCCAGC GTGATCCTGA TCACGGCGCA GCGGCTGGCG
CGCAAACTCT GCGAGAACTG CAAGGCGCCG GCCGATTACC CTCGCGAGGC GTTGCTTCGC
GCCGGCTTCG CTGAATCGGA CCTGGATGGC AGTTGGAAGC CCTATCGTGC CGTCGGCTGC
TCCAGTTGCA GCAATGGCTA CCGGGGGCGT GTCGGCATCT ACCAGGTCAT GCCGATCAGC
GAAGAGATCC AGCGCATCAT CCTGACGCAG GGCAATGCGG TCGACATCGC AAAGCAGGCA
CAGCAAGAGG GCGTGCGCGA TTTGCGTCAG TCGGGTCTGG TCAAGGTGCG AGCGGGTGTG
ACCACCCTGG AAGAAGTCAT CTCGGTCACC AACGAGTAA
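
The gene sequence is laid out in rows of six 10-base blocks, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of that cleanup together with a GC fraction calculation; the function names are illustrative, and the database may round or compute its GC content field differently.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, copied with its block spacing.
first_row = "GTGGATACCC TCGCCGATGC CCCGCAGAGT TCGCTGTCCG GCGTCGCGCG CGTGCTCGTG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_fraction(first_row), 2))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.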
 
Protein sequence
MDTLADAPQS SLSGVARVLV HAGKLNVKTA EDLVRSAKEQ KRSFISAVLS AGAVNPSDLA 
HTLSSVLAMP LLDLAAVDPQ RLPRNVIDAK LATQYQVLVL GKRGNRLFIA GADPTDQEAA
ERIKFATQLT PEWVIVEHDK LSRLVDAATT TASESLESLV AGDFEFDVTE EDTSASDAAE
VTTDVEDAPV VRFLQKMLID AINARASDLH FEPFEYNYRV RFRVDGELRE ITQPPIAIKD
KLASRIKVIS RLDIAEKRVP QDGRMKLKFG SKAIDFRVST LPTLFAEKIV IRILDPSSAK
LGIEALGYEK VEKDRLLEAI KRPYGMVLVT GPTGSGKTVS LYTCLNILNQ PGVNISTVED
PAEINLPGVN QVNVNDKAGL NFSVALKAFL RQDPDVIMVG EIRDLDTADI AIKAAQTGHM
VMSTLHTNDA PTTLTRLMNM GVAPFNIASS VILITAQRLA RKLCENCKAP ADYPREALLR
AGFAESDLDG SWKPYRAVGC SSCSNGYRGR VGIYQVMPIS EEIQRIILTQ GNAVDIAKQA
QQEGVRDLRQ SGLVKVRAGV TTLEEVISVT NE
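
The protein sequence starts with M although the gene sequence starts with GTG, because GTG is one of the alternative start codons in translation table 11 and an initiator codon is read as methionine. The following is a minimal sketch of that translation, assuming the codon assignments and start codon set of NCBI translation table 11; `translate_cds` is an illustrative name, not a library function.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGATACCC TCGCCGATGC CCCGCAGAGT"))  # MDTLADAPQS
```

The output matches the first ten residues of the protein sequence above.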