Gene Mpe_A3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3406 
SymbolpilW 
ID4786336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3621089 
End bp3622285 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content67% 
IMG OID640091982 
Productfimbrial biogenesis protein 
Protein accessionYP_001022594 
Protein GI124268590 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4966] Tfp pilus assembly protein PilW 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0526818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCC GTTGCACCCT CTCGAAACGC GGCGCGTCAG TTCGCGGCTT CTCGCTGGTC 
GAACTGCTGG TTGCGGTCGC CATCGGCCTC GTCGTCACGC TCGCGGTGTT CGGCGTGCTG
GCAGCCAGCG AGGGCCGCAA GCGCACCTCG GTCTCAATCA ATGATGCCAA CCAGTCCGGC
GCCTATGCCG CCTACACGAT CGACCGCATG ATCCGCAGCG CAGGCTCCGG CTTCTCCGAG
GGCTGGGGCC GTGTCGGCGG ATGCCGACTG AACGCAACGT TGGGCGCGGC AGGAACGTGG
CCACGAGCGG CCGCCCTCCC TGCGCCCTTC ACCGCCATCC CGCTCACGCT TCGGCTGGCG
CCGGTGGTGA TTTTTCAGGG GGCCTCGACC GCTGGCTCCG ATGTCTTGAT GGTGATGAAC
GGGGCCGCCG GCTTCGCGGA ATCCCCCGCG GCGGTGCGCC CGGGTTCGGT GAGCGCGCTC
GAGTTCCGTG CACCCAATAC GATCGGATTC TTCGCGAACG ACCTGGTGAT GCTGGCCGGT
GGTGGCGAGT GCCAGTTGAC GCAGGTGGAT GATGACAAGC CGGCATGCGT CGCCGATCCG
ACCGCCGTGT TCCCACCGTT GAACTGTGGT CAGCAAGTGC CTCTGGGCGG CAGTTTCCAC
AACGCCTCGT CGACGACGTT TGCCACCTTG TCCACGGCGG ACGCCTATGC AATCCCGCTG
GGCAATACGA CCACCAATCG ACCGCAGTTC CAGTTGCTTG GCGTCGGCGA CAACACCACG
CTGTTCAGCT ACGACATGCT GCTGCTCAAC GGCAACGACG CGCCGCTGCC GCTGGCCGAA
GGCGTGATGA CGCTACGCGC GGTCTATGGG GTCGACACCG ATGACGATGG TGTGATCAAC
GACTGGTTCG CTCCCACCGC CGGAAGCATC TGGGATAGCG CGGCGCTGAT GAACGGCTCG
CCCGCCTCGG CGACCAACCT GCGGCGGATC GTTGCGGTGC GCATCGGCCT CGTGATGCGC
TCTTCGCTCA TCGAGCGGGA GGACGTGGCA CCGGCCACGC TGGGCCTGTT CACCGACCTT
ACCAGCGGTG GCGCCCCGCT GACGCAGAAC GTCGCGATCG CGACCGCCGA TCGACGCATG
CGCCACCGCG CCATCGAGGT GACCGTTCCT GTCCGCAACC TGCTGCTCCG GCCTTGA
 
Protein sequence
MTSRCTLSKR GASVRGFSLV ELLVAVAIGL VVTLAVFGVL AASEGRKRTS VSINDANQSG 
AYAAYTIDRM IRSAGSGFSE GWGRVGGCRL NATLGAAGTW PRAAALPAPF TAIPLTLRLA
PVVIFQGAST AGSDVLMVMN GAAGFAESPA AVRPGSVSAL EFRAPNTIGF FANDLVMLAG
GGECQLTQVD DDKPACVADP TAVFPPLNCG QQVPLGGSFH NASSTTFATL STADAYAIPL
GNTTTNRPQF QLLGVGDNTT LFSYDMLLLN GNDAPLPLAE GVMTLRAVYG VDTDDDGVIN
DWFAPTAGSI WDSAALMNGS PASATNLRRI VAVRIGLVMR SSLIEREDVA PATLGLFTDL
TSGGAPLTQN VAIATADRRM RHRAIEVTVP VRNLLLRP