Gene Mpe_A3408 details

Gene Information

Locus tag: Mpe_A3408
Symbol: pilY1
ID: 4786338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3622953
End bp: 3626774
Gene Length: 3822 bp
Protein Length: 1273 aa
Translation table: 11
GC content: 62%
IMG OID: 640091984
Product: pilus tip-associated protein
Protein accession: YP_001022596
Protein GI: 124268592
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1
TIGRFAM ID:
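
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3622953, 3626774)
print(length)                  # 3822
print(protein_length(length))  # 1273
```

Both results match the Gene Length and Protein Length fields of this record.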


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0199627
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCCT ACCCAACCCC AGACCGCCAT TCCACGCAGA TGCGATGGAA TTGCGGCGAT 
CACGCCGGCC TCGGGATGGC TGGGCTTGCA AGAGCTCTGG TCTTGAGTTC AGTCTGGATG
ACTTCGTCCT TGAGCGCGAA TGCCGCCACG GCTCTGGCAG ATGTCCCGGT GTTTGCAGGA
ACCACGGTCC CAGGCAACGT GGCCTTGGCA CTTTCAGTGG AATGGCCGAC TGCGGTTCGT
TCTGCTCACA CGACCGCGTA CGTGACGACC GATACGTATC TCGGCTATTT CGACCCGAAC
AAGTGCTACA AGTACCAGTA CTACGCGACC GAGACCGCCA CGCAGCTGCG GCACTTCTAT
CCGGTCGGCG ACAACACCGC GCACAACTGC ACCGGCACCG ACGAATGGAG CGGCAATTAC
CTGAACTGGT CCTCGACACC GACGATCGAC CCGTTTCGCT GGGCAATGAC CGGCGGCTAC
CGGATCATCG ACACAACCAC AACGACGATC CTCCAAAAGT CGCGTGACAC CAATCAGGGC
CTGGCACCGG ACAAAAGCCT CACGGTGGCG GCCACCGTGG CGCAGGCTTC GCCGCTGGCC
TTCACCAACC TCCATACGCG CATTCAAGGC CAGGGCATCA ACATGCTGTT CGCCTCGTCG
TCGGCGGCGC TTGGCGGTGT CGTGGTCGAC TACAACCCGG CTGTCCCACC CGTGGCAGGC
GTCGTCTACC GCGCTGTGAT GCGGGTGAAA GTCTGCGACT CCTCGGCGAC AGCCGGCACA
CGCGAAGCCA ATTGCGTGCA GTACGGCAGC AACTGGAAAC CAGAAGGACT GGTTCAGAAA
TATTCCAACC GTATCCGCTT CAGCGCCTTC GGCTACCTGA ACCATAGCGA CTTCCTGCGC
GACGGCGCGG TGCTGCGCGC TCAGCAGAAG TTCGTCGGCC CGACACAGCC GGTCCCGGGC
CAGCCGGCCG TCACGAACGC AGCGGCTGAA TGGGACGCGA CGACCGGCAT CTTTGCGATC
AACCCCGATG CGGCCGACGC GACACAGACC AACGCGAGCT TCACGCCGAG CGTGTCGATC
ACGAACAGCG GGGTGATGAA CTACCTGAAC AAGTTCGGTC AGCTCAACAC CAACAACTAC
AAGAGCTACG ACCCCGTCAG CGAACTGTTC TACACCGCGC TGCGCTACTA CAAGAACCAG
GGCAACGTTC CGGAGTACAC GGCGATGGGT ACCGCCAATG CTGCCACGCG GACGGCCTAC
CTCGACGACT TTCCGGTCAT CACCAACTGG AACGACCCCA TCCTCTATTC GTGCCAGCGC
AACTTCATTC TCGGCATCGG CGACATCTAC ACCCACCGCG ACAAGGATCT CCCCGGCTCC
ACAGGTACAA CGGAAGAGCC GAATCCCAAA CCAGCGGCGG TCACGGGCGA TACCACGGTG
AATGCCGAGA CCATGACCAA CCGAGCGTTC GCGCTGCAAG GTCTGGGCGC ACCGAACGTC
AACAACTACA GCGGCCGCAA CAACTCGGCC GGCATCGTCG GCCTGGCCTA CTACGCCAAC
ACCACCGACA TCCGACCCAC GGTGGCAGGC AATGCCGCCA CCGAGGGCAA GCAGACGGTC
CAGACCTACT GGGTCGACGT GCTCGAGCAA CCTTTCGTCG CGAACAACCA GTTTTACCTC
GCGGCAAAGT ACGGTGGCTT CACCGTGCCG AACGACTACG ACCCGGCCAC CCGCACCGCG
GCGCTGCCGC AGGAGTGGTG GTCCACGACC GGCCAGACCG TCGGTACTCA GGCACGGCCC
GACAACTACT ACACCGCCGG CCGTCCCGAC ACCATGGTCG CGGGCCTGAC GGCAGCCTTC
GAGAAGATCG TCGCAGACCT GGACGCCTAC ACCACCTCGT TCTCCACCGT AGACCCGGTC
CTGACCCTCA CCGGCAATGC GAGCTACAGC GCCTCGTACG ACGCGGAAAA CAACAGCTGG
ACGGGCGAAT TGCTGGCGAA CTCGCTGTCG TTCAGCTCGG GCGTCCCCAT CCCCACACGG
CAATGGGGCG CCACCGAGAA GTTGAAGACG CAGCTCACCG GCACGGGCTG GAGCACCAAC
CGAAATGTGG TGACCTGGGA TCCTGCGGGC GCAACCGGCG TGGCATTTCG CTCTACCAGC
ACCGGCGCCG GAAGAGTCAA TGCTGCCCAG CTTGCACTGC TCGACACGAG CTATGTTGCC
GGGGACGACA GCGTCAACTA TCTGAATTAC CTTCGAGGTG ATCGGACGAA CGAACTGGCC
TCGACGACCC CCGGTTACCG AACGCGCACG GAGTTGCTCG GCGACATCGT CGGGTCGCGT
GTGTTGCCGG TCGGTCCGCC TTCGCTATCG CTGTCGGACG TCACCAACCC TGGCTATCGA
GCATTCAGAG CGGCCCGTGC GAACCGCCCG ACCGTCGTTT ATGTCGGCGC GAACGACGGC
ATGCTCCACG CTTTCAATGG CGCACTTTCG GGCACCGATG CCGGGCGTGA GATCTTCGCC
TACATCCCGA ATGCAGCATT CAATGGCCCG GACGGAACGC CGAATGTGAG CGGTCTGGCC
TCGCTAGGCC GCACTCCGTT CAATCATCGC TTCTTCGTGA ACGCCACGCC GGTGGTCAAG
GACGTGGACT TCAACCGGAC CGGGACAGGC AGCACGCCGG CCTCGACGGC GTCCGATTGG
CGGTCGATCC TGGTCGGCGG CCTTGGAAAG GGCGGGCGCA GCTTCTACGC GATCGATGTG
ACCGATCCGG GCGGAATCAA CTCCGAAGCA GACGCTGCGG CCCGCGTGCT TTGGGAGTTC
ACCGATTCAC GCATGGGCTA TTCGTTCGGC GAACCGATGA TCGTGAAGAC GCGCAAGTAC
GGCTGGACGG TGATCTTCAC GTCGGGCTAC AACACACCCG ACGGTCAAGG CTACCTGTTC
TTCGTGAATC CTAAGACGGG TGCGCTGCTC GAAGCGGTTT CTACTGGCGT GGGGACGATT
GCCAACGATG CGGGCCTGGC GCACGCGAAC GCTTACGTGC TCGACTTCAC CGATGGCTAC
GCTGACGCGA TCTACGCTGG CGATCTGCTC GGCAATCTCT GGCGGCTGAA CGTCACTGGC
ACGACCGGAA GCTACCCTGC CCCGCTGCGA CTTGCCACAC TCAGCCACCC CACCGAAGGG
GCTCAACCAG TGACTTCGCG GCCTCTGATC GAAGTGCACC CGACGACGCG CCAACGCGTG
GTGATGATCG GCACCGGTCG CCTGTTGGAC TCGACCGACA TCGGGTCCGC CCAGATGCAA
AGCTTCTACG CCATCACAGA TGGCAATGCG GTCGCCTTCA ACACCACGCT TCCGAGCGGC
GTCTCGTTCC CGATCGGTCG TGACAAGCTG GTCGAGAACA CTAATCTCCT GGCGGGCTAC
ACCGCCACGG CGGCCGCCCC CATGGGATGG TTCATCGATC TCGGCAAGAA CGCGAGCAAC
ACGGTGGCAT GGCGGCTGAC CAATGACCCT ATCAGCTTCT TCGGCACCGT CGCCTTCACA
CCCAGTTTGC CTGGTGGCGA CAGCTGCAAC CCCTCGGGCA TCAGCCGTAT CTACGGAGCG
GATTTCTCCA GAGGTATCTC TCGATTGCTG GAGAACTCCA CGGTTGTCTC GTATATCCAG
ACGACCAGCC TTATCACCGA CTTCAAGTTC GTGAAGGTCG ACGGCAAGAT GACGGCGATC
CGCGGCGACG AGAAGGGTGA GTTGGGCAAG AACGATCTCA ATTTCGGTCT CCCTGTCGGA
TTGAGGCGCT TGAACTGGCG CGAACTCCCG CTGTCGAACT GA
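
The gene sequence above is laid out in blocks of 10 bases, so it has to be joined into one string before its length or GC content can be checked against the record. This is a minimal sketch of that clean-up. The helper names are hypothetical, and the GC calculation simply counts G and C over the total length, which may differ slightly from how the database rounds its reported value.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small demonstration.
first_line = "ATGAAGCCCT ACCCAACCCC AGACCGCCAT TCCACGCAGA TGCGATGGAA TTGCGGCGAT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 3))
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` also removes line breaks.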
 
Protein sequence
MKPYPTPDRH STQMRWNCGD HAGLGMAGLA RALVLSSVWM TSSLSANAAT ALADVPVFAG 
TTVPGNVALA LSVEWPTAVR SAHTTAYVTT DTYLGYFDPN KCYKYQYYAT ETATQLRHFY
PVGDNTAHNC TGTDEWSGNY LNWSSTPTID PFRWAMTGGY RIIDTTTTTI LQKSRDTNQG
LAPDKSLTVA ATVAQASPLA FTNLHTRIQG QGINMLFASS SAALGGVVVD YNPAVPPVAG
VVYRAVMRVK VCDSSATAGT REANCVQYGS NWKPEGLVQK YSNRIRFSAF GYLNHSDFLR
DGAVLRAQQK FVGPTQPVPG QPAVTNAAAE WDATTGIFAI NPDAADATQT NASFTPSVSI
TNSGVMNYLN KFGQLNTNNY KSYDPVSELF YTALRYYKNQ GNVPEYTAMG TANAATRTAY
LDDFPVITNW NDPILYSCQR NFILGIGDIY THRDKDLPGS TGTTEEPNPK PAAVTGDTTV
NAETMTNRAF ALQGLGAPNV NNYSGRNNSA GIVGLAYYAN TTDIRPTVAG NAATEGKQTV
QTYWVDVLEQ PFVANNQFYL AAKYGGFTVP NDYDPATRTA ALPQEWWSTT GQTVGTQARP
DNYYTAGRPD TMVAGLTAAF EKIVADLDAY TTSFSTVDPV LTLTGNASYS ASYDAENNSW
TGELLANSLS FSSGVPIPTR QWGATEKLKT QLTGTGWSTN RNVVTWDPAG ATGVAFRSTS
TGAGRVNAAQ LALLDTSYVA GDDSVNYLNY LRGDRTNELA STTPGYRTRT ELLGDIVGSR
VLPVGPPSLS LSDVTNPGYR AFRAARANRP TVVYVGANDG MLHAFNGALS GTDAGREIFA
YIPNAAFNGP DGTPNVSGLA SLGRTPFNHR FFVNATPVVK DVDFNRTGTG STPASTASDW
RSILVGGLGK GGRSFYAIDV TDPGGINSEA DAAARVLWEF TDSRMGYSFG EPMIVKTRKY
GWTVIFTSGY NTPDGQGYLF FVNPKTGALL EAVSTGVGTI ANDAGLAHAN AYVLDFTDGY
ADAIYAGDLL GNLWRLNVTG TTGSYPAPLR LATLSHPTEG AQPVTSRPLI EVHPTTRQRV
VMIGTGRLLD STDIGSAQMQ SFYAITDGNA VAFNTTLPSG VSFPIGRDKL VENTNLLAGY
TATAAAPMGW FIDLGKNASN TVAWRLTNDP ISFFGTVAFT PSLPGGDSCN PSGISRIYGA
DFSRGISRLL ENSTVVSYIQ TTSLITDFKF VKVDGKMTAI RGDEKGELGK NDLNFGLPVG
LRRLNWRELP LSN
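
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on an external library. It uses the amino acid assignments that NCBI translation table 11 shares with the standard code, and it ignores the table's alternative start codons, which does not matter here because this gene begins with ATG. Ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAGCCCT ACCCAACCCC AGACCGCCAT"))  # MKPYPTPDRH
```

The last four codons of the gene sequence, CTG TCG AAC TGA, translate to LSN followed by a stop, which matches the end of the protein sequence.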