Gene Rmet_5689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5689 
SymbolpilY1 
ID4042553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp2445568 
End bp2448876 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content63% 
IMG OID637981108 
Producttype IV fimbrial biogenesis protein PilY 
Protein accessionYP_587817 
Protein GI94314608 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAT CCGCCCTTCA TCTCGCGCTC GCCGGCTGCC TCGCGATGGC CGTCGCGCTG 
CCGGCCTCGG CCGAGGACAT CGACCTGTAC ACGGGTCTCC AGCCGGAGGC TGGCAAGCCC
AACGTCCTGA TCGTGCTCGA CAACGCGTCA GCCTGGAACG CCTCGGCGAA TTTCACGTGT
TCCACCTCAG GCGTGGTGTC GAGCAATAAC GCGAACACCG ACGCGGGTGC CGAGCAGTGC
GCGCTGTACA ACGCCGTCAA CTCGATTCTG AAAAGCCCAA CGCTGCTCGG CAATATCAAC
CTCGGCATCA TGATGTTCGG CGACAGCAAG AACTTCGGCG GCATCATGAA GTTCCCGTCC
GCGGCCCCGT ACAAGCTGCC GCTGATGGAC ACCACCGGCG TCAACAACCT CCTCACGTAC
ATCAAGACGA TCGACCGGAC TGGCGACAGC GCCGGCAACT CGCAGGTCGC GGGGTCGATG
CAGGAAGCCT GGGCCTACTA TGCCGGCAAG CAGGGCCTGT CCGGCATCAC CTATACGTCG
CCGATTGACA ACCCATGCCA GCGCAACTTC GTCATCTACA TCGCCAACTC GGTCAACAAC
GGCAAACCCG GTGACACCGG ACAGAATGCC ATCGACAGGC TGACGAACCA GGCCAAGGCG
AGCACCGCGC AACTGCAGCA GATCGTGCTG CCATCGGCCA CCGCCAAGTA CCAGGCCAAC
TGGGCTGACG AGTGGGCCCG GTTCAACTAC CAGACCAATC TGTCGGCCAG TTCGACCGAC
CATCAGAACA TCGTCACCTA TTCGATCGGG GTGACCGACG GCAGCAACCC CGACTACCTG
CAACTGGCCA AGAGCATGGC CAGCAACGGT GGGGGCAAGT ATTCGGAAGT CAAGCTCGGG
GACCCGAACG GGCTGGCCGA CGCCCTGATG GCCATCTTCA ACGAAGTGCA GGCGGTGAAC
AGCGTCTTCG CCTCGGTCAG TCTTCCGGCT TCGGTCAATG CGCAGGGCCA GTTCCTGAAC
CAGGTCTACG TCGGGATGTT CCGCCCCGAC GCCACCGCCG CACCGCGCTG GTTGGGGAAC
CTGAAGCAGT ATCAGGTGGG CTATGACACC AACGGCAACC TGGTCCTGCA GGATGCCAAG
GGTGCCAGCG CCATCAGCAA CGCCCAGACC GGTTTCATCT CGCCCAACTC CGTGAGCTTC
TGGACGGCGG AGCCACCCCA GTCCTATTCC AGTGCCGGAT ACGGTGCCAA CATGTCGACC
TGGCCGTCTG GTGGCTACTG GCAGAACAGT CCGTCTGGAA CTGGCTGGCA GTTCGATTCG
CCAGACGGCG AAATCGTGGA AAAGGGCGGC GTGGCGGAAA TGATGCGCGC GCAATGGCTG
GGCGATCAGA CTTCCCGCAG GCTGCTTACC TGCAATGGCG TGGGCACGTG TTCCACGTCG
GGTGGCGTTA GCGCCCTGAC GAGCTTCGAT GCCAACAACA AGTGGTTCAC TACCACGGCT
GGGCTGGCAG CGCTCAATGT CAACGCCACG ACGGATACCA CCAACAAGAA CGTGATCCAG
GCCCTCCCGT CCACCGAGGT GCCTAACCTG ATTGCCTGGG TGCGCGGCCA GGACATGACC
CCTGCCAGCA CCAGTACTAG CTCCGGCGGC AGCAGCGGAA ATGGGAATGG CAAGAACAAC
GGCGGCAGCG GTTCCGGCAC TACCACGACG GCAACGACCA TGGCAGGCGC GGAAGCGGAA
CTCGGCCCGG GGGCACCGGT AACCGTCCGC TCGTCGATCC ACGCCGACGT GCTCCACTCG
CGCCCCGCTG TGGTCAACTA TGGCGGCTCC ATTGGTGTCG TGGTCTACTA CGGCACGAAC
GACGGCGTCT TCCACGCCGT CAACGGCAAC CGCTCACAGG GCATCGGCGC CACACGCGCC
GGCGGCGAAA TCTGGGGCTT CATCGCGCCC GAGTTCTTCG GCAAGCTCAA TCGTCTCTAC
CAGAACACGC CGGAAATCAA GCTCTCGACC ACGCCGGCTG GCATCACGCC GACGCCGCTG
CCGCGCGACT ACTTCTTCGA TGGCAGCACG ACGGTCATGC AGGATCTCCG CGATCCGAAC
AACCCGCACG TCATGATCTT CCTGACCGCA CGACGTGGCG GCAGCCTGGT CTACGCACTA
GACGTGACCG ATCCGACCAA CCCGCAGTTT GTCTGGCGGC TCAGCAACAC TGACCTCGCC
GAGATGGGCC AGACCTGGTC GCAGCCCAAG GTGATGCGCG TGCACGGCAA TGCGAACCCC
GTCATCGTCA TGGGCGCGGG CTACGACCCT GCCGAGGATA GCGACCCGTC CCCGGGCGTC
AATACGATGG GGCGCGGCCT GATCATGGTC GACGCCTACA AGGGCAACAT CGTCTGGTCC
GCGCAGCCGT CCTGTGCTGG CGTCACAACG CCATGCCTGG CCGTATCCGG CATGACCCGC
GCGATTCCTT CCGATGTCAC GCCGCTCGAT CGCAACGGCG ATGGCTATGT GGAGCGCGTG
TACGTGGGCG ATGTAGGCGG CAATGTCTGG CGCGCCGATT TCGAAACCGC AGCCAGCAAT
GCGCCCACCG CCTGGACGGT CACGAAGCTT GCGGCCCTGG GGGGCGGCGC AGGTACGAAC
GATGCTCGCA AGTTCTTCTA CCCGCCTGAT GTCGTCTCGA CGAATGGATA CGATGCGATC
TCGCTTGGCT CCGGCGACCG CGAGCATCCG CTTTACTCGG CGTCCACCGC ACCCGGCACC
GCATACAACG TGGTCAACCG CCTCTACATG CTGAAGGACA CCAACACCAC TGGCATGTCG
ACGTCGTGGA GCCCGATCAC GGAAAGCAAC CTGTTCGATG CCACGGCCAC GACCTACGAT
GGCAGCAAGT CGGGCTTCTT CATCACGCTG GTCAACCCGG GCGAGAAGGC GGTGAACGCA
CCGCTGACCG TCGCTGGCTA TACGAACGTG GGCACCAACA CGCCTTCCGT GCCGGTCGCC
GGCGCTTGCT ATCCGAACCT TGGCACCGCC CGTAGCTATT CCTACAACTT CCTGACCAGC
ATCGGCCAGA ACACGAACCG CTACATCGTG CTGGATGGCG GCGGCTTCCC GCCTTCCTCG
GTGTTTGGCC TGATCACAGT CAGTACTGGC GGCAACTCCG TCGTCACGCC AGTGCTGCTG
GGCGGTGGCA ACCAGACCGC CACCGGTGGC GGTGACTCGA AGTCCGGCCT CGGCGTACAG
AAAGTGAAGC CTGCCGGGCT TGGCAAGCGC AAGCGTGTCT ACTGGTACGA CGAGATCGAC
AAGAAGTAG
 
Protein sequence
MRKSALHLAL AGCLAMAVAL PASAEDIDLY TGLQPEAGKP NVLIVLDNAS AWNASANFTC 
STSGVVSSNN ANTDAGAEQC ALYNAVNSIL KSPTLLGNIN LGIMMFGDSK NFGGIMKFPS
AAPYKLPLMD TTGVNNLLTY IKTIDRTGDS AGNSQVAGSM QEAWAYYAGK QGLSGITYTS
PIDNPCQRNF VIYIANSVNN GKPGDTGQNA IDRLTNQAKA STAQLQQIVL PSATAKYQAN
WADEWARFNY QTNLSASSTD HQNIVTYSIG VTDGSNPDYL QLAKSMASNG GGKYSEVKLG
DPNGLADALM AIFNEVQAVN SVFASVSLPA SVNAQGQFLN QVYVGMFRPD ATAAPRWLGN
LKQYQVGYDT NGNLVLQDAK GASAISNAQT GFISPNSVSF WTAEPPQSYS SAGYGANMST
WPSGGYWQNS PSGTGWQFDS PDGEIVEKGG VAEMMRAQWL GDQTSRRLLT CNGVGTCSTS
GGVSALTSFD ANNKWFTTTA GLAALNVNAT TDTTNKNVIQ ALPSTEVPNL IAWVRGQDMT
PASTSTSSGG SSGNGNGKNN GGSGSGTTTT ATTMAGAEAE LGPGAPVTVR SSIHADVLHS
RPAVVNYGGS IGVVVYYGTN DGVFHAVNGN RSQGIGATRA GGEIWGFIAP EFFGKLNRLY
QNTPEIKLST TPAGITPTPL PRDYFFDGST TVMQDLRDPN NPHVMIFLTA RRGGSLVYAL
DVTDPTNPQF VWRLSNTDLA EMGQTWSQPK VMRVHGNANP VIVMGAGYDP AEDSDPSPGV
NTMGRGLIMV DAYKGNIVWS AQPSCAGVTT PCLAVSGMTR AIPSDVTPLD RNGDGYVERV
YVGDVGGNVW RADFETAASN APTAWTVTKL AALGGGAGTN DARKFFYPPD VVSTNGYDAI
SLGSGDREHP LYSASTAPGT AYNVVNRLYM LKDTNTTGMS TSWSPITESN LFDATATTYD
GSKSGFFITL VNPGEKAVNA PLTVAGYTNV GTNTPSVPVA GACYPNLGTA RSYSYNFLTS
IGQNTNRYIV LDGGGFPPSS VFGLITVSTG GNSVVTPVLL GGGNQTATGG GDSKSGLGVQ
KVKPAGLGKR KRVYWYDEID KK