Gene Information |
Locus tag | Rmet_5689 |
Symbol | pilY1 |
ID | 4042553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 2445568 |
End bp | 2448876 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637981108 |
Product | type IV fimbrial biogenesis protein PilY |
Protein accession | YP_587817 |
Protein GI | 94314608 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | |
| |
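The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record states neither convention, but both reproduce the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2445568
end_bp = 2448876
gene_length_bp = 3309
protein_length_aa = 1102

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # 3309 1102
```

Both computed values match the Gene Length and Protein Length fields, and the gene length is an exact multiple of three, as expected for an unspliced CDS.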
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.180745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAAT CCGCCCTTCA TCTCGCGCTC GCCGGCTGCC TCGCGATGGC CGTCGCGCTG CCGGCCTCGG CCGAGGACAT CGACCTGTAC ACGGGTCTCC AGCCGGAGGC TGGCAAGCCC AACGTCCTGA TCGTGCTCGA CAACGCGTCA GCCTGGAACG CCTCGGCGAA TTTCACGTGT TCCACCTCAG GCGTGGTGTC GAGCAATAAC GCGAACACCG ACGCGGGTGC CGAGCAGTGC GCGCTGTACA ACGCCGTCAA CTCGATTCTG AAAAGCCCAA CGCTGCTCGG CAATATCAAC CTCGGCATCA TGATGTTCGG CGACAGCAAG AACTTCGGCG GCATCATGAA GTTCCCGTCC GCGGCCCCGT ACAAGCTGCC GCTGATGGAC ACCACCGGCG TCAACAACCT CCTCACGTAC ATCAAGACGA TCGACCGGAC TGGCGACAGC GCCGGCAACT CGCAGGTCGC GGGGTCGATG CAGGAAGCCT GGGCCTACTA TGCCGGCAAG CAGGGCCTGT CCGGCATCAC CTATACGTCG CCGATTGACA ACCCATGCCA GCGCAACTTC GTCATCTACA TCGCCAACTC GGTCAACAAC GGCAAACCCG GTGACACCGG ACAGAATGCC ATCGACAGGC TGACGAACCA GGCCAAGGCG AGCACCGCGC AACTGCAGCA GATCGTGCTG CCATCGGCCA CCGCCAAGTA CCAGGCCAAC TGGGCTGACG AGTGGGCCCG GTTCAACTAC CAGACCAATC TGTCGGCCAG TTCGACCGAC CATCAGAACA TCGTCACCTA TTCGATCGGG GTGACCGACG GCAGCAACCC CGACTACCTG CAACTGGCCA AGAGCATGGC CAGCAACGGT GGGGGCAAGT ATTCGGAAGT CAAGCTCGGG GACCCGAACG GGCTGGCCGA CGCCCTGATG GCCATCTTCA ACGAAGTGCA GGCGGTGAAC AGCGTCTTCG CCTCGGTCAG TCTTCCGGCT TCGGTCAATG CGCAGGGCCA GTTCCTGAAC CAGGTCTACG TCGGGATGTT CCGCCCCGAC GCCACCGCCG CACCGCGCTG GTTGGGGAAC CTGAAGCAGT ATCAGGTGGG CTATGACACC AACGGCAACC TGGTCCTGCA GGATGCCAAG GGTGCCAGCG CCATCAGCAA CGCCCAGACC GGTTTCATCT CGCCCAACTC CGTGAGCTTC TGGACGGCGG AGCCACCCCA GTCCTATTCC AGTGCCGGAT ACGGTGCCAA CATGTCGACC TGGCCGTCTG GTGGCTACTG GCAGAACAGT CCGTCTGGAA CTGGCTGGCA GTTCGATTCG CCAGACGGCG AAATCGTGGA AAAGGGCGGC GTGGCGGAAA TGATGCGCGC GCAATGGCTG GGCGATCAGA CTTCCCGCAG GCTGCTTACC TGCAATGGCG TGGGCACGTG TTCCACGTCG GGTGGCGTTA GCGCCCTGAC GAGCTTCGAT GCCAACAACA AGTGGTTCAC TACCACGGCT GGGCTGGCAG CGCTCAATGT CAACGCCACG ACGGATACCA CCAACAAGAA CGTGATCCAG GCCCTCCCGT CCACCGAGGT GCCTAACCTG ATTGCCTGGG TGCGCGGCCA GGACATGACC CCTGCCAGCA CCAGTACTAG CTCCGGCGGC AGCAGCGGAA ATGGGAATGG CAAGAACAAC GGCGGCAGCG GTTCCGGCAC TACCACGACG GCAACGACCA TGGCAGGCGC GGAAGCGGAA CTCGGCCCGG GGGCACCGGT AACCGTCCGC TCGTCGATCC ACGCCGACGT GCTCCACTCG 
CGCCCCGCTG TGGTCAACTA TGGCGGCTCC ATTGGTGTCG TGGTCTACTA CGGCACGAAC GACGGCGTCT TCCACGCCGT CAACGGCAAC CGCTCACAGG GCATCGGCGC CACACGCGCC GGCGGCGAAA TCTGGGGCTT CATCGCGCCC GAGTTCTTCG GCAAGCTCAA TCGTCTCTAC CAGAACACGC CGGAAATCAA GCTCTCGACC ACGCCGGCTG GCATCACGCC GACGCCGCTG CCGCGCGACT ACTTCTTCGA TGGCAGCACG ACGGTCATGC AGGATCTCCG CGATCCGAAC AACCCGCACG TCATGATCTT CCTGACCGCA CGACGTGGCG GCAGCCTGGT CTACGCACTA GACGTGACCG ATCCGACCAA CCCGCAGTTT GTCTGGCGGC TCAGCAACAC TGACCTCGCC GAGATGGGCC AGACCTGGTC GCAGCCCAAG GTGATGCGCG TGCACGGCAA TGCGAACCCC GTCATCGTCA TGGGCGCGGG CTACGACCCT GCCGAGGATA GCGACCCGTC CCCGGGCGTC AATACGATGG GGCGCGGCCT GATCATGGTC GACGCCTACA AGGGCAACAT CGTCTGGTCC GCGCAGCCGT CCTGTGCTGG CGTCACAACG CCATGCCTGG CCGTATCCGG CATGACCCGC GCGATTCCTT CCGATGTCAC GCCGCTCGAT CGCAACGGCG ATGGCTATGT GGAGCGCGTG TACGTGGGCG ATGTAGGCGG CAATGTCTGG CGCGCCGATT TCGAAACCGC AGCCAGCAAT GCGCCCACCG CCTGGACGGT CACGAAGCTT GCGGCCCTGG GGGGCGGCGC AGGTACGAAC GATGCTCGCA AGTTCTTCTA CCCGCCTGAT GTCGTCTCGA CGAATGGATA CGATGCGATC TCGCTTGGCT CCGGCGACCG CGAGCATCCG CTTTACTCGG CGTCCACCGC ACCCGGCACC GCATACAACG TGGTCAACCG CCTCTACATG CTGAAGGACA CCAACACCAC TGGCATGTCG ACGTCGTGGA GCCCGATCAC GGAAAGCAAC CTGTTCGATG CCACGGCCAC GACCTACGAT GGCAGCAAGT CGGGCTTCTT CATCACGCTG GTCAACCCGG GCGAGAAGGC GGTGAACGCA CCGCTGACCG TCGCTGGCTA TACGAACGTG GGCACCAACA CGCCTTCCGT GCCGGTCGCC GGCGCTTGCT ATCCGAACCT TGGCACCGCC CGTAGCTATT CCTACAACTT CCTGACCAGC ATCGGCCAGA ACACGAACCG CTACATCGTG CTGGATGGCG GCGGCTTCCC GCCTTCCTCG GTGTTTGGCC TGATCACAGT CAGTACTGGC GGCAACTCCG TCGTCACGCC AGTGCTGCTG GGCGGTGGCA ACCAGACCGC CACCGGTGGC GGTGACTCGA AGTCCGGCCT CGGCGTACAG AAAGTGAAGC CTGCCGGGCT TGGCAAGCGC AAGCGTGTCT ACTGGTACGA CGAGATCGAC AAGAAGTAG
|
Protein sequence | MRKSALHLAL AGCLAMAVAL PASAEDIDLY TGLQPEAGKP NVLIVLDNAS AWNASANFTC STSGVVSSNN ANTDAGAEQC ALYNAVNSIL KSPTLLGNIN LGIMMFGDSK NFGGIMKFPS AAPYKLPLMD TTGVNNLLTY IKTIDRTGDS AGNSQVAGSM QEAWAYYAGK QGLSGITYTS PIDNPCQRNF VIYIANSVNN GKPGDTGQNA IDRLTNQAKA STAQLQQIVL PSATAKYQAN WADEWARFNY QTNLSASSTD HQNIVTYSIG VTDGSNPDYL QLAKSMASNG GGKYSEVKLG DPNGLADALM AIFNEVQAVN SVFASVSLPA SVNAQGQFLN QVYVGMFRPD ATAAPRWLGN LKQYQVGYDT NGNLVLQDAK GASAISNAQT GFISPNSVSF WTAEPPQSYS SAGYGANMST WPSGGYWQNS PSGTGWQFDS PDGEIVEKGG VAEMMRAQWL GDQTSRRLLT CNGVGTCSTS GGVSALTSFD ANNKWFTTTA GLAALNVNAT TDTTNKNVIQ ALPSTEVPNL IAWVRGQDMT PASTSTSSGG SSGNGNGKNN GGSGSGTTTT ATTMAGAEAE LGPGAPVTVR SSIHADVLHS RPAVVNYGGS IGVVVYYGTN DGVFHAVNGN RSQGIGATRA GGEIWGFIAP EFFGKLNRLY QNTPEIKLST TPAGITPTPL PRDYFFDGST TVMQDLRDPN NPHVMIFLTA RRGGSLVYAL DVTDPTNPQF VWRLSNTDLA EMGQTWSQPK VMRVHGNANP VIVMGAGYDP AEDSDPSPGV NTMGRGLIMV DAYKGNIVWS AQPSCAGVTT PCLAVSGMTR AIPSDVTPLD RNGDGYVERV YVGDVGGNVW RADFETAASN APTAWTVTKL AALGGGAGTN DARKFFYPPD VVSTNGYDAI SLGSGDREHP LYSASTAPGT AYNVVNRLYM LKDTNTTGMS TSWSPITESN LFDATATTYD GSKSGFFITL VNPGEKAVNA PLTVAGYTNV GTNTPSVPVA GACYPNLGTA RSYSYNFLTS IGQNTNRYIV LDGGGFPPSS VFGLITVSTG GNSVVTPVLL GGGNQTATGG GDSKSGLGVQ KVKPAGLGKR KRVYWYDEID KK
|
| |
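The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own pipeline. It builds the standard codon assignments, which translation table 11 shares with table 1 for all 64 codons (the two differ only in permitted start codons). The helper names `translate` and `gc_fraction` are hypothetical. Only the first 30 and last 9 bases of the gene sequence are used here. The full sequence can be pasted in the same way, since spaces are stripped.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the 10-base block spacing and normalise case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; any trailing partial codon is ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks and the final nine bases of the gene sequence.
first_30 = "ATGCGCAAAT CCGCCCTTCA TCTCGCGCTC"
last_9 = "AAGAAGTAG"

print(translate(first_30))  # MRKSALHLAL
print(translate(last_9))    # KK*
```

The first ten residues and the final two residues agree with the listed protein sequence, and the terminal `TAG` is the stop codon. Applying `gc_fraction` to the complete gene sequence is one way to compare against the GC content field. The result is not asserted here because the full sequence is not embedded in the sketch.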