Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2835 |
Symbol | |
ID | 7970724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 2979443 |
End bp | 2982568 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644793420 |
Product | putative type 4 fimbrial biogenesis PilY1-related protein signal peptide |
Protein accession | YP_002944721 |
Protein GI | 239815811 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.341777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA ACTTCTTCCG GTCGCTCTGC GCCGCTGCGA TGCTGATGCT CGGCCTGGGC TCGGCCCAGG CGGAAGACAT CGATCTTTTC GTCGGCTACA ACCAGACCAC CACGGAGGCG CCGAATGTGC TGTTCGTGGT GGACAACACG GCCAACTGGA CCCCGCTCTT CACCGCTGAA ATGAACGCGC TGGCGAGCGC CTTCGAAAGC CTGTCCTCGG GCAAGTTCAA GGTCGGGGTG ATGATGTTCA CCGAAACAGG AGGAGGCAAT TCGAATGTCG ACGGCGCCTA TCTGCGTGCT GGCATCCGCC TGATGGACGA TGCCAGCAAG CCCAAGTACG GCGCGCTGAT TCGAAACCTC GACGTGGGTG CCGACAAGTC CAACGGCGGC AAGGCCGGCA AGATGATGGC CGAGGTCTAC CGCTATCTGG CGGGCCAGGC TCCATTGGCC GGCAACAACA AGCTCAAGGC CGATTACAGG GGCAATGTCT CCACTTACGG CCAGGGCAAA AACGAATCCG CTGGATCTGT CGCATCCAGG GCAATCTGGG CGCTCCCTGG CAATCCGCTG AATGCAATCG GCAGCACCAC CTACAACGCC CCGAACACCG CGAACTGCAT CGGCACCTAC GTCATCTACA TCAGCAATGG TGCCGCACAG GACAACAACA GCGACACCAC GACTTCCACC GATGCCCTGC GCGCCGCGGG CGGCACGACC ACCGCCATTA CGCTGAACCC GAGCGGTTCC CAGGACAACG TGGCCGACGA ATGGGCCAGG TTCCTGCAAA GCAGCATGGG CGTGAAGGTC TACACCGTCG AAGCCGCGAC GGCCAGCGGC GGCCAGGGGC CGGGCTGGAC TGCATTGCTC AAGAGCATGT CGGAGCAAAG CAAGGGCGAG TACTACGACA TTTCCACCAA TCCCAACCTG GGCAAGGCGC TGAGCGATGC GCTCGACGAT ATCTTCAGCA AGATCCAGTC GGTCAACAGC GTGTTCTCGT CGGTGAGCCT GCCCGTGAGC GTGAACACGC AGGGCACCTA CCTGAACCAG GTGTTCGTGG GCATGTTCCG CCCCGACGAT TCCGCCTATC CGCGCTGGAA CGGCAATCTC AAGCAATACA AGCTGGGCGT GGATGGCAGC AACCAGCTGA TCCTGCAGGA CGCCTCGGGC GCCAACGCCA TCAACAACCA GACCGGCTTC ATCGCACCTT GCGCGCGGAG TTTCTGGACG CCGACCAGTC GCAACACCTA CTGGAGCTTC AACCCCTCTG GCGACTGCAT CAGCAGCGAC ACGACCATCG ACCTTCGCGC TTCGGACAGC CCGGACGGCA ACGTGGTCGA AAAAGGCGCG CAAGCCTACA TGCTGCGCCA AGTGCTCACG GCCGACCGCA ACGTCATGAC CTGCGCGCCG GGCGCCTGCT CGGCCTTGAC CACCTTCGCC GATGGCAATG CGGGCATCAC GTCGACGGCC CTTGGGGCGC AAACCACCAC CGAACGGACC GCGTTGATCA ACTGGCTCCG GGGCCTGGAC AACAAGGGCG ACGAGCGCAA GGACACCACC GGAAATCCCC TGACCTCCAC CACCGCCATG CGCCCCTCGG TGCACGGCGA CGTGGTGCAT TCGAGGCCGG TGGCCATCAA CTACGGCAGC GATGACCTGG CGAACGCGCA GGTGGTGGTG TTCTACGGCG CCAACGATGG CGTGCTGCGC GCCGTCAACG GCAACCAGGC CACCTCGATC GGGTCGGCAG CGCCCGGCAG CGAGCTGTGG TCCTTCGTGC CGCCGGAGTT CTACGGCAGC ATCAAGCGGC TGTACGACAA CACCACGCGC ATCAACTTTC CGGGCATTCC GGTGAGCGTG GGCGCCACCG CGCCCAAGCC TTATGGCATG GACGGCCCGA TGGCGGCCTA CCGCCAGGGC ACGTCGGCAT GGCTCTATGC GAGCATGCGC CGCGGCGGCC GGCTGATCTA TGCGTTCGAC GTGTCCAACC CCGCGACGCC CGTTCTCAAA TGGCGCCGCG GCTGCCCCAA CCAGGACAAC GACACGGGCT GCGACAGCGG TGCCAACGAC ATGAGCGGCA TCGGCCAGAC CTGGTCGGCA CCGAAGATCG TGAAGGCCAG CGGCTATGGC GCCGGCAACA CGCCCATGGT GATGTTCGGC GGCGGCTACG ACAAGTGCGA GGACGTCGAC ACCACCACGC CCGCCAGCGC CTGCGGCTCG GCCACCAAGG GCAAGAAGAT CTATGTGCTC GACGCCAGCA CGGGCGACGT CCTCAAGACC TTCGACACCG TCCGCGCGGT CAGCGGGGAG GTCACCATCG TGAACGACAG CACCGGCCTC GCCAAGTTCG CCTACGCGGC CGACCTGGGC GGCAACGTGT ACCGCATCAC CATCGGAAGC GGCGCGCCCG CCAGCTGGAG CATGGTCAAG ATCGCCTCGC TCGGCTGCAG CGACACCGTG ACCGCCTGCA CGCCCAACCG CAAGTTCATG TTCGGTCCCG ACGTGGTCGA GGACAACGGC ATGTACGTGC TGCTGCTGGG CTCGGGCGAC CGCGAGAAGC CGCTGCGCTC CTACTCCGCC GCGCTCAGCG TTTCCAACCG CTTCTACATG CTGGTCGACA AGCCCTCCGA CACCGCTTGG CTCACCGCGG AGGCCAGCCA CTGCGACGGC AACTCGCTGC TGTGCCATAA CTCGCTCTTT GCCATCACGG GCACCACGGC GCCGTCCGCG ACCGAGCTCG CGGCCAAGAA GGGCTGGTAC CTCGCGCTTG CGGCCGGCGA GCAGGTGGTG ACCTCGTCGG TCACGGCCTA TGGCAACACG ACCTTCAACA CCCACACGCC CACCGACCCG GCCGTGAGCC AGTCGTGCCG CTCCAACCTG GGCACCGCCA ACGTCTACAA CCTGTCGTAC CTGGACGCCA CGGGGCAGAA CGGCGCGCGC TTCCAGAACG TGATTGGCGG CGGTCTCGCG CCATCGCCCG TGGTCGGCCG GGTCATGGTG GGCGACACCT TGCGCGACGT GGTGATCGGC GCCAATCCCG ACTCGTTCCT GAGCCCGAAG GGCGCCACGG TCAAGGCCGC CTTCAAGCAG CCCAAGGGCC GCGTCTACTG GTTCATCCAG AAGTAA
|
Protein sequence | MKKNFFRSLC AAAMLMLGLG SAQAEDIDLF VGYNQTTTEA PNVLFVVDNT ANWTPLFTAE MNALASAFES LSSGKFKVGV MMFTETGGGN SNVDGAYLRA GIRLMDDASK PKYGALIRNL DVGADKSNGG KAGKMMAEVY RYLAGQAPLA GNNKLKADYR GNVSTYGQGK NESAGSVASR AIWALPGNPL NAIGSTTYNA PNTANCIGTY VIYISNGAAQ DNNSDTTTST DALRAAGGTT TAITLNPSGS QDNVADEWAR FLQSSMGVKV YTVEAATASG GQGPGWTALL KSMSEQSKGE YYDISTNPNL GKALSDALDD IFSKIQSVNS VFSSVSLPVS VNTQGTYLNQ VFVGMFRPDD SAYPRWNGNL KQYKLGVDGS NQLILQDASG ANAINNQTGF IAPCARSFWT PTSRNTYWSF NPSGDCISSD TTIDLRASDS PDGNVVEKGA QAYMLRQVLT ADRNVMTCAP GACSALTTFA DGNAGITSTA LGAQTTTERT ALINWLRGLD NKGDERKDTT GNPLTSTTAM RPSVHGDVVH SRPVAINYGS DDLANAQVVV FYGANDGVLR AVNGNQATSI GSAAPGSELW SFVPPEFYGS IKRLYDNTTR INFPGIPVSV GATAPKPYGM DGPMAAYRQG TSAWLYASMR RGGRLIYAFD VSNPATPVLK WRRGCPNQDN DTGCDSGAND MSGIGQTWSA PKIVKASGYG AGNTPMVMFG GGYDKCEDVD TTTPASACGS ATKGKKIYVL DASTGDVLKT FDTVRAVSGE VTIVNDSTGL AKFAYAADLG GNVYRITIGS GAPASWSMVK IASLGCSDTV TACTPNRKFM FGPDVVEDNG MYVLLLGSGD REKPLRSYSA ALSVSNRFYM LVDKPSDTAW LTAEASHCDG NSLLCHNSLF AITGTTAPSA TELAAKKGWY LALAAGEQVV TSSVTAYGNT TFNTHTPTDP AVSQSCRSNL GTANVYNLSY LDATGQNGAR FQNVIGGGLA PSPVVGRVMV GDTLRDVVIG ANPDSFLSPK GATVKAAFKQ PKGRVYWFIQ K
|
| |