Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5681 |
Symbol | |
ID | 7975734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012792 |
Strand | + |
Start bp | 390078 |
End bp | 394907 |
Gene Length | 4830 bp |
Protein Length | 1609 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644796264 |
Product | RHS protein |
Protein accession | YP_002947538 |
Protein GI | 239820353 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACC AGGCCCCCCA GAACGACGCG CAAGCCAAGG AGCGCGAGCG CCAGACCGCG GTGGCACCGC TCAACACCAT CGGCGAGGAA GACATCGGCG CCGGCGCCGC CAAGATCGAC AAGTGGCTGC GCAAGCTCAC CAACGACTAC GTCACGCTGA ACGTGCTGGC AACCTTCGCG GGCAGCCTGC CGGTCATCGG CAACATCATG GCGCTCATCG ACGCGGTCTG GGACCTCATC GAGATGGTCA CCAAGAAGGC GTACAGCGAC GTGCTGCAGT GGGTCAGCCT GGCCATCAAC CTGCTGGGCG TGATCCCGTT TCCGCCCGCC ACCGGCGCAG CGCGCATGAG CCTGCGGCCC ATGCTGCACC TGCTCAAGCA GGAGATCGCC AAGTCGGCCA AGAACGCGGT GCCGAACATC GGCGAGGCGT TCATTGCCGT GCTCATCACG CACCTGAACG ACACCATCGC CGGCACGCTC GACAAGTTCG TGGACGAGGC GCTCGGCTAC CTCGACGACT TCCTGCAGTC CTGCGCAAAA AAGGTCGACG GCATCGCCGA TGCGCTGATC GGCGCGCTGC AGGTGGCGCT GGGCGAGAAG CCGGTGTTCG CAACCGGCGC CGCTGCCGAG CGGAACACCT ACGACCCCAA GACCCAGAGC ACCTGGAGCC GCATGATGGC CGCCGCGGCC GAGGCCGCGA AGAAATCCGC CAACTACGCG GCGGCCACCG CGAGCCACTA TGCGCTGCCC GACAGCGCCA AGGCCATGAT CCGGGACACC GTCGCCAGCC TGACCGACCT CAAGGGCGCG GCCCGCCGGC AGATCATGCG GCTGGGCGAC GAGAGCCTGC AGTACGGCAT CAAGTGGATG ATCAAGATCC TCAAGGATGC ATTGCTCAGG CGCAAGGGAA AGCACAGCGC CAACATTTCC GCGAACAACG GAACGCAGGT CGAAAGCAAG AAGCCCGGCG GCGAACAGGC CGCCACCTCC GCCCAGGTGC CAGCCGGCGG CGACCCCGGC TGCAAGAACT GCACGAGCGC CGCCGCGGGC GGCGCCATCA GCCTGGCCAC CGGCTGCGAG AGCTTCGACC ACACCGACTT CGTGCTGGGC GCCCCCCTGC CCATCACCTG GACGCGCACC TACCGCAGCG ATCTTGCCGC CTTCGACCAG GGCAGCCTCG GCGCGCGCTG GATCACGCCC TTCAGCACGC GCGTCGATGT TGCCAAGCCC GCCAGGGGCC GCCGCCAGGG CCAGATGAGC CTGATCTATC GCGGTGCCGA CGGCCGCAGC CACGCCTTCC CGCTGCTGGC CGTGGGCCAG AGCCATCGCA ACCCGATCGA GGAGATCACC CTCACCCGCC TCGGCGAACG GCTGCTCGCG GTCGATTTCG GCAAGCCCAT GCCGGCCGGC CAGGCCCCCG ACTGGCGCGA GACCTACGAG CTGGTCGACA CCGTGCCCGC CAAGGCCGCC GCGCAAGGCA AGCAGCACTT CCGGCTGGTG GCGCAGCACA CGAACAGCGG CGCCGCCGTC GGCCTGCGCT ACGACCACGT GATCGCGGCC ACCGGCGAGC AGGTGCTCAG CGACATCATC AGCAGGCAGG GCGAGGCCAT CATCGCCCAT GTCGGCACCC AGCCCGATGC GCAGACCGGC CTCATCAAGG CCCTGTGGGA GCTGAAGGAA GGCCGCGTCG TGCGCCAGCT GGCCGCCTAC ATCCACGACG CCGAAGGCGA CCTCGTCGCC GCGCAGGACG AGAACGGCGC CGGCTGGCAG TACAGCTACA GCCACCACCT CGTCACCCGC TACACCGACC GCACCGGCCG CGGCATGAAC CTGCAGTACG ACGGCACCGG CGCCGATGCC AAGGCCGTGC GCGAATGGTC CGACGACGGC AGCTTTGCGC TCACGCTCGA GTGGGACAAG AACATCCGCC TCACCTACCT GACCGACGCC ATGGGCGGCG AGACCTGGTA CTACTACGAC GTGCTCGGCT ACACCTACCG GATCATCCAC CCGGACAAGC GCGAGGAATG GTTCCTGCGC GACGACGCCA AGAACATCAC GCGCCACATC CACACCGACG GCACCACCGA CGACTACGTC TACGACGCCC ACGGCAACCT CAAGACCCAC ACCCGCGCCG ACGGCAGCAG CGTGCACTAC GCCTACGACG CGCGCCACCG CATCACCGGC ATCCTCGACG CCGAAGGCGG CGCCTGGAAG CGCGACTACG ACGCAAGCGG CAACCTCACC GAAGAGATCG ACCCGCTCGG CCACAAGACC GAATACGCCT ACGACAACGC CGGGCGCCCC GTGCGCATCA CCGACGCCAA GGGCGGCATC AAGACCCTCG CCTACACCCC CGACGGCCAG CTGGCCAGCC ACACCGACTG CTCGGGCAAG ACCACCCAGT GGGCCTACGA CGGCCGCGGC CGCCTGGCCA GGATCACCAA CGCCCTGGGC CAGGCCACGC GCTTTCGCTA CACCGAAGCC GGCGAAGCCG CGCAGTCCGC GATGCCCGGC CAGGCCAACA ACCACCCGGG CCAGCTCGAG GAGATCGTCC ACCCCGACAA CACCAGCGAG TACTTCGCGC ACGACGCCGA AGGCCGCCTG CTCGCGCACA CCGATGCGCT GGGCCGCCGC ACCAGCTACG GCTACACCCG CGCCGGCCTC GTCGCCCAGC GCACCGATGC GGCCGGCCAC ACCCTCAAAT ACCACTGGGA CCTGCTGGGC CGCCTGCGCG AACTGCACAA CGAGAACGGC AGCCGCTACG ACTTCCGCTA CGACCCCGTC GGCAAGCTGC TCGAAGAAAC CGGCTTCGAC CGCAAGGGCA CCCAGTACCG CTACGACGAA GCCAGCGGCG TGCTGGCCGA GGTCATCGAA GCCGGCCACA GCACCCGGCT CGCGTTCGAC CCCCTGGGCC GCCTGAGCGA GCGCCAGGCC GGCGACGATG CCGAGCGCTT CGCCTACGAC CGCAACGGCC GCCTGGTCGA GGCCACCAAC GCCGAGGCCA GGCTCCAGTG GTTCTATGAC CGCGCGGGCA ACCTCGTGCG CGAGCACCAG CACTACCTGG ACCACGGCCA CACGGCCGTC TGGCAGCACG GCTACGACGA GCTCAACCAG CGCATCGCCA GCATCCGCCC CGACGGCCAC GTCACCCAAT GGCTCACCTA CGGCTCCGGC CACGTGCACG GCCTGCTCGT GGACGGCCAG GACATCCTGG GCTTCGAGCG CGACGACCTG CACCGCGAGA TCGGCCGCGA ACAGGGCAAC GGGCTGACGC AGAAGCTGCA CTACGACCCG GCCGGCCGGC TGCTGGAGCA GCAGATTTCG CAGACCAGGC CCGGCGCGAT CGAGGCCGTC GGCATCCGCC GCAGCTATGC CTACGACAAG GCGGGGCAGC TGGTGGCCAT CGGCGACAGC CGCCGCGGCA ACCTGAGCTA CCGCTACGAC CCCGTGGGGC GCCTGCTGGA AGCCCACAGC CGCCTGGGAC GCGAGACCTT CGCGTTCGAT CCGGCGGGCA ACATCGGCAG TCCATCCGAC GTCGGAGCCG ACACCCAAGC CGCAGGTCGC ATGACCACCC GCGTCGCGGT GCGCCTGGGC GGTGATGGCC GCAGCATGGC CGGTCGGCTG ATGGACAACC TGCTCAAGGA CTATGCGGGC ACCCACTACA CCTGGGACGA GCGGGGCAAC CTGATCGAGC GCAGCCGCAA CGGAGAAATC ACCACCTTCA CCTGGGACGG CTTCAACCGC ATGCGCAGCG CCGAGACCTT CGGTGAGACC ACAAGCTTCA GCTACGACGC GCTGGGCCGG CGCATCGCCA AGCGAAGCTC TCACGCGACG ACCTTGTTCG GCTGGGACGG CGACACGCTG GCCTTCGAGA GCACGCAGAG CACCGAGGGC CAGCAGGAGC TGCAGGCCTG GCGCGGTGAC AGCGTGCACT ACATCCACGA GCCCGGCTCC TTCGTGCCGC TGGTGCAGAT CAGGCAGGCC CAAGCCGTGG CGCTGAGCCA GACCACCGAC GTGAAGGCGC TGATCGCGGC CAATGGCGGG CGCTACGACA TCGAGCAGGA TCCGTTATGG AATGGGGAGC AACTCAAGAC GCCGCAGGCC TTTGCGAGAG AAGAGATCGC GTTCTACCAA TGCGACCACT TGGGCACGCC GCAGGAGCTG ACGGACCATG AGGGGCGGAT CGCCTGGTCT GCGCAATACA AGGCCTGGGG CGAAGCCAAG CAGGCGATCA GCGAGGCGGG ACGGAAGGCG GGGTTCAGGA ATCCGATTCG GTTCCAGGGG CAGTACTTTG ACGACGAGAC GGGGCTGCAC TACAACCGAC ACCGGTATTA CGACCCGAGT TGCGGGCGGT TTGTCTCGAA AGATCCCATC GGACTGGCCG GCGGGAGCAA TCTCCAGCAG TACGCCCCCA ATCCACTCGG TTGGATAGAC CCGTTGGGAC TGGCCGGTAA GGGAATTACA CCGAACAACA AGGGAACTCG TACCACTATT GAAGGAGGCA ATCTGCCTCA ACCAGTTGCC GGATATAGCA CAAAAGCGGG CGGAAATGGC GCCTCCCATC CTGTTGTTCG AGACCTGTAT GACTCTGTTC CTCGAGATGA ACGATCCGTC TTCCACGGTG ATTGCGGCGA GGCAGACGCA TTGAGCAAAA TTGCATCGTT AAATAATGCT AAAAGCATCG AGCAGCTACA AGCTCTCACC AACGGAAGTG CCACCGAAAC ACTTAGAAAT GATGGAAAAC TTATGGTGTG CTGTTCTTCT TGCAGGCACG TAACAGGAAC TCTCGGAATT CGAGATAATG CAAGGAATCG AGGAAAATGA
|
Protein sequence | MADQAPQNDA QAKERERQTA VAPLNTIGEE DIGAGAAKID KWLRKLTNDY VTLNVLATFA GSLPVIGNIM ALIDAVWDLI EMVTKKAYSD VLQWVSLAIN LLGVIPFPPA TGAARMSLRP MLHLLKQEIA KSAKNAVPNI GEAFIAVLIT HLNDTIAGTL DKFVDEALGY LDDFLQSCAK KVDGIADALI GALQVALGEK PVFATGAAAE RNTYDPKTQS TWSRMMAAAA EAAKKSANYA AATASHYALP DSAKAMIRDT VASLTDLKGA ARRQIMRLGD ESLQYGIKWM IKILKDALLR RKGKHSANIS ANNGTQVESK KPGGEQAATS AQVPAGGDPG CKNCTSAAAG GAISLATGCE SFDHTDFVLG APLPITWTRT YRSDLAAFDQ GSLGARWITP FSTRVDVAKP ARGRRQGQMS LIYRGADGRS HAFPLLAVGQ SHRNPIEEIT LTRLGERLLA VDFGKPMPAG QAPDWRETYE LVDTVPAKAA AQGKQHFRLV AQHTNSGAAV GLRYDHVIAA TGEQVLSDII SRQGEAIIAH VGTQPDAQTG LIKALWELKE GRVVRQLAAY IHDAEGDLVA AQDENGAGWQ YSYSHHLVTR YTDRTGRGMN LQYDGTGADA KAVREWSDDG SFALTLEWDK NIRLTYLTDA MGGETWYYYD VLGYTYRIIH PDKREEWFLR DDAKNITRHI HTDGTTDDYV YDAHGNLKTH TRADGSSVHY AYDARHRITG ILDAEGGAWK RDYDASGNLT EEIDPLGHKT EYAYDNAGRP VRITDAKGGI KTLAYTPDGQ LASHTDCSGK TTQWAYDGRG RLARITNALG QATRFRYTEA GEAAQSAMPG QANNHPGQLE EIVHPDNTSE YFAHDAEGRL LAHTDALGRR TSYGYTRAGL VAQRTDAAGH TLKYHWDLLG RLRELHNENG SRYDFRYDPV GKLLEETGFD RKGTQYRYDE ASGVLAEVIE AGHSTRLAFD PLGRLSERQA GDDAERFAYD RNGRLVEATN AEARLQWFYD RAGNLVREHQ HYLDHGHTAV WQHGYDELNQ RIASIRPDGH VTQWLTYGSG HVHGLLVDGQ DILGFERDDL HREIGREQGN GLTQKLHYDP AGRLLEQQIS QTRPGAIEAV GIRRSYAYDK AGQLVAIGDS RRGNLSYRYD PVGRLLEAHS RLGRETFAFD PAGNIGSPSD VGADTQAAGR MTTRVAVRLG GDGRSMAGRL MDNLLKDYAG THYTWDERGN LIERSRNGEI TTFTWDGFNR MRSAETFGET TSFSYDALGR RIAKRSSHAT TLFGWDGDTL AFESTQSTEG QQELQAWRGD SVHYIHEPGS FVPLVQIRQA QAVALSQTTD VKALIAANGG RYDIEQDPLW NGEQLKTPQA FAREEIAFYQ CDHLGTPQEL TDHEGRIAWS AQYKAWGEAK QAISEAGRKA GFRNPIRFQG QYFDDETGLH YNRHRYYDPS CGRFVSKDPI GLAGGSNLQQ YAPNPLGWID PLGLAGKGIT PNNKGTRTTI EGGNLPQPVA GYSTKAGGNG ASHPVVRDLY DSVPRDERSV FHGDCGEADA LSKIASLNNA KSIEQLQALT NGSATETLRN DGKLMVCCSS CRHVTGTLGI RDNARNRGK
|
| |