Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_3391 |
Symbol | |
ID | 7970629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 3570483 |
End bp | 3572216 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644793975 |
Product | extracellular solute-binding protein |
Protein accession | YP_002945274 |
Protein GI | 239816364 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.285234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATGC GCTACACGGC ATTGGCACTG GCAGCGGCGA TGTTCGCCGC GCACGGTGCG GCATCGGCGG GCGAGGCCGA AGCCAAGAAA TGGATCGACA GCGAATTCCA GCCATCGACC TTGTCCAAGG ACAAGCAGAT GGAAGAAATG AAATGGTTCA TCGAAGCCGC CAAGAAGCTT CAGGCCAAGG GCGTGAAAGA GGTGTCCGTC GTCTCCGAAA CGCTGACCAC CCACGAGTAC GAATCGAAGA CGCTGGCCAA GGCCTTCGAG GAAATCACCG GCATCAAGGT CAAGCACGAC CTGATCCAGG AAGGCGACGT GGTCGAGAAG CTGCAGACCT CCATGCAGTC GGGCAAGTCG ATCTACGACG GCTGGGTGTC CGACTCCGAC CTGATCGGCA CGCACTACCG CTACGGCAAG ATCATGTCGC TGACCGAGTA CATGGGCGGT GCGGGCAAGG AGTACACCAA CCCCGGCCTC GACCTGAAGG ACTTCATCGG CACCAGCTTC ACCACCGCGC CCGACGGCCA GCTCTACCAG CTGCCAGACC AGCAGTTCGC CAACCTCTAC TGGTTCCGCG CCGACCTGTT CGCGCGCAAG GACCTGCAGG ACAAGTTCAA GGCCAAGTAC GGCTACGACC TGGGCGTGCC GCTCAACTGG AGCGCCTACG AGGACATCGC CGAGTTCTTC AGCGTCGACG TGAAGAACAT CGACGGCAAG CCGATCTACG GCCACATGGA CTACGGCAAG AAGGACCCGT CGCTGGGCTG GCGCTTCACC GATGCGTGGC TCTCGATGGC GGGCGCCGCC GACAAGGGCA TTCCCAACGG CATGCCGATC GACGAATGGG GCATCCGCGT GGCCGCGGAC AAGTGCACGC CCACGGGCGC GGCCGTCTCG CGCGGCGGCG CCACCAACTC GCCAGCTGCC GTGTACGCGC TCACCAAGTA CATCGACTGG ATGAAGAAGT ACGCGCCCAA GGAAGCCATG GGCATGACCT TCGGCGAATC GGGCCCGGTG CCGGCGCAGG GCCAGATCGC CCAGCAGATC TTCTGGTACA CGGCCTTCAC GGCCGACATG ACCAAGAAGG GCCTGCCGGT CGTGAACGAA GACGGCTCGC CCAAGTGGCG CATGGCCCCC GGCCCGAACG GCCCGTACTG GAAGCAGGGC ATGCAGAACG GCTACCAGGA CGTGGGCTCG TGGACCTTCT TCAAGGGCCA TGACGCCAAC AAGACGGCGG CGGCCTGGCT CTACGCGCAG TTCATCACCG CCAAGACCAC GTCGCTGAAG AAGTCGATCA CCGGCCTGAC CTTCATCCGC GACAGCGACA TCCGTTCGGA CTTCTTCACC CAGAACGCCA ACAAGTACGG CGGCCTGATC GAGTTCTACC GCAGCCCGGC GCGCGTGGCC TGGACGCCCA CCGGCACCAA CGTGCCCGAC TACCCGAAGC TCGCGCAGCT CTGGTGGAAG AACGTGGCCG AGGCCGTCAC GGGCGAGAAG ACCCCGCAGA AGGCCATGGA CAACCTGGCC GACGAGATGG ACAACGTGAT GGCCCGCCTC GAGCGCGCCG GCATGGCCAA GTGCGCGCCC AAGCTCAACC CGAAGGGCGA CCCGGCCAAG TACCTGAGCG ACGACCACGC GCCCTGGAAG AAGCTGGCCA ACGAGAAGCC CAAGGGCGAG ACCATCGACT ACAACAAGCT GCTGACGGCC TGGAAGGAAG GCAAGGTGCG CTGA
|
Protein sequence | MKMRYTALAL AAAMFAAHGA ASAGEAEAKK WIDSEFQPST LSKDKQMEEM KWFIEAAKKL QAKGVKEVSV VSETLTTHEY ESKTLAKAFE EITGIKVKHD LIQEGDVVEK LQTSMQSGKS IYDGWVSDSD LIGTHYRYGK IMSLTEYMGG AGKEYTNPGL DLKDFIGTSF TTAPDGQLYQ LPDQQFANLY WFRADLFARK DLQDKFKAKY GYDLGVPLNW SAYEDIAEFF SVDVKNIDGK PIYGHMDYGK KDPSLGWRFT DAWLSMAGAA DKGIPNGMPI DEWGIRVAAD KCTPTGAAVS RGGATNSPAA VYALTKYIDW MKKYAPKEAM GMTFGESGPV PAQGQIAQQI FWYTAFTADM TKKGLPVVNE DGSPKWRMAP GPNGPYWKQG MQNGYQDVGS WTFFKGHDAN KTAAAWLYAQ FITAKTTSLK KSITGLTFIR DSDIRSDFFT QNANKYGGLI EFYRSPARVA WTPTGTNVPD YPKLAQLWWK NVAEAVTGEK TPQKAMDNLA DEMDNVMARL ERAGMAKCAP KLNPKGDPAK YLSDDHAPWK KLANEKPKGE TIDYNKLLTA WKEGKVR
|
| |