Gene Information |
Locus tag | Vapar_4001 |
Symbol | |
ID | 7974566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4243417 |
End bp | 4244745 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644794587 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002945881 |
Protein GI | 239816971 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
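The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Vapar_4001.
length = gene_length(4243417, 4244745)
print(length, protein_length(length))  # prints "1329 442"
```

Both results agree with the Gene Length (1329 bp) and Protein Length (442 aa) fields of the record.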
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.131149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCC AACGACGCAG CACGCTGGCC ATGCTGGCCC TTTCATCCAC GCTCTGGCTC GCGAGCGGCA CACCCGCCGC CGCGCAGCCG GTGCAGGTGC AGTGGTGGCA CGCGATGGGC GGCGCGCTCG GCGAGCGCGT GGAAGAACTG GTGAAGAACT TCAACGGCTC GCAGAACAAG TACGCCGTCA CCGCCGTCTA CAAGGGCAAC TACGACGAGG TCATCAACGG CACCATTGCC GCCTACCGCG CCAAGCGCGC GCCGGCGCTC GTGCAGATCT ACGAGCGCGG CTTCATGACC ATGCTGCTGT CCGATGCAAC CATGCCGGTG CAGGACCTGC TCGACCAGCG CGGCTACAAG GTCGACTGGG CCGATTTCGT GAAGCCGGTG GCCGGCTTCT ACAGCTACAA GGGCAAGCTG ATGACGATGC CCTTCAATTC GTCCTCGCCG ATCCTCTGGT ACAACAAGGC GCACTTCGAG AAGGCCGGCT TCGCCGGGCC CGCGCAGACC TGGCAGGAGC TGGAGAAGCA GCTCTACGCC ATCAAGCAGA AGGGCATCTC GGCCTGCGGC TCGGTGCTCG CGGGCGACTA CCACTGGAGC CTGCTCGAGA ACTACAGCGC CATCAACGAC CTGCCCTATG CCACCAAGGC CAACGGCTAT CAGGGGCTGG ACACCGAGTT CGTCTACAAC AAGACCTCGG TGGTTTCGCA GGTGGCGCGC ATCAAGAAAT GGATCGACGA CGACGTGATG CAGATCGCCG GCCAGGGCCT GAGCCCCGAG CAGCTCTTCA CCTCGGGCAA GTGCTCCACC TACTTTGCCT CGACCGCTGC ACACAGCGGC ATCGAGCGCG AATCCAAGAT CGACTGGAGC GCCACCTACC TGCCGTGGGA AGAGGGCAAG CAGCCGAAGA ACAGCACCAT CGGCGGCGCG TCGCTCTGGG TGATGAAGGG CCAGAAGCCG GCCGAGTACG AGGCCGTGGC CGCCTTCCTC GACTACATCG CCAAGCCCGA GACGCAGTTC TGGTGGGTCA AGGCCACGGG CTACGTGCCG CTGACCAACA AGGCCTACGA GTTGGCCAAG TCGCAGGGCT ACTACAAGGA GCATCCGACG CGCGAGATTG CGATCCTGCA GCTCACGCGC GGCACGCCCA CGGCCAACTC CACGGGCTTT CACTTCGGCA ACTTCACGCA GACCATGATG GCGCAGCGCG ACGAGTTCCA GAACGTGGTG GCCGGCAAGA AGACGCCGCA GGTGGCCATG GACGATGCCG TCAAGCGCGG CAACGAGATC CTGCGGCAGT ACGAGAAGCT CAACAAGGGC CGCTACTGA
|
Protein sequence | MKIQRRSTLA MLALSSTLWL ASGTPAAAQP VQVQWWHAMG GALGERVEEL VKNFNGSQNK YAVTAVYKGN YDEVINGTIA AYRAKRAPAL VQIYERGFMT MLLSDATMPV QDLLDQRGYK VDWADFVKPV AGFYSYKGKL MTMPFNSSSP ILWYNKAHFE KAGFAGPAQT WQELEKQLYA IKQKGISACG SVLAGDYHWS LLENYSAIND LPYATKANGY QGLDTEFVYN KTSVVSQVAR IKKWIDDDVM QIAGQGLSPE QLFTSGKCST YFASTAAHSG IERESKIDWS ATYLPWEEGK QPKNSTIGGA SLWVMKGQKP AEYEAVAAFL DYIAKPETQF WWVKATGYVP LTNKAYELAK SQGYYKEHPT REIAILQLTR GTPTANSTGF HFGNFTQTMM AQRDEFQNVV AGKKTPQVAM DDAVKRGNEI LRQYEKLNKG RY
|
| |
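The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the amino-acid assignments of NCBI translation table 11, which match the standard code. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (NCBI translation table 11;
# the amino-acid assignments are the same as in the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace.

    Stops at the first stop codon when to_stop is True; otherwise stop
    codons are emitted as '*'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence in the record
# (spaces as in the listing are tolerated).
print(translate("ATGAAGATCC AACGACGCAG C"))  # prints "MKIQRRS"
print(translate("CGCTACTGA"))                # prints "RY"
```

The two fragments reproduce the start (MKIQRRS) and end (RY) of the listed protein sequence; passing the full gene sequence works the same way.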