Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5947 |
Symbol | |
ID | 7974986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012792 |
Strand | - |
Start bp | 663947 |
End bp | 665512 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644796511 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002947785 |
Protein GI | 239820600 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.281116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCATT CCTTCTTCCG TCTGGCAGCC ATGGCCGGTG CCGGCCTCGC CTGCGCCGCG GCACTCGCGC AAACCACCGT GACGGTCGCG CAGCCGGCGG ACATCCGTTC CACCAATCCC GGCGTCAATC GCGACAACAC CACCGACGGC ATCGTGCTCA ACATGGTCGA AGGGCTCGTG GGCTACCGGC AGGACGGCAG CGTGGGACCG CTGCTCGCGC AATCGGTCGA CGTCTCCAAG GATGGGCTGA CCTACACTTT CAAGCTGCGC AAGGGCGTCA AGTTCCACAA CGACGCGCCG CTCACCTCGG CCGAAGTCGC CTGGAGCTGG AAGCGCTACA TGGACCCGAA GACCGACTGG CGCTGCCTCA GCGAGTACGA CGGGCGCAAC GGCCTCAAGG TGGTGGACGC CGCAACGCCC GACGACGCCA CCTTCGTGCT GAAGATCAAC CGTCCCTCGG CCGTGTTCCT CGATACGCTG GCGCGCTCCG ACTGCGGCAT GACGGCCATC CTGCATCCGA GCTCGGTCAA GCCCGACGGC AGCTGGGACA AGCCCGTGGG AACCGGCCCG TTCAAGTTCG GCGAATGGAA GCGCGGCGAG TACGTCACGG TGACCGCGTT CAAGGGCTAC ACCTCGTCGC CGGGCGCCAC CAAGGCCGAT GGCTACGTGG GCCTGAAGGC GCCGCTCGTG GACACGGTGC GCTTCCTGGT TGTGCCCGAC GCGGCCACCG CCGCGGCCGG CCTCAAGTCC GGAGCCATCG ACGCCGGCCA GATCACATCG AGCGATGCGC AGGAGCTCAA GGCCGATCGC AACCTCGTCG TGCAGGCACC CACCGAGGCG GTGAAGAACA CGCTGCTCTT CCAGACGCGC GATCCGCTGC TGAAGAACGT GAAGCTGCGT CAGGCGATCG CGGCCTCGCT CGACATGGAG CAGATCGTCG CGGCCGCATC CGAAGGCCTG GGTTCGGTCA ACAACTCGGC GATTCACAAG GGCTCGGCCT TTTACGGCGC GGCCGAAAAG AAGGGCTTCC ATTACGACCC CGCCCTGGCC GCGAAGCTGC TGCGCGAGGC CGGCTACAAG AACGAGAAGA TCACCATCCA GACCAACAAG CGCGCCCATG TGCCGAGCTA CACCGTGGCG GTGCTGGCGC AGGCGATGAT GCAATCGGTG GGGATCAACG CGCAGATCGA AGTGCTGGAA TGGGCCACGC AGCTCGACCG GTACAACAGC GGGAACTACC AGATGAGCTC GTTCAGCTAT TCGTCACGGC TCGATCCCGC GCTGAGCTAC GAGCAGTTCT CCGGGCCGAA GGACAAGCAG GCGCGCAAGG TGTGGGAAGA CCCGCAGGCG CTGAAGCTGC TCGACGAGAG CTTCACCGAG ACCGACCGGG CAAAGCGCCA GGCGCTGTTC GACCAGCTCC ACGCATTGAT GATCCAGCAG GTCCCGATGA TCATGCTGTT CAACGGCATC GACGCCTGGG GCGTGCGCAA GCGCCTCGCC GGGTTTTCGG TATGGGAAGG CAAGCCGCGG CTGTGGGGGG TGTCGGCCGC GGCCAAGGCG GGCTGA
|
Protein sequence | MPHSFFRLAA MAGAGLACAA ALAQTTVTVA QPADIRSTNP GVNRDNTTDG IVLNMVEGLV GYRQDGSVGP LLAQSVDVSK DGLTYTFKLR KGVKFHNDAP LTSAEVAWSW KRYMDPKTDW RCLSEYDGRN GLKVVDAATP DDATFVLKIN RPSAVFLDTL ARSDCGMTAI LHPSSVKPDG SWDKPVGTGP FKFGEWKRGE YVTVTAFKGY TSSPGATKAD GYVGLKAPLV DTVRFLVVPD AATAAAGLKS GAIDAGQITS SDAQELKADR NLVVQAPTEA VKNTLLFQTR DPLLKNVKLR QAIAASLDME QIVAAASEGL GSVNNSAIHK GSAFYGAAEK KGFHYDPALA AKLLREAGYK NEKITIQTNK RAHVPSYTVA VLAQAMMQSV GINAQIEVLE WATQLDRYNS GNYQMSSFSY SSRLDPALSY EQFSGPKDKQ ARKVWEDPQA LKLLDESFTE TDRAKRQALF DQLHALMIQQ VPMIMLFNGI DAWGVRKRLA GFSVWEGKPR LWGVSAAAKA G
|
| |