Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_5515 |
Symbol | |
ID | 7975372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012792 |
Strand | - |
Start bp | 216939 |
End bp | 217940 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644796101 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002947375 |
Protein GI | 239820190 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.467837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTTC GACGCAGACA ACTGATCCAG GCCGCCGCCG GCGCGGCCGC ACTGGCCGGC GGCATGCCGG CCTTCGCCCA GGCGCGGACC AAGGTCAAGG TGGGCTACCT GCACACCCCG GCCGTGGACG GCCACATCTG GATCGGGCAG GAGAGCGGCG CCTTTGCGAA GCAGGGGCTC GAACTGGAGA TGATCCAGTT CACCACCGGC CTGGAGCTGT TCCAGGCCAT GATCGGCGGC AGCCTGGACA TGCTGTCCAC CGGCGCCGTG GTGTCCAACT TTCCGGCGCG CGGCCAGGGC AAGGTGTTCC TGATGAACAA CATCGAGTAT GCGACGGCCC AGCTGTGGGT GCGCGAGGAC GCCGGCATCA AGACCGTGGC CGACCTCAAG GGCAAGCAGA TCTCCACCAC CACCGGCACC ACGGCGCACG TGTTCCTGGA CCGCGCGCTG CGCTCGGGCA ACCTGGACCC GGCCAAGGAC GTGAAGCTGG TCAACCAGCG CATGACCGAG GCCGTCACCT CCTTCATCTC CGGCGCGGTG CCGGCGGTGG CGCTGTGGGT GCCCTTCGAT TCGGTGATCC GCCAGAAGCT GCCGGGCGCG CGCAAGCTGA TCGACGCCTC GGCCTTCTTC CCCGAGGCCG CCATCATGGG CGGCTGGGCC GCGCGCAACG ACTACTACGA CAAGAACCGC GCCGTCATCG CCAAGCTGAT TGCCGCATGG GCCGAGGTCA ACGACGTGGT CACCGGCAAG CCCGACGCCG CGGCCGAGAT GCTGCAGAAG ACCCAGTACA AGGAGGTGCC GCTCGCGGAC TTCAAGGCGC AGTTCAAGGC CTCCAAGTAC TACACCAACG CCGAATGGCG CACGCGCTAC CAGGACGGCA CCGTCACCAA GTGGCTGCAG CAGGTGACGG ACTTCTTCGT TGCCAACGCC AACATCCAGG GCGCACTGAA GGCCGAGCAG TACTTCGACG CCAAGCCCTT CCTGGAGACG GTCAAGGCAT GA
|
Protein sequence | MALRRRQLIQ AAAGAAALAG GMPAFAQART KVKVGYLHTP AVDGHIWIGQ ESGAFAKQGL ELEMIQFTTG LELFQAMIGG SLDMLSTGAV VSNFPARGQG KVFLMNNIEY ATAQLWVRED AGIKTVADLK GKQISTTTGT TAHVFLDRAL RSGNLDPAKD VKLVNQRMTE AVTSFISGAV PAVALWVPFD SVIRQKLPGA RKLIDASAFF PEAAIMGGWA ARNDYYDKNR AVIAKLIAAW AEVNDVVTGK PDAAAEMLQK TQYKEVPLAD FKAQFKASKY YTNAEWRTRY QDGTVTKWLQ QVTDFFVANA NIQGALKAEQ YFDAKPFLET VKA
|
| |