Gene Information |
Locus tag | Vapar_3003 |
Symbol | |
ID | 7973723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 3158851 |
End bp | 3160401 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644793588 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002944889 |
Protein GI | 239815979 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
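The gene and protein lengths listed above can be checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which agrees with the listed 1551 bp) and that the last codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3158851, 3160401)
print(length_bp, protein_length(length_bp))  # 1551 516
```

The 1551 bp correspond to 517 codons; dropping the stop codon gives the 516 aa reported for the protein.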
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAT CTTCTTCCTC CCTGCGATGG GCATCGGCAC TGCTGGGCCT GGCCGCACTG GCTGCATCGG GCACGGCACT GGCCGCGAAA GACGTGGTGC TGTCGATCGG CTACCAGCCC GAAACGCTGG ACCCGTACAA CACCAACACC ACCATCACCA CGGCCGTGAC CAAGACCTTC TATGAAGGCC TGTTCCAGTT CGACAAGGAC CTGAAGGTGC AGAACGTGCT GGCCGAGAGC TACGACGTGT CGAAGGACGG CTTGGTCTAC ACCATCAAGC TGCGCAGCGG CGTCAAGTTC CACGACGGCA CCGACTTCAA CGCAGAGGCC GTGAAGTTCG TGCTCGACCG TGTGCTCAAC CAGGACAACA AGCTGCTGCG CTACAACCAG TTCAACCGCG TGAGCAAGGT CGAGGCGCTG AACCCCACCA CCGTGCGCAT CACGCTCAAG GAGCCCTTCG GCCCCTTCAT CAACTCGCTG GCCCATGCCT CGGCCGCGAT GATCTCGCCC ACCGCGCTCA AGAAATGGGG CAACAAGGAC ATCGCTTTCC ACCCCGTGGG CACCGGCCCC TTCGAATTCG TCGAGTGGAA GCAGACCGAA GCCGTCAAGG CCAAGAAGTT CGACGGCTAC TGGAAGAAGG GCTATCCGAA GATCGACACG GTCTCGTGGA AGCCGGTGCT CGAGAACAAC ACGCGCGCCG CGATGCTGCA GACCGGCGAA GCCGACTTCG CCTATCCCAT TCCCTATGAG CAGGCCGACC TGCTGAAGAA GAGCGAGAAG CTCGAAGTGG TGGCGACGCC GTCGATCATC ACGCGCTTCC TGGCATTCAA CATGCTGCAG AAGCCCTACG ACAACCCGAA GGTGCGCGAA GCCATCGGCT ACGCCATCAA CAAGGAAGCG CTGGCCAAGG TCGCGTTCGG CGGCTATGCC TTCCCCGCGC AAGGCATCGT GCCGCAGGGC GTCAAGTACG CCGAGAAGAT GGCGCCCATT CCCTACGACC TGAAGAAGGC CAAGGAGCTG ATGAAGGAAG CCGGCTACCC CGATGGCTTC GAGTCGGTGC TGTGGAGCGC CTACAACAAC ACGACCAGCC AGAAGACCAT CCAGTTCGTG CAGCAGCAGC TCGCGCAGAT CGGCATCAAG CTGCAGGTGC AGGCGCTCGA AGTGGGCCAG CGCACCGAGC AGGTGGATGC ATGGCCCGAT CCGAAGACGG CGAAGGTCCG CATGTACTAC ACGGGCTGGT CGTCGTCGAC CGGTGAAGCC GACTGGGGCC TGCGCCCGCT GTTCGCCTCG GAAGCCTGGG CACCCAAGCT CAACAACATG TCGTTCTACA AGAGCGAGGT GGTGGACAAC GCGCTGGCCA AGGCGCTGGT GACGGTCGAC GAAAAGGAAC GCACCGCGCT CTACAAGACC GCGCAGGAAG AGATCCGCAA GGACCTGCCG CGCGTGCCGA TGGTCACGGA GAAGAACCTG TCGGCGCATG CCAAGCGCTT GTCGGGTGTG TTCGTGATGC CTGACGGCAA CATCAACATC GACGCCATCG CGGTCAATTG A
|
Protein sequence | MKQSSSSLRW ASALLGLAAL AASGTALAAK DVVLSIGYQP ETLDPYNTNT TITTAVTKTF YEGLFQFDKD LKVQNVLAES YDVSKDGLVY TIKLRSGVKF HDGTDFNAEA VKFVLDRVLN QDNKLLRYNQ FNRVSKVEAL NPTTVRITLK EPFGPFINSL AHASAAMISP TALKKWGNKD IAFHPVGTGP FEFVEWKQTE AVKAKKFDGY WKKGYPKIDT VSWKPVLENN TRAAMLQTGE ADFAYPIPYE QADLLKKSEK LEVVATPSII TRFLAFNMLQ KPYDNPKVRE AIGYAINKEA LAKVAFGGYA FPAQGIVPQG VKYAEKMAPI PYDLKKAKEL MKEAGYPDGF ESVLWSAYNN TTSQKTIQFV QQQLAQIGIK LQVQALEVGQ RTEQVDAWPD PKTAKVRMYY TGWSSSTGEA DWGLRPLFAS EAWAPKLNNM SFYKSEVVDN ALAKALVTVD EKERTALYKT AQEEIRKDLP RVPMVTEKNL SAHAKRLSGV FVMPDGNINI DAIAVN
|
| |
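The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the listed protein. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in their permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
# Assumption: these elongation assignments apply to translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(grouped: str) -> str:
    """Remove the spaces and line breaks between the ten-character blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGAAGCAAT CTTCTTCCTC CCTGCGATGG")
print(translate(prefix))  # MKQSSSSLRW
```

Passing the full gene sequence through `clean` and then `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared with the GC content reported in the gene table.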