Gene Vapar_4001 details

Gene Information

Locus tag: Vapar_4001
Symbol:
ID: 7974566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 4243417
End bp: 4244745
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 64%
IMG OID: 644794587
Product: extracellular solute-binding protein family 1
Protein accession: YP_002945881
Protein GI: 239816971
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
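
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4243417, 4244745)
print(length)                      # 1329
print(protein_length_aa(length))   # 442
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.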


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.131149
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATCC AACGACGCAG CACGCTGGCC ATGCTGGCCC TTTCATCCAC GCTCTGGCTC 
GCGAGCGGCA CACCCGCCGC CGCGCAGCCG GTGCAGGTGC AGTGGTGGCA CGCGATGGGC
GGCGCGCTCG GCGAGCGCGT GGAAGAACTG GTGAAGAACT TCAACGGCTC GCAGAACAAG
TACGCCGTCA CCGCCGTCTA CAAGGGCAAC TACGACGAGG TCATCAACGG CACCATTGCC
GCCTACCGCG CCAAGCGCGC GCCGGCGCTC GTGCAGATCT ACGAGCGCGG CTTCATGACC
ATGCTGCTGT CCGATGCAAC CATGCCGGTG CAGGACCTGC TCGACCAGCG CGGCTACAAG
GTCGACTGGG CCGATTTCGT GAAGCCGGTG GCCGGCTTCT ACAGCTACAA GGGCAAGCTG
ATGACGATGC CCTTCAATTC GTCCTCGCCG ATCCTCTGGT ACAACAAGGC GCACTTCGAG
AAGGCCGGCT TCGCCGGGCC CGCGCAGACC TGGCAGGAGC TGGAGAAGCA GCTCTACGCC
ATCAAGCAGA AGGGCATCTC GGCCTGCGGC TCGGTGCTCG CGGGCGACTA CCACTGGAGC
CTGCTCGAGA ACTACAGCGC CATCAACGAC CTGCCCTATG CCACCAAGGC CAACGGCTAT
CAGGGGCTGG ACACCGAGTT CGTCTACAAC AAGACCTCGG TGGTTTCGCA GGTGGCGCGC
ATCAAGAAAT GGATCGACGA CGACGTGATG CAGATCGCCG GCCAGGGCCT GAGCCCCGAG
CAGCTCTTCA CCTCGGGCAA GTGCTCCACC TACTTTGCCT CGACCGCTGC ACACAGCGGC
ATCGAGCGCG AATCCAAGAT CGACTGGAGC GCCACCTACC TGCCGTGGGA AGAGGGCAAG
CAGCCGAAGA ACAGCACCAT CGGCGGCGCG TCGCTCTGGG TGATGAAGGG CCAGAAGCCG
GCCGAGTACG AGGCCGTGGC CGCCTTCCTC GACTACATCG CCAAGCCCGA GACGCAGTTC
TGGTGGGTCA AGGCCACGGG CTACGTGCCG CTGACCAACA AGGCCTACGA GTTGGCCAAG
TCGCAGGGCT ACTACAAGGA GCATCCGACG CGCGAGATTG CGATCCTGCA GCTCACGCGC
GGCACGCCCA CGGCCAACTC CACGGGCTTT CACTTCGGCA ACTTCACGCA GACCATGATG
GCGCAGCGCG ACGAGTTCCA GAACGTGGTG GCCGGCAAGA AGACGCCGCA GGTGGCCATG
GACGATGCCG TCAAGCGCGG CAACGAGATC CTGCGGCAGT ACGAGAAGCT CAACAAGGGC
CGCTACTGA
 
Protein sequence
MKIQRRSTLA MLALSSTLWL ASGTPAAAQP VQVQWWHAMG GALGERVEEL VKNFNGSQNK 
YAVTAVYKGN YDEVINGTIA AYRAKRAPAL VQIYERGFMT MLLSDATMPV QDLLDQRGYK
VDWADFVKPV AGFYSYKGKL MTMPFNSSSP ILWYNKAHFE KAGFAGPAQT WQELEKQLYA
IKQKGISACG SVLAGDYHWS LLENYSAIND LPYATKANGY QGLDTEFVYN KTSVVSQVAR
IKKWIDDDVM QIAGQGLSPE QLFTSGKCST YFASTAAHSG IERESKIDWS ATYLPWEEGK
QPKNSTIGGA SLWVMKGQKP AEYEAVAAFL DYIAKPETQF WWVKATGYVP LTNKAYELAK
SQGYYKEHPT REIAILQLTR GTPTANSTGF HFGNFTQTMM AQRDEFQNVV AGKKTPQVAM
DDAVKRGNEI LRQYEKLNKG RY
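
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the method used to produce this record: it joins the blocks, computes the GC fraction, and translates the CDS. The helper names are hypothetical. The codon table is the standard one, which assigns the same amino acids as translation table 11 (table 11 differs only in the set of permitted start codons). Only the first row of blocks is used here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard codon -> amino acid mapping; '*' marks a stop codon.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = clean_sequence("ATGAAGATCC AACGACGCAG CACGCTGGCC")
print(translate(first_blocks))  # MKIQRRSTLA
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` and `translate` to the full cleaned gene sequence gives values that can be compared with the GC content and Protein Length fields.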