Gene Vapar_4355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4355 
Symbol 
ID7970546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4600868 
End bp4602355 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content66% 
IMG OID644794944 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002946232 
Protein GI239817322 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0355555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAACC GTCGCACCCT CCTTGCCACC GCCGGCGCCA CCGTGGCGCT GGCCTCTCCC 
ATCGCCGGCA TGGCGCAGGG GCGGAAGGAC GCCATCGTGA TCGGCATGGC GCTCGAGCCG
CCGGGCCTGG ACCCGACCGC CGGCGCCGCG GCCGCCATCG CGGAAGTGGT GCACTACAAC
ATCCTGGAGA CGCTCACCAA GATCAACGCC GACGGCAGCG TCACGCCGCT CCTGGCCGAG
AGCTGGGAAA TCTCGCCCGA CCTGAAGACC TACACCTTCA AGCTGCGGCG CGGCGTCAAG
TACCAGAACG GCGAGCCCTT CAATGCCGCC GCGGTGAAAT TCTCCTTCGA CCGCGCCGGC
GGCGAGAAGA GCACCAACAA GGACAAGCGC ACCTTCGCGA ACCTGAGCAC GCAGGTGGTC
GACGACTACA CCGTGGTGGT CATCAACAAG GAAATCGACC CCGACCTGCC CTTCGTGCTG
GGCCAGGCCA CGGCCGTGAT CGTCGAGCCC AAGAGCGCCG ACGGCAACGC CACCAAGCCG
GTCGGCACCG GCCCCTACAA GCTCGACAAC TGGGCCAAGG GCTCGTCGAT CACGCTGAGC
AAGTGGGAGG GCTTCCGCAG CCCGGCCACG GCCAGGATCA ACAAGGTCAC CTTCCGCTTC
ATTTCCGACA CGGCCGCGCA GGCCGCCGCG CTGATGGCCG GCGACGTCGA CGTGTTCACG
CGCATCGGCA CGCGCGCGGT GCCGCAGTTC AAGATGAACC CGCAGTTCCA GGTGATCCTG
GCCGGCTCGC GCGCCAAGAC CATTCTGTCG ATCAACAACA AGAAGAAGCC GCTGGACGAC
GTGCGCGTGC GCCGCGCCAT CCTGGCGGCC ATCGACCGCA AGGCCGTGAT CGAAGGCGCG
GCCGACGGCT TCGGCGTGCC GATCGGCAGC CACTACGTGC CGGGCGCCGC AGGCTATGTC
GACACCACGG GCATCAACCC CTTCGACCTC GAGAAGGCCA AGAAGCTGAT GGCCGAGGCC
GGCGTGAAGA CGCCGCTCGA ACTCACCATG ACGCTGCCGC CGCCGCCCTA CGCACGCCAG
GGCGGCGAGG TGATCGTGGC GCAGCTCGCC AAGATCGGCA TCACGGTCAA GGTGCAGAAC
GTGGAGTGGG CGCAGTGGCT CAGCGGCACC TACGGCAACA AGGACTACGA CCTGTCGATC
GTCTCGCACG TCGAGCCCTT CGACCTCGGC AACTACGCCA AGCCCGACTA CTACTGGGGC
TACCAGTCGA AGGCCTTCAA CGCGCTGTTC GACAAGATCA AGGCGACGGC CAATGCGGCC
GAGCGCAACA AGCTGCTCGG CGAAGCGCAG AAGATGCTGG CGGTCGATGC GGCCAACGGC
TTCCTCTACC AGCCGCAGTT CCCCACCATC GCGAAGAAGA ACGTGAAGGG CCTCTGGAAG
GAGAACCCGA TCTTCGTGAA CGACCTCTCG GCGCTGTCAT GGGGATGA
 
Protein sequence
MLNRRTLLAT AGATVALASP IAGMAQGRKD AIVIGMALEP PGLDPTAGAA AAIAEVVHYN 
ILETLTKINA DGSVTPLLAE SWEISPDLKT YTFKLRRGVK YQNGEPFNAA AVKFSFDRAG
GEKSTNKDKR TFANLSTQVV DDYTVVVINK EIDPDLPFVL GQATAVIVEP KSADGNATKP
VGTGPYKLDN WAKGSSITLS KWEGFRSPAT ARINKVTFRF ISDTAAQAAA LMAGDVDVFT
RIGTRAVPQF KMNPQFQVIL AGSRAKTILS INNKKKPLDD VRVRRAILAA IDRKAVIEGA
ADGFGVPIGS HYVPGAAGYV DTTGINPFDL EKAKKLMAEA GVKTPLELTM TLPPPPYARQ
GGEVIVAQLA KIGITVKVQN VEWAQWLSGT YGNKDYDLSI VSHVEPFDLG NYAKPDYYWG
YQSKAFNALF DKIKATANAA ERNKLLGEAQ KMLAVDAANG FLYQPQFPTI AKKNVKGLWK
ENPIFVNDLS ALSWG