Gene Vapar_5515 details


Gene Information

Locus tag: Vapar_5515
Symbol:
ID: 7975372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012792
Strand:
Start bp: 216939
End bp: 217940
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 67%
IMG OID: 644796101
Product: extracellular solute-binding protein family 3
Protein accession: YP_002947375
Protein GI: 239820190
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.467837
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCTTC GACGCAGACA ACTGATCCAG GCCGCCGCCG GCGCGGCCGC ACTGGCCGGC 
GGCATGCCGG CCTTCGCCCA GGCGCGGACC AAGGTCAAGG TGGGCTACCT GCACACCCCG
GCCGTGGACG GCCACATCTG GATCGGGCAG GAGAGCGGCG CCTTTGCGAA GCAGGGGCTC
GAACTGGAGA TGATCCAGTT CACCACCGGC CTGGAGCTGT TCCAGGCCAT GATCGGCGGC
AGCCTGGACA TGCTGTCCAC CGGCGCCGTG GTGTCCAACT TTCCGGCGCG CGGCCAGGGC
AAGGTGTTCC TGATGAACAA CATCGAGTAT GCGACGGCCC AGCTGTGGGT GCGCGAGGAC
GCCGGCATCA AGACCGTGGC CGACCTCAAG GGCAAGCAGA TCTCCACCAC CACCGGCACC
ACGGCGCACG TGTTCCTGGA CCGCGCGCTG CGCTCGGGCA ACCTGGACCC GGCCAAGGAC
GTGAAGCTGG TCAACCAGCG CATGACCGAG GCCGTCACCT CCTTCATCTC CGGCGCGGTG
CCGGCGGTGG CGCTGTGGGT GCCCTTCGAT TCGGTGATCC GCCAGAAGCT GCCGGGCGCG
CGCAAGCTGA TCGACGCCTC GGCCTTCTTC CCCGAGGCCG CCATCATGGG CGGCTGGGCC
GCGCGCAACG ACTACTACGA CAAGAACCGC GCCGTCATCG CCAAGCTGAT TGCCGCATGG
GCCGAGGTCA ACGACGTGGT CACCGGCAAG CCCGACGCCG CGGCCGAGAT GCTGCAGAAG
ACCCAGTACA AGGAGGTGCC GCTCGCGGAC TTCAAGGCGC AGTTCAAGGC CTCCAAGTAC
TACACCAACG CCGAATGGCG CACGCGCTAC CAGGACGGCA CCGTCACCAA GTGGCTGCAG
CAGGTGACGG ACTTCTTCGT TGCCAACGCC AACATCCAGG GCGCACTGAA GGCCGAGCAG
TACTTCGACG CCAAGCCCTT CCTGGAGACG GTCAAGGCAT GA
 
Protein sequence
MALRRRQLIQ AAAGAAALAG GMPAFAQART KVKVGYLHTP AVDGHIWIGQ ESGAFAKQGL 
ELEMIQFTTG LELFQAMIGG SLDMLSTGAV VSNFPARGQG KVFLMNNIEY ATAQLWVRED
AGIKTVADLK GKQISTTTGT TAHVFLDRAL RSGNLDPAKD VKLVNQRMTE AVTSFISGAV
PAVALWVPFD SVIRQKLPGA RKLIDASAFF PEAAIMGGWA ARNDYYDKNR AVIAKLIAAW
AEVNDVVTGK PDAAAEMLQK TQYKEVPLAD FKAQFKASKY YTNAEWRTRY QDGTVTKWLQ
QVTDFFVANA NIQGALKAEQ YFDAKPFLET VKA
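
Consistency check (illustrative)

The coordinates, gene length, and protein length in this record agree with one another, and the arithmetic can be reproduced with a short sketch. The function names below are hypothetical and not part of any database tool. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a stop codon (the gene sequence above ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above.
print(gene_length(216939, 217940))  # 1002
print(protein_length(1002))         # 333
```

Passing the gene sequence listed above (blocks of ten bases, whitespace included) to gc_percent gives a value that can be compared with the reported GC content.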