Gene Vapar_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3872 
Symbol 
ID7969729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4101541 
End bp4102530 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content70% 
IMG OID644794458 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002945752 
Protein GI239816842 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCTG ACTTCCCCAT CGACAGGCGA CGCCTGCTCC AGGCCGCCGC GGGCTGGAGC 
GCCCTGGGCG CGGTGGGCGC GGCACCGGCG CGCGACCTTT CTGGCGTGAC ACTGCGCGTG
GGCACCTACA AGGGGCTCTG GCGGCCGCTG CTGCAGGCCT CCGGCCAAGC CAATACGCCC
TACAAGATCG ACTGGCGCGA GCTCAACAAC GGCGTGCTGC ACATCGAGGC CATCAACGGC
GATGCGCTCG ACCTGGGCTC GGGCAGCGAG ATCCCGCCGG TGTTCGCGGC GCGCCAGAAA
TCCAGCGTGC GGCTGGTGGC CGTGACGCAT GAAGACCTCA ACAACCAGGC CACGCTGGCG
CGCAAGGATT CGCCGATCCG CCGCATCGCC GACTTCAAGG GCAAGCGCGT GGGCTACGTG
CGCGCCACCA CCTCGCACTA CTACCTGGCC AGGCAGCTGG CCGAGGCGGG GCTTTCGTTC
AGCGACATCC AGGCCGTGAG CCTCACGCCT TCGGACGGCC TCTCGGCCTT CGCGCGCGGC
GACCTCGATG CCTGGGCCAT CTACGGCTAC AACGGCCAGC TGGCGCGCAC GCAGTACGGC
GCGCGCACCA TCAAGACCGG GGTGGGCTAC CTCTCGGGCA ACTTCCCGAT CTACGCCAAC
CCGCGTGCGC TCGACGACGA ACTGCGCCGC GCGGCGCTGG GCGACCTGCT GCAGCGCCTG
CAGCGCGCCT TCGCATGGAT CAACGGCAAC TTCCTGGCCT ATGCGCGTGC GCAGTCGGCC
GAGACGCGCG TGCCGGTCGG CGACCTGGTC GAACTCTTCA ACGGCCGCAG CGGCGACTAC
AGCCTGGGCC CGGTTACCGA TGCGGTGGTG CGCAGCCACC AGGAGGTGGC CGACACCTTC
CTGAAGATCG GCGTGCTCGA TGGGCCCGCC GACGTGAAGC CCCTGTGGGA CCGCCGCTTC
GAGAGCCTGC TGCGCCTGCC CGCCGCCTGA
 
Protein sequence
MTSDFPIDRR RLLQAAAGWS ALGAVGAAPA RDLSGVTLRV GTYKGLWRPL LQASGQANTP 
YKIDWRELNN GVLHIEAING DALDLGSGSE IPPVFAARQK SSVRLVAVTH EDLNNQATLA
RKDSPIRRIA DFKGKRVGYV RATTSHYYLA RQLAEAGLSF SDIQAVSLTP SDGLSAFARG
DLDAWAIYGY NGQLARTQYG ARTIKTGVGY LSGNFPIYAN PRALDDELRR AALGDLLQRL
QRAFAWINGN FLAYARAQSA ETRVPVGDLV ELFNGRSGDY SLGPVTDAVV RSHQEVADTF
LKIGVLDGPA DVKPLWDRRF ESLLRLPAA