Gene Vapar_3842 details


Gene Information

Locus tag: Vapar_3842
Symbol:
ID: 7969699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 4063848
End bp: 4065158
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 644794428
Product: Gluconate 2-dehydrogenase (acceptor)
Protein accession: YP_002945722
Protein GI: 239816812
COG category: [C] Energy production and conversion
COG ID: [COG2010] Cytochrome c, mono- and diheme variants
TIGRFAM ID:
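
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(4063848, 4065158)
print(length)                  # 1311
print(protein_length(length))  # 436
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.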


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGGC TGTTCAACCT GCTGGTGGCG GTGTTCGTCG TGGGCGCGGC GCTGTATTTC 
TTCCTGAACC GCAGCGACAA GTCCGAGGGC GAGGCGCCGG TGCTGGCCGG CGCGCCCGCC
AATGCGGCGC TGCTCGTGCG CGGCGAATAC CTCACCAAGG CGGCCGACTG CATCGCATGC
CACACCGTGC CCGGCGGCAA GGGACAGCCC TTCGCGGGCG GCCTGCCCTT CGTGCTGCCT
TTCGGCACGA TCTACTCGTC GAACATCACC GCGGACCCGG AAACGGGCAT CGGCAAATGG
AGCGACGACG ACTTCGTGCG CGCGCTGCAC GACGGCGTGC GCGCGGACGG AAAACGCCTG
TACCCGGCGT TCCCGTACAC CTCGTACACC GCGCTGAGCC GCAATGACGT GCTCGCGATC
AAGGCCTACC TGTTCAGCCT GCCCAAGGTG AGCCAGCCGA ACCGGGAGGC CGACCTGGGC
TTTCCGTTCA ACCAGCGCTG GGCGATGGGC TTGTGGAACG CGGCCTTCTT CAAGAGCAGC
CGCTTCGAGG CCGATGCATC GAAGCCGCCC GCGTGGAACC AGGGCAAGTA CCTTGCCACG
GCGCTCGGCC ACTGCGCGGA ATGCCACACG CCGCGCAACT TCGCCTTCGC GATGGATGCG
GACAACAACC TCGCGGGCGA ATCGATCCAG GGCTGGCGCG CCTACAACAT CACGTCGGAC
GCGAAGCACG GCATCGGCGC CTGGAGCGAT GCCGAGGTTG CCTCGTACCT GACGACCGGC
CACGCGGCGG GCCGCGGCTC GGCCTCGGGG CCGATGGGCG AGGCGGTGGA GCACAGCCTG
CAGTACCTGA AGCCGGAGGA CGCCTCGGCG CTCGTGAGCT ACCTGCGCAC GGTGCCCGCG
AAGGCCGGCA AGAACCCGAT CGAAGTCGAT GCGAAGGCGC CATGGGCCGT GGCCGCGAGC
GCGGCAGCGC CCGCCACCAG CACGGCCGAA GGCCACGAGC AGGGGCTGCG GCTCTTCGCC
GCGGCGTGCG CGAGCTGCCA CCAGTGGAAC GGACAAGGCC AGCAGAGCGC CAACGCGTCG
CTGCTGGGGA CGCGCGGCGT GAACGACCCG GACGGCTCCA ACGTCACGCA GATGATCCTG
CAGGGCGTGA AGATGCGCGT GCGCGACCAG GAGGTCTACA TGCCGGCGTT CGGCAAGGCC
TACACGGACA CCGAGGTCGC GGCGCTCGCC AACTACGTCA TCGCGCAATT CGGGAACAAG
CAGGGCGCAG CGGTGACCCC GGAGTTCGTG GCGAAGCAGC GCACGCGCTA G
 
Protein sequence
MKRLFNLLVA VFVVGAALYF FLNRSDKSEG EAPVLAGAPA NAALLVRGEY LTKAADCIAC 
HTVPGGKGQP FAGGLPFVLP FGTIYSSNIT ADPETGIGKW SDDDFVRALH DGVRADGKRL
YPAFPYTSYT ALSRNDVLAI KAYLFSLPKV SQPNREADLG FPFNQRWAMG LWNAAFFKSS
RFEADASKPP AWNQGKYLAT ALGHCAECHT PRNFAFAMDA DNNLAGESIQ GWRAYNITSD
AKHGIGAWSD AEVASYLTTG HAAGRGSASG PMGEAVEHSL QYLKPEDASA LVSYLRTVPA
KAGKNPIEVD AKAPWAVAAS AAAPATSTAE GHEQGLRLFA AACASCHQWN GQGQQSANAS
LLGTRGVNDP DGSNVTQMIL QGVKMRVRDQ EVYMPAFGKA YTDTEVAALA NYVIAQFGNK
QGAAVTPEFV AKQRTR
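
The gene and protein sequences above can be compared programmatically. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted as plain text with the 10-letter blocks separated by whitespace, and it builds the codon table from the compact amino-acid string of the standard genetic code. Translation table 11 makes the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons, which this sketch does not model. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nucleotides of the gene sequence listed above.
print(translate("ATGAAGCGGC TGTTCAACCT G"))  # MKRLFNL
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence, or passing it to `gc_content` and comparing with the GC content field, gives a quick consistency check on the record.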