Gene Vapar_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_1043 
Symbol 
ID7972014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp1140773 
End bp1142224 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID644791639 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002942960 
Protein GI239814050 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.414752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTCACG GCGAGATCCA TCAGAACGTG CGGCAATACA TCAACGGCCG CTGGGAAACC 
AGCGCGACCA CCGGTGTGAG CGCCAATCCT TCGGACACCA GCGAAGTGGT GGCCGAATAC
GCACGGGCCG ATCGCCGCCA GGTGGAGTCC GCGATCCGCG CAGCCGCCGA TGCCTTTTCC
CACTGGAGCC ACAGCACGCC GCAGCGCCGG GCCGACGTGC TCGACCGCAT CGGCACCGAA
CTCCTGGCGC GCAAGGACGA GCTGGGCCTG CTGCTGGCGC GCGAGGTTGG CAAGACCCTG
CCCGAGGCGG TGGCCGAGGC GGCGCGCGCG GGGCAGGTGT TCAAGTTCTT CGCGGGCGAG
GCACTGCGCG GCGGCGGCGA GAACATGGCC TCGCTGCGCG CAGGCGTGCA GGTCGACGTC
ACGCGCGAGC CGGTGGGCGT GGCCGGGCTC ATCACGCCGT GGAATTCGCC GCTCGCGGTG
CCGGCCACCA AGATTGCACC GGCGCTCGCG CACGGCAACT GCGTGGTGTT CAAGCCGGCC
GAGCTGGTGC CGGCCTGCGG CTGGACGCTG GCCGAGATCA TCAGCCGCGC GGCACTGCCC
GCGGGCGTGT TCAACCTCGT GATGGGCAGC GGCCGCGAAG CCGGCCAGGC GCTGGTCGAC
AGCCCGCTGG TCGATGCCCT GAGCTTCACC GGCTCGGCGC GCAACGGCGA ACGCATCCTG
CAGGCGGCCG CCGCGCGGCG CGCCAAGGTG CAGCTCGAGA TGGGCGGCAA GAACGCGCTC
GTGGTGCTGG CCGATGCCGA CATCGACCAC GCGGTCGACT GCGCGGTGCA GGGCGCCTAT
TTTTCCAACG GCCAGCGCTG CACGGCGTCG AGCCGGCTGA TCGTCGAGGC CGCGGTGCAC
GATGCCTTCG TCTCGCGGCT TCGCGAGCGG CTCAAGGCGC TGAAGATCGG CCATGCGCTC
GAACGCGGCG TCGACGTCGG GCCGCTGGTC GATGAAGAGC GCCTCGCGCG CAGCCTGGCC
TGGGTCGGCA TTGCGCGCGA AGAGGGCGCT GAGCATGTGT GGGGCGGCGA GCCGCTCCAG
CGCGCCACGG CCGGCCACTA CATGAGCCCC GCGCTGTTCC TGGCCCAGCC CGGGCACCGC
ATCGCGCGCG AGGAAATCTT CGGCCCGCTG GCCTGCGTGC TGCGCGCGGC CGACTACGAC
GAGGCGCTGG CGCTGTGCAA CGACACGCCT TCGGGCCTGA GCGCGGGCAT CTGCACCAAT
TCGCTCAAGC ACGCGATGCA TTTCAGGCGC CATGCCGAAG TCGGCATGAC GATGGTCAAC
CTGCCGACGG CGGGCGTGGA CTTCCACGCG CCCTTCGGCG GGCGCAAGGG GTCGGGCTAC
GGGCCGCGCG AACAAGGGCG CCATGCCGCG GAGTTCTACA CGACGGTCAA GACCGGCTAC
ATGCTGGCCT GA
 
Protein sequence
MIHGEIHQNV RQYINGRWET SATTGVSANP SDTSEVVAEY ARADRRQVES AIRAAADAFS 
HWSHSTPQRR ADVLDRIGTE LLARKDELGL LLAREVGKTL PEAVAEAARA GQVFKFFAGE
ALRGGGENMA SLRAGVQVDV TREPVGVAGL ITPWNSPLAV PATKIAPALA HGNCVVFKPA
ELVPACGWTL AEIISRAALP AGVFNLVMGS GREAGQALVD SPLVDALSFT GSARNGERIL
QAAAARRAKV QLEMGGKNAL VVLADADIDH AVDCAVQGAY FSNGQRCTAS SRLIVEAAVH
DAFVSRLRER LKALKIGHAL ERGVDVGPLV DEERLARSLA WVGIAREEGA EHVWGGEPLQ
RATAGHYMSP ALFLAQPGHR IAREEIFGPL ACVLRAADYD EALALCNDTP SGLSAGICTN
SLKHAMHFRR HAEVGMTMVN LPTAGVDFHA PFGGRKGSGY GPREQGRHAA EFYTTVKTGY
MLA