Gene Vapar_3841 details

Gene Information

Locus tag: Vapar_3841
Symbol:
ID: 7969698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 4061602
End bp: 4063851
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 71%
IMG OID: 644794427
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_002945721
Protein GI: 239816811
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
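
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4061602
end_bp = 4063851

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (2250 bp) and Protein Length (749 aa).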


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCCCG CCGCAGGCAT CAGCCGGCGC AGCGCGCTGC AGGCCGGCGG CCTGGCGCTG 
GCCTTCACCT GGTTCGGCGC GGGCAAGGCC TTTGCCGCGA TCAGCCCCAG GCAGCAGCCG
GGCGACGCGG CCGCCGCGCT CGCGGACGGC AACCCGGCAT TCGCGCCCAA TGCCTTTGTC
CGCATCGATG CCGACGGCGG CGTGCGCCTC GTGATGCCGA TGGCCGAGAT GGGCCAGGCA
ATCTACACCG GCTCGGCGAT GCTGCTGGCC GAGGAACTGG GCGTCGAGCT CGACCAGGTG
CGCGTCGAAC ATTCGCCGCC GAGCGAGGCG CTCTACGGCA TGCCGCTGCT CGGCGGCCAG
ATCACCGGCG GCTCGACCAG CACGCGCGGC ACCTACGGCG TGCTGCGCGA GGCGGGCGCG
GTGGCGCGCA CGCTGCTGGT GGGCGCCGCC GCCGCGCAGT GGAAGGTCGA CCCCGCGGGC
TGCACGGTCG CGCGCGGCGT GGTGAGCCAC GCGGCCTCCA ACCGGCAGCT CGGTTTCGGC
GCACTCGCCG GTGCCGCTGC CAAATTGCCG ATGCCGGAAA AGGTGACGCT GAAGGAGCCG
AAGGATTTCA AGCTGATCGG CCAGCCGCTG CGGCGCGTCG ATTCGGCCGG CAAGGTCGAC
GGCTCCACGC AGTTCGGCAT CGACGTGCGC CTGCCGGGCA TGAAGGTGGC CACGGTCAGG
GCCTGCCCCA CGCTCGGGGG CGTGCTGGCC TCGGTGGACG ACAAGGCGGC GCGGGCGATT
CGCGGCGTGG TCGACGTGCT GCGCATCAAG GACGCGGTGG CCGTGGTGGG CGAGCATTTC
TGGGCCGCGA AGCGCGGACT CGATGCATTG AAGGTGCAGT GGACGCCGGG GCAGAACGCC
GCGCTCACCA CGCAGCAGCT GCGCGCGTCG CTGGCCAACG CGCTGGCCAA GGACAAGGCC
ATCGTCGGCA AGGAAACCGG CAAGCGGCCT GAAGGCACGC TGGTGCAGGC CACCTACGAC
CTGCCGATGC TGGCCCACGC GACCATGGAG CCGCTCAACA CCACGGTGCA CGTGCGGCCC
GACCAGTGCG AGATCTGGGT CGGCACGCAG GTGCCGACGC GCTGCGTGAG CGCTGCGGCC
AAGATTGCCG GCGTTGCCGA AGACAAGGTC GTGCTGCACA ACCAGTACCT GGGCGGCGGC
TTCGGCCGCC GGCTCGAGAC CGACTCGGTC GAGCAGGCCG TGGCCTTCGC CAAGCAGGTG
CCCTACCCGC TCAAGGTGGT GTGGACGCGC GAGGAAGACA TCCGCCACGA CATCGTGCGA
CCGATGTACC ACGACGACAT CTCGGCCGTG GTCGACGGCG ACGGCCAGAT CCTCTGGTTC
GGCGACCGCA TCGCGGGCGG CACGGTGCTG GGCCGCTGGG CCCCCGCCTT CATGGGCAAG
GACGGCATGG ACAGCGACCT GATCGAATGC ATCGCCGAAC CTTGCTACGA CCTGCCGAAC
CTCAAGGTCG AGTGGGTGCG GCACGACATG CCGGCGGGCC TGAACGTCGG CTGGTGGCGC
GGCGTGGGGC CGACGCACAA CCTCTTCGTG ATGGAGAGCT TCATCGACGA GCTCGCGCAC
CGCGCGAAGA AAGACCCCGT GGCCTACCGG CGCGCCATGC TGAAGAAGAA CCCGCGCACG
CTCGGGGTGC TCGACCTGGC GGCCGGCAAG ATCGGCTGGG GCCAGGGCGC GCTGGCGGCG
CGCGTCGGAC GCGGCGTGGC CGTGGGCGAT GCCTTCGGCA GCCGCGTGTG CGCCATCGTC
GAGGCCGAGG TCACGCCGCA GGGCGAAGTG CGCATGCGGC GCGCGGTGGT CGCGGTCGAT
TGCGGCATTG CCGTGAACAC AGGCTCCATC GAGGCGCAGA TCCAGGGCGG GCTGCTGTTC
GGGCTGAGCG CGGCGCTCTT CAGCGAGATC ACGCTGCGCG AGGGGGCCAT CGAGCAGAGC
AACTTCCACG ACTACCGGAT GCTGCGCATC AACGAGGCGC CGCCGGTCGA GGTCCACACC
GTGAAGAGCG GCGAAGCGCC CGGCGGGCTT GGCGAAGTCG GCACCGCCAT CGCCGCGCCG
GCGCTGGCCA ACGCGATCTT CGCGGCGACG GGCGTGCGGC TGCGCGCGCT GCCGGTGAAC
CGCGCACTGC TGGCGCAGGA CAAGGAGGCG CTGAAGAAGA AGATCGCCAA TGCGGGGCTC
TCCGGGCTCG GCGCAAGGAG TGCAGCATGA
 
Protein sequence
MKPAAGISRR SALQAGGLAL AFTWFGAGKA FAAISPRQQP GDAAAALADG NPAFAPNAFV 
RIDADGGVRL VMPMAEMGQA IYTGSAMLLA EELGVELDQV RVEHSPPSEA LYGMPLLGGQ
ITGGSTSTRG TYGVLREAGA VARTLLVGAA AAQWKVDPAG CTVARGVVSH AASNRQLGFG
ALAGAAAKLP MPEKVTLKEP KDFKLIGQPL RRVDSAGKVD GSTQFGIDVR LPGMKVATVR
ACPTLGGVLA SVDDKAARAI RGVVDVLRIK DAVAVVGEHF WAAKRGLDAL KVQWTPGQNA
ALTTQQLRAS LANALAKDKA IVGKETGKRP EGTLVQATYD LPMLAHATME PLNTTVHVRP
DQCEIWVGTQ VPTRCVSAAA KIAGVAEDKV VLHNQYLGGG FGRRLETDSV EQAVAFAKQV
PYPLKVVWTR EEDIRHDIVR PMYHDDISAV VDGDGQILWF GDRIAGGTVL GRWAPAFMGK
DGMDSDLIEC IAEPCYDLPN LKVEWVRHDM PAGLNVGWWR GVGPTHNLFV MESFIDELAH
RAKKDPVAYR RAMLKKNPRT LGVLDLAAGK IGWGQGALAA RVGRGVAVGD AFGSRVCAIV
EAEVTPQGEV RMRRAVVAVD CGIAVNTGSI EAQIQGGLLF GLSAALFSEI TLREGAIEQS
NFHDYRMLRI NEAPPVEVHT VKSGEAPGGL GEVGTAIAAP ALANAIFAAT GVRLRALPVN
RALLAQDKEA LKKKIANAGL SGLGARSAA
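
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is among the codons that can act as an initiator and is then read as methionine. The following is a minimal sketch of that translation, not the database's own pipeline: the function name `translate` and its `is_start` parameter are hypothetical, and the codon table is built by hand from the standard amino acid assignments that table 11 shares with the standard code.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, is_start=True):
    """Translate a DNA fragment; whitespace between blocks is ignored.

    If is_start is True, an initiator codon in first position is read as M.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
head = "GTGAAGCCCG CCGCAGGCAT CAGCCGGCGC AGCGCGCTGC AGGCCGGCGG CCTGGCGCTG"
# Last line of the gene sequence above (in frame, since 2220 is divisible by 3).
tail = "TCCGGGCTCG GCGCAAGGAG TGCAGCATGA"

print(translate(head))                  # compare with the first 20 residues
print(translate(tail, is_start=False))  # compare with the last 9 residues
```

The first call reproduces the opening residues of the protein sequence (MKPAAGISRR SALQAGGLAL), and the second reproduces its final residues (SGLGARSAA) before stopping at the terminal TGA.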