Gene Vapar_4728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_4728 
Symbol 
ID7971738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp5023807 
End bp5025693 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content67% 
IMG OID644795313 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002946599 
Protein GI239817689 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCC CCGACAAGTT CACCTCCCTG CTCTCGCTCA CGCGCGAGCC CTTTCCCGCA 
TCGCACAAGT GCCTGATCCC GGGCAGCCGG CCCGACCTCA ACGTGCCGGT GCGCGACGTG
CTGCTGACCA ACGGCGAGAC CGTGTCGCTC TACGACACCT CGGGCCCCTA CACCGATGCC
AAGGTCGAGA TCGACGTGCG CCGCGGCCTG CCCGGCGTGC GCGGCGCCTG GATCACCGAG
CGCAACGACA CCGAAAGCTA CGAAGGCCGC TCGCACCAGG CGCTCGACGA GGGCCTGAAG
CACGCGCACG ACCACGATGC CCAGCGCCTG GCCGAACTGC GGGCCGGCGC CTCGGCGCTG
CAGCGCACGC CGCGCCGCGC CAAGGCGGGC GCCAACGTCA CGCAGATGCA CTACGCGCGC
CGCGGCATCG TCACACCCGA GATGGAATAC GTGGCGCTGC GCGAGAACGG CAAGCGCGAG
TGGATGGCCG AATACCTCGC CAACGAGGAG CGCGCCAAGC GCGTGGCCGG CAACCCGATG
GGCGCCAGCA TTCCGCGCAT CATCACGCCC GAGTTCGTGC GCGACGAGGT GGCGCGCGGC
CGCGCCATCA TCCCGGCCAA CATCAACCAT CCTGAAGTGG AGCCGATGGC CATCGGCCGC
AACTTCAAGG TCAAGATCAA CGCCAACATC GGCAACTCGG CCGTCACCTC GAGCATCGAG
GAAGAAGTGG AAAAGCTCGT GTGGGCGATC CGCTGGGGCG CCGACAACGT GATGGACCTT
TCCACCGGCA AGAACATCCA CACCACGCGC GACTGGATCG TGCGCAACAG TCCCGTGCCC
ATCGGCACCG TGCCGATCTA CCAGGCGCTC GAGAAGGTGG GCGGCGTGGC CGAGGACCTG
ACCTGGGAGA TCTTCCGCGA CACGCTGATC GAGCAGGCCG AGCAGGGCAT CGACTACTTC
ACCATCCATG CCGGCGTGCG GCTGCCGTTC ATCCACCTGA CGGCCGACCG CATGACGGGC
ATCGTCTCGC GCGGCGGCTC GATCATGGCC AAGTGGTGCA TCGCGCACCA CAAGGAGAGC
TTTCTCTACG AGCGCTTCGA GGACATCTGC GACATCATGA AGGCCTACGA CGTGAGCTTC
TCGCTCGGCG ACGGCCTGCG CCCGGGCTCG GGCGCCGACG CCAACGACGA AGCGCAGTTT
GCCGAGCTGC GCACGCTGGG CGAGCTCACG CAGATCGCAT GGAAGCACGA CGTGCAGACC
ATGATCGAGG GGCCCGGCCA CGTGCCGATG CACATGATCC AGGCCAACAT GGACGAGCAG
CTCAAGCACT GCCACGAGGC GCCGTTCTAC ACGCTCGGGC CGCTGACCAT CGACATCGCG
CCGGGCTACG ACCATATCTC CAGCGCCATC GGCGCCGCGA TGATCGGCTG GGCCGGCACC
GCGATGCTCT GCTACGTGAC GCCCAAGGAG CACCTGGGCC TGCCCGACCG CGACGACGTG
AAGCAGGGGA TCATTGCCTA CAAGATCGCC GCGCATGCGG CCGACGTGGC CAAGGGGCAC
CCCGGCGCGC GCTCGCGCGA CGATGCGCTC AGCAAGGCGC GCTTCGAATT CCGCTGGCAG
GACCAGTTCA ACCTGGGCCT GGACCCCGAC ACGGCGCGCG AATTCCATGA CGAGACCCTG
CCCAAGGATT CGAGCAAGGT GGCGCATTTC TGCTCGATGT GCGGACCGAA GTTCTGCTCG
ATGAAGATCA CGCAGGAAGT GCGCGAGTAC GCGGCGAAGA AGGGCGTGGC CGAGGCGGAA
GCCATGGCCG AAGGAATGGC GCAGAAGTCC AGGGAGTTCA TGGCGGGCGG CGGCGAGATC
TACATCCCGA TCCAGCCCGC GTCCTGA
 
Protein sequence
MNAPDKFTSL LSLTREPFPA SHKCLIPGSR PDLNVPVRDV LLTNGETVSL YDTSGPYTDA 
KVEIDVRRGL PGVRGAWITE RNDTESYEGR SHQALDEGLK HAHDHDAQRL AELRAGASAL
QRTPRRAKAG ANVTQMHYAR RGIVTPEMEY VALRENGKRE WMAEYLANEE RAKRVAGNPM
GASIPRIITP EFVRDEVARG RAIIPANINH PEVEPMAIGR NFKVKINANI GNSAVTSSIE
EEVEKLVWAI RWGADNVMDL STGKNIHTTR DWIVRNSPVP IGTVPIYQAL EKVGGVAEDL
TWEIFRDTLI EQAEQGIDYF TIHAGVRLPF IHLTADRMTG IVSRGGSIMA KWCIAHHKES
FLYERFEDIC DIMKAYDVSF SLGDGLRPGS GADANDEAQF AELRTLGELT QIAWKHDVQT
MIEGPGHVPM HMIQANMDEQ LKHCHEAPFY TLGPLTIDIA PGYDHISSAI GAAMIGWAGT
AMLCYVTPKE HLGLPDRDDV KQGIIAYKIA AHAADVAKGH PGARSRDDAL SKARFEFRWQ
DQFNLGLDPD TAREFHDETL PKDSSKVAHF CSMCGPKFCS MKITQEVREY AAKKGVAEAE
AMAEGMAQKS REFMAGGGEI YIPIQPAS