Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_4728 |
Symbol | |
ID | 7971738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 5023807 |
End bp | 5025693 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644795313 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002946599 |
Protein GI | 239817689 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCC CCGACAAGTT CACCTCCCTG CTCTCGCTCA CGCGCGAGCC CTTTCCCGCA TCGCACAAGT GCCTGATCCC GGGCAGCCGG CCCGACCTCA ACGTGCCGGT GCGCGACGTG CTGCTGACCA ACGGCGAGAC CGTGTCGCTC TACGACACCT CGGGCCCCTA CACCGATGCC AAGGTCGAGA TCGACGTGCG CCGCGGCCTG CCCGGCGTGC GCGGCGCCTG GATCACCGAG CGCAACGACA CCGAAAGCTA CGAAGGCCGC TCGCACCAGG CGCTCGACGA GGGCCTGAAG CACGCGCACG ACCACGATGC CCAGCGCCTG GCCGAACTGC GGGCCGGCGC CTCGGCGCTG CAGCGCACGC CGCGCCGCGC CAAGGCGGGC GCCAACGTCA CGCAGATGCA CTACGCGCGC CGCGGCATCG TCACACCCGA GATGGAATAC GTGGCGCTGC GCGAGAACGG CAAGCGCGAG TGGATGGCCG AATACCTCGC CAACGAGGAG CGCGCCAAGC GCGTGGCCGG CAACCCGATG GGCGCCAGCA TTCCGCGCAT CATCACGCCC GAGTTCGTGC GCGACGAGGT GGCGCGCGGC CGCGCCATCA TCCCGGCCAA CATCAACCAT CCTGAAGTGG AGCCGATGGC CATCGGCCGC AACTTCAAGG TCAAGATCAA CGCCAACATC GGCAACTCGG CCGTCACCTC GAGCATCGAG GAAGAAGTGG AAAAGCTCGT GTGGGCGATC CGCTGGGGCG CCGACAACGT GATGGACCTT TCCACCGGCA AGAACATCCA CACCACGCGC GACTGGATCG TGCGCAACAG TCCCGTGCCC ATCGGCACCG TGCCGATCTA CCAGGCGCTC GAGAAGGTGG GCGGCGTGGC CGAGGACCTG ACCTGGGAGA TCTTCCGCGA CACGCTGATC GAGCAGGCCG AGCAGGGCAT CGACTACTTC ACCATCCATG CCGGCGTGCG GCTGCCGTTC ATCCACCTGA CGGCCGACCG CATGACGGGC ATCGTCTCGC GCGGCGGCTC GATCATGGCC AAGTGGTGCA TCGCGCACCA CAAGGAGAGC TTTCTCTACG AGCGCTTCGA GGACATCTGC GACATCATGA AGGCCTACGA CGTGAGCTTC TCGCTCGGCG ACGGCCTGCG CCCGGGCTCG GGCGCCGACG CCAACGACGA AGCGCAGTTT GCCGAGCTGC GCACGCTGGG CGAGCTCACG CAGATCGCAT GGAAGCACGA CGTGCAGACC ATGATCGAGG GGCCCGGCCA CGTGCCGATG CACATGATCC AGGCCAACAT GGACGAGCAG CTCAAGCACT GCCACGAGGC GCCGTTCTAC ACGCTCGGGC CGCTGACCAT CGACATCGCG CCGGGCTACG ACCATATCTC CAGCGCCATC GGCGCCGCGA TGATCGGCTG GGCCGGCACC GCGATGCTCT GCTACGTGAC GCCCAAGGAG CACCTGGGCC TGCCCGACCG CGACGACGTG AAGCAGGGGA TCATTGCCTA CAAGATCGCC GCGCATGCGG CCGACGTGGC CAAGGGGCAC CCCGGCGCGC GCTCGCGCGA CGATGCGCTC AGCAAGGCGC GCTTCGAATT CCGCTGGCAG GACCAGTTCA ACCTGGGCCT GGACCCCGAC ACGGCGCGCG AATTCCATGA CGAGACCCTG CCCAAGGATT CGAGCAAGGT GGCGCATTTC TGCTCGATGT GCGGACCGAA GTTCTGCTCG ATGAAGATCA CGCAGGAAGT GCGCGAGTAC GCGGCGAAGA AGGGCGTGGC CGAGGCGGAA GCCATGGCCG AAGGAATGGC GCAGAAGTCC AGGGAGTTCA TGGCGGGCGG CGGCGAGATC TACATCCCGA TCCAGCCCGC GTCCTGA
|
Protein sequence | MNAPDKFTSL LSLTREPFPA SHKCLIPGSR PDLNVPVRDV LLTNGETVSL YDTSGPYTDA KVEIDVRRGL PGVRGAWITE RNDTESYEGR SHQALDEGLK HAHDHDAQRL AELRAGASAL QRTPRRAKAG ANVTQMHYAR RGIVTPEMEY VALRENGKRE WMAEYLANEE RAKRVAGNPM GASIPRIITP EFVRDEVARG RAIIPANINH PEVEPMAIGR NFKVKINANI GNSAVTSSIE EEVEKLVWAI RWGADNVMDL STGKNIHTTR DWIVRNSPVP IGTVPIYQAL EKVGGVAEDL TWEIFRDTLI EQAEQGIDYF TIHAGVRLPF IHLTADRMTG IVSRGGSIMA KWCIAHHKES FLYERFEDIC DIMKAYDVSF SLGDGLRPGS GADANDEAQF AELRTLGELT QIAWKHDVQT MIEGPGHVPM HMIQANMDEQ LKHCHEAPFY TLGPLTIDIA PGYDHISSAI GAAMIGWAGT AMLCYVTPKE HLGLPDRDDV KQGIIAYKIA AHAADVAKGH PGARSRDDAL SKARFEFRWQ DQFNLGLDPD TAREFHDETL PKDSSKVAHF CSMCGPKFCS MKITQEVREY AAKKGVAEAE AMAEGMAQKS REFMAGGGEI YIPIQPAS
|
| |