Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2162 |
Symbol | |
ID | 3971983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 2354093 |
End bp | 2356222 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637925270 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_532035 |
Protein GI | 90423665 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.215518 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCC GGTCCAATCC CGACACCACG CTGCCCGCTG TGACCACCGG CCCGCTGCCC TCCTCGCGCA AGATCTTCGC GACGCCCGAC GAAGCGCCGG AGCTGCGCGT GCCGTTGCGC GAGATCATCC TCAGCGACGG CGCCGGCGAA CCGAACCTGC CGGTGTACGA CACCACCGGC CCCTACACCG ATCCCAGCGT CACCATCGAC GTCAATGCCG GGCTGTCGCG GATCCGCACC GCCTGGGTGA AAGAGCGCGG CGGCGTCGAG GAATATCAAG GCCGCGACGT CAAGCCGGAG GACAACGGCA ATGTCGGCGC CGCCCACGCC GCAAAATCCT TCACCGCCTA TCACAAGCCG CTGCGCGGAC TGGATGCGCC CGCCGCAGGC ACGGCGAACT CCCCTCCCCC TCGCGGGGAG GGGTCGGGGG TGGGGGGAGC AACAAACACT GTGCCCTCCT CTACCCCCCT CCCCACCCCT CCCCCGCAAG GGGGGAGGGA GCAAGGCATC GCCTATGCGT GCGGCCCACA CCTGCCGCCG ATGGTGACGC AGCTGGAATT TGCCCGCGCA GGCATCATCA CCAAGGAGAT GATCTACGTC GCCACCCGGG AAAACCTCGG CCGCAAACAG CAGCTCGCGC GCGCCGAGGC AGCGCTGGCC GACGGCGAAT CGTTCGGCGC GTCGGTGCCG GCCTTCGTCA CCCCGGAATT CGTCCGCAGC GAGATCGCGC GCGGCCGCGC GATCATCCCC GCCAACATCA ACCACGGCGA GTTGGAGCCG ATGATCATCG GCCGCAATTT TCTCACCAAG ATCAACGCCA ATATCGGCAA CAGCGCTGTC ACCTCCTCGG TCGAGGAAGA AGTCGACAAG ATGGTGTGGG CGATCCGCTG GGGCGCCGAC ACCGTGATGG ACCTCTCCAC CGGCCGCAAC ATCCACACCA CGCGGGAATG GATTCTGCGC AACGCGCCGA TCCCGATCGG CACCGTGCCG ATCTATCAGG CGCTGGAGAA GTGCGAAGGC GATCCGGTCA AGCTCACTTG GGAGCTATAT CGCGACACGC TGGTCGAGCA ATGCGAACAG GGCGTCGATT ACTTCACGAT CCACGCCGGC GTGCGGCTGG CTTACATCCA CCTCACCGCC AACCGCACCA CCGGCATCGT GTCGCGCGGC GGCTCGATCA TGGCGAAGTG GTGCCTGGCG CATCACCAGG AGAGCTTCCT CTACACGCAT TTCGACGAGA TCTGCGACCT GATGCGCAAA TACGACGTGT CGTTCTCGCT CGGCGACGGC CTGCGGCCGG GCTCGATCGC GGATGCCAAC GACCGCGCGC AATTCGCCGA ATTGGAGACG CTCGGCGAGC TCACCAAGAT CGCCTGGGAT AAGGGCTGCC AGGTGATGAT CGAAGGCCCC GGCCATGTGC CGCTGCACAA GATCAAGATC AACATGGACA AGCAGCTGAA AGAATGCGGC GAGGCGCCGT TCTATACGCT CGGGCCTTTG ACAACAGACA TTGCGCCTGG CTACGACCAC ATCACCTCGG GCATTGGGGC GGCGATGATC GGCTGGTTCG GCTGCGCGAT GCTGTGCTAC GTCACGCCGA AGGAACACCT TGGCCTGCCC GATCGCAACG ACGTCAAGGT CGGGGTGATT ACTTACAAGA TCGCCGCCCA TGCCTCCGAT CTCGCCAAGG GCCATCCGGC GGCGCAATTG CGCGACGACG CGCTGTCGCG CGCCCGCTTC GACTTCCGCT GGCAGGATCA GTTCAACTTA GGCCTCGATC CCGACACCGC GCAGGCGTTC CACGACGAGA CGCTGCCGAA GGACGCCCAC AAGGTGGCGC ATTTCTGCTC GATGTGCGGA CCGAAATTCT GCTCGATGAA GATCACACAG GACGTGCGCG ACTACGCGGC GGGGCTTGGC GACAACGAGA AGGCGGCGCT CAATCTGGCC GGCGGCAGCT CATTGGGCAG CGTCGGGATG TCGATCTCCG GCAAGCTGGA AGACGGCCTG CCCGCCGACG CTTTTGCCAA GGCGGGCATG GCGGAGATGA GCGAGAAGTT TCGGACTATG GGCGAGCAAC TCTATCTCGA CGCCGAGAAG GTGAAGGAAA GCAACAAGGC GCTGTCGTAG
|
Protein sequence | MNIRSNPDTT LPAVTTGPLP SSRKIFATPD EAPELRVPLR EIILSDGAGE PNLPVYDTTG PYTDPSVTID VNAGLSRIRT AWVKERGGVE EYQGRDVKPE DNGNVGAAHA AKSFTAYHKP LRGLDAPAAG TANSPPPRGE GSGVGGATNT VPSSTPLPTP PPQGGREQGI AYACGPHLPP MVTQLEFARA GIITKEMIYV ATRENLGRKQ QLARAEAALA DGESFGASVP AFVTPEFVRS EIARGRAIIP ANINHGELEP MIIGRNFLTK INANIGNSAV TSSVEEEVDK MVWAIRWGAD TVMDLSTGRN IHTTREWILR NAPIPIGTVP IYQALEKCEG DPVKLTWELY RDTLVEQCEQ GVDYFTIHAG VRLAYIHLTA NRTTGIVSRG GSIMAKWCLA HHQESFLYTH FDEICDLMRK YDVSFSLGDG LRPGSIADAN DRAQFAELET LGELTKIAWD KGCQVMIEGP GHVPLHKIKI NMDKQLKECG EAPFYTLGPL TTDIAPGYDH ITSGIGAAMI GWFGCAMLCY VTPKEHLGLP DRNDVKVGVI TYKIAAHASD LAKGHPAAQL RDDALSRARF DFRWQDQFNL GLDPDTAQAF HDETLPKDAH KVAHFCSMCG PKFCSMKITQ DVRDYAAGLG DNEKAALNLA GGSSLGSVGM SISGKLEDGL PADAFAKAGM AEMSEKFRTM GEQLYLDAEK VKESNKALS
|
| |