Gene RPC_2162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2162 
Symbol 
ID3971983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2354093 
End bp2356222 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content65% 
IMG OID637925270 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_532035 
Protein GI90423665 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.215518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCC GGTCCAATCC CGACACCACG CTGCCCGCTG TGACCACCGG CCCGCTGCCC 
TCCTCGCGCA AGATCTTCGC GACGCCCGAC GAAGCGCCGG AGCTGCGCGT GCCGTTGCGC
GAGATCATCC TCAGCGACGG CGCCGGCGAA CCGAACCTGC CGGTGTACGA CACCACCGGC
CCCTACACCG ATCCCAGCGT CACCATCGAC GTCAATGCCG GGCTGTCGCG GATCCGCACC
GCCTGGGTGA AAGAGCGCGG CGGCGTCGAG GAATATCAAG GCCGCGACGT CAAGCCGGAG
GACAACGGCA ATGTCGGCGC CGCCCACGCC GCAAAATCCT TCACCGCCTA TCACAAGCCG
CTGCGCGGAC TGGATGCGCC CGCCGCAGGC ACGGCGAACT CCCCTCCCCC TCGCGGGGAG
GGGTCGGGGG TGGGGGGAGC AACAAACACT GTGCCCTCCT CTACCCCCCT CCCCACCCCT
CCCCCGCAAG GGGGGAGGGA GCAAGGCATC GCCTATGCGT GCGGCCCACA CCTGCCGCCG
ATGGTGACGC AGCTGGAATT TGCCCGCGCA GGCATCATCA CCAAGGAGAT GATCTACGTC
GCCACCCGGG AAAACCTCGG CCGCAAACAG CAGCTCGCGC GCGCCGAGGC AGCGCTGGCC
GACGGCGAAT CGTTCGGCGC GTCGGTGCCG GCCTTCGTCA CCCCGGAATT CGTCCGCAGC
GAGATCGCGC GCGGCCGCGC GATCATCCCC GCCAACATCA ACCACGGCGA GTTGGAGCCG
ATGATCATCG GCCGCAATTT TCTCACCAAG ATCAACGCCA ATATCGGCAA CAGCGCTGTC
ACCTCCTCGG TCGAGGAAGA AGTCGACAAG ATGGTGTGGG CGATCCGCTG GGGCGCCGAC
ACCGTGATGG ACCTCTCCAC CGGCCGCAAC ATCCACACCA CGCGGGAATG GATTCTGCGC
AACGCGCCGA TCCCGATCGG CACCGTGCCG ATCTATCAGG CGCTGGAGAA GTGCGAAGGC
GATCCGGTCA AGCTCACTTG GGAGCTATAT CGCGACACGC TGGTCGAGCA ATGCGAACAG
GGCGTCGATT ACTTCACGAT CCACGCCGGC GTGCGGCTGG CTTACATCCA CCTCACCGCC
AACCGCACCA CCGGCATCGT GTCGCGCGGC GGCTCGATCA TGGCGAAGTG GTGCCTGGCG
CATCACCAGG AGAGCTTCCT CTACACGCAT TTCGACGAGA TCTGCGACCT GATGCGCAAA
TACGACGTGT CGTTCTCGCT CGGCGACGGC CTGCGGCCGG GCTCGATCGC GGATGCCAAC
GACCGCGCGC AATTCGCCGA ATTGGAGACG CTCGGCGAGC TCACCAAGAT CGCCTGGGAT
AAGGGCTGCC AGGTGATGAT CGAAGGCCCC GGCCATGTGC CGCTGCACAA GATCAAGATC
AACATGGACA AGCAGCTGAA AGAATGCGGC GAGGCGCCGT TCTATACGCT CGGGCCTTTG
ACAACAGACA TTGCGCCTGG CTACGACCAC ATCACCTCGG GCATTGGGGC GGCGATGATC
GGCTGGTTCG GCTGCGCGAT GCTGTGCTAC GTCACGCCGA AGGAACACCT TGGCCTGCCC
GATCGCAACG ACGTCAAGGT CGGGGTGATT ACTTACAAGA TCGCCGCCCA TGCCTCCGAT
CTCGCCAAGG GCCATCCGGC GGCGCAATTG CGCGACGACG CGCTGTCGCG CGCCCGCTTC
GACTTCCGCT GGCAGGATCA GTTCAACTTA GGCCTCGATC CCGACACCGC GCAGGCGTTC
CACGACGAGA CGCTGCCGAA GGACGCCCAC AAGGTGGCGC ATTTCTGCTC GATGTGCGGA
CCGAAATTCT GCTCGATGAA GATCACACAG GACGTGCGCG ACTACGCGGC GGGGCTTGGC
GACAACGAGA AGGCGGCGCT CAATCTGGCC GGCGGCAGCT CATTGGGCAG CGTCGGGATG
TCGATCTCCG GCAAGCTGGA AGACGGCCTG CCCGCCGACG CTTTTGCCAA GGCGGGCATG
GCGGAGATGA GCGAGAAGTT TCGGACTATG GGCGAGCAAC TCTATCTCGA CGCCGAGAAG
GTGAAGGAAA GCAACAAGGC GCTGTCGTAG
 
Protein sequence
MNIRSNPDTT LPAVTTGPLP SSRKIFATPD EAPELRVPLR EIILSDGAGE PNLPVYDTTG 
PYTDPSVTID VNAGLSRIRT AWVKERGGVE EYQGRDVKPE DNGNVGAAHA AKSFTAYHKP
LRGLDAPAAG TANSPPPRGE GSGVGGATNT VPSSTPLPTP PPQGGREQGI AYACGPHLPP
MVTQLEFARA GIITKEMIYV ATRENLGRKQ QLARAEAALA DGESFGASVP AFVTPEFVRS
EIARGRAIIP ANINHGELEP MIIGRNFLTK INANIGNSAV TSSVEEEVDK MVWAIRWGAD
TVMDLSTGRN IHTTREWILR NAPIPIGTVP IYQALEKCEG DPVKLTWELY RDTLVEQCEQ
GVDYFTIHAG VRLAYIHLTA NRTTGIVSRG GSIMAKWCLA HHQESFLYTH FDEICDLMRK
YDVSFSLGDG LRPGSIADAN DRAQFAELET LGELTKIAWD KGCQVMIEGP GHVPLHKIKI
NMDKQLKECG EAPFYTLGPL TTDIAPGYDH ITSGIGAAMI GWFGCAMLCY VTPKEHLGLP
DRNDVKVGVI TYKIAAHASD LAKGHPAAQL RDDALSRARF DFRWQDQFNL GLDPDTAQAF
HDETLPKDAH KVAHFCSMCG PKFCSMKITQ DVRDYAAGLG DNEKAALNLA GGSSLGSVGM
SISGKLEDGL PADAFAKAGM AEMSEKFRTM GEQLYLDAEK VKESNKALS