Gene RPD_3432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3432 
Symbol 
ID4023946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3815473 
End bp3817398 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content65% 
IMG OID637963637 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_570557 
Protein GI91977898 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCA GGTCGAACCC CGACACGACG CGCCCCGCCG TCACCACCGG CGGCCTGCCC 
TCCTCGAAGA AGATCTACGC GACGCCCGCC GCCGCGCCGG ATCTGCGCGT ACCGCTGCGC
GAGATCATTC TGTCGGAAGG CGCCGGCGAG CCGAACCTGC CGATCTACGA CACGTCGGGC
CCCTACACCG ACCCGAGCGT CACCATCGAC GTCAATGCCG GCCTGTCGCG GGCCCGCACG
CAATGGGTCA AGGAGCGCGG CGGCGTCGAA GAATATCAGG GCCGCGACGT CAAGCCGGAA
GACAACGGCA ATGTCGGCGC CGCGCATGCG GCGAAGTCGT TCACCGCCTA TCACAAGCCG
CTGCGCGGCA TCGGCGACGC GCCGATCACC CAGTACGAAT TCGCCCGCCG GGGCATCATC
ACCAAGGAGA TGATCTACGT CGCGGAGCGC GAGAATCTCG GCCGCAAGCA GCAACTCGAG
CGCGCCGAGG CGGCGCTGGC CGACGGCGAG AGCTTCGGCG CCAGCGTGCC GGCGTTCATC
ACGCCGGAAT TCGTCCGCGA CGAGATCGCG CGCGGTCGCG CCATCATCCC GGCCAATATC
AACCACGGCG AACTCGAGCC GATGATCATC GGCCGCAACT TCCTGACCAA GATCAACGCC
AATATCGGCA ACAGCGCCGT GACGTCGTCG GTGGAGGAAG AAGTCGACAA GATGGTGTGG
GCGATCCGCT GGGGCGCCGA CACCGTGATG GACCTCTCCA CCGGCCGCAA CATCCACACC
ACCCGGGAAT GGATCCTGCG CAACTCGCCG GTGCCGATCG GCACGGTGCC GATCTATCAG
GCGCTGGAGA AGTGCGAAGG CGATCCCGTC AAGCTGACCT GGGAGCTGTA CAAGGACACG
CTGATCGAGC AGTGCGAACA GGGCGTCGAT TACTTCACGA TCCACGCCGG CGTGCGGCTG
CAATACATTC ACCTCACCGC CAATCGCGTC ACAGGCATCG TCTCGCGCGG CGGCTCGATC
ATGGCGAAGT GGTGCCTCGC GCATCACAAG GAGAGCTTCC TCTACACGCA TTTCGACGAG
ATCTGCGACC TGATGCGCAA GTACGACGTG TCGTTCTCGC TCGGCGACGG CCTGCGCCCG
GGCTCGATCG CCGACGCCAA CGACCGCGCC CAATTCGCCG AACTGGAGAC GCTCGGCGAA
CTCACCAAGA TCGCCTGGGC CAAGGGCTGC CAAGTGATGA TCGAAGGCCC CGGCCACGTG
CCGATGCACA AGATCAAGAT CAACATGGAC AAGCAGCTCA AGGAATGCGG CGAGGCGCCG
TTCTACACGC TCGGGCCGCT GACCACCGAC ATCGCGCCGG GCTATGATCA CATCACCAGC
GGCATCGGCG CTGCGATGAT CGGCTGGTTC GGCTGCGCGA TGCTGTGCTA CGTCACGCCG
AAGGAACATC TCGGCCTGCC CGATCGCAAC GACGTCAAGG TCGGCGTGAT CACCTACAAG
ATCGCCGCCC ACGCCGCTGA TCTGGCCAAG GGCCACCCCG CCGCGCAACT GCGCGACGAC
GCGGTGTCCC GCGCCAGATT CGATTTCCGC TGGCAGGACC AGTTCAACCT CGGCCTCGAC
CCCGACACCG CGCAGGCGTT CCACGACGAA ACCCTGCCCA AGGACGCCCA CAAGGTCGCG
CATTTCTGCT CGATGTGCGG GCCGAAATTC TGCTCGATGA AGATCACCCA GGACGTGCGC
GACTACGCAG CGGGATTGGG CGACAACGAG AAGGCCGCGC TGTATCCGGC CGGAAGCGTC
GGGATGAGCA TCAGCGGTGT GATCGAGGAC GGCATGGCGC AGATGAGCGC GAAGTTCAGG
GATATGGGCG AGCACCTGTA TCTCGACGCC GAGAAGGTGA AGGAGAGCAA CAAGGCGCTG
TCGTAA
 
Protein sequence
MNIRSNPDTT RPAVTTGGLP SSKKIYATPA AAPDLRVPLR EIILSEGAGE PNLPIYDTSG 
PYTDPSVTID VNAGLSRART QWVKERGGVE EYQGRDVKPE DNGNVGAAHA AKSFTAYHKP
LRGIGDAPIT QYEFARRGII TKEMIYVAER ENLGRKQQLE RAEAALADGE SFGASVPAFI
TPEFVRDEIA RGRAIIPANI NHGELEPMII GRNFLTKINA NIGNSAVTSS VEEEVDKMVW
AIRWGADTVM DLSTGRNIHT TREWILRNSP VPIGTVPIYQ ALEKCEGDPV KLTWELYKDT
LIEQCEQGVD YFTIHAGVRL QYIHLTANRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFDE
ICDLMRKYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTKIAWAKGC QVMIEGPGHV
PMHKIKINMD KQLKECGEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP
KEHLGLPDRN DVKVGVITYK IAAHAADLAK GHPAAQLRDD AVSRARFDFR WQDQFNLGLD
PDTAQAFHDE TLPKDAHKVA HFCSMCGPKF CSMKITQDVR DYAAGLGDNE KAALYPAGSV
GMSISGVIED GMAQMSAKFR DMGEHLYLDA EKVKESNKAL S