Gene Rpal_4096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4096 
Symbol 
ID6411780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4394348 
End bp4396273 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content64% 
IMG OID642713978 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001993067 
Protein GI192292462 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000111141 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCC GCTCCAATCC CGAGACCACC CGCGCGGCTG TGACCACCGG TGCCCTGCCC 
TCCTCCAAGA AGATCTACGC CACGCCTGCC AGCGCGCCGG ATCTGCGCGT GCCGCTGCGC
GAGATCATCC TGAGCGAAGG CGCAGGCGAG CCGAACCTGC CGGTGTACGA CACCTCCGGC
CCCTACACCG ATCCGACCGT CGTGATCGAC GTCAACAAGG GCCTGCCGCG TCCGCGCACC
GAATGGGTCA AGCAGCGCGG CGGCGTCGAG CAATATGAGG GCCGCGACAT CAAGCCGGAA
GACAACGGCA ATGTCGGCGC TGCGCACGCG GCCAAAGCCT TCACCGCGCA TCACCAGCCG
CTGCGCGGGA TCAGTGATGC GCCGATCACC CAATACGAAT TCGCCCGCCG CGGCATCATC
ACCAAGGAGA TGATCTACGT CGCCGAGCGC GAGAATTTGG GCCGCAAGCA GCAGCTCGAG
CGTGCCGAGG CGGCGCTGGC CGATGGTGAA TCGTTCGGCG CCGCGGTGCC GGCGTTCATC
ACGCCGGAAT TCGTCCGCGA CGAGATCGCC CGCGGCCGCG CCATCATTCC GGCCAACATC
AATCACGGCG AACTCGAGCC GATGATCATC GGCCGCAACT TCCTCACCAA GATCAACGCC
AATATCGGCA ACTCGGCGGT GACCTCGTCG GTCGAGGAGG AAGTCGACAA GATGGTGTGG
GCGATCCGCT GGGGTGCCGA CACCGTGATG GACCTCTCGA CCGGCCGCAA CATTCATACC
ACCCGCGAAT GGATCTTGCG CAACTCGCCG GTGCCGATCG GCACCGTGCC GATCTATCAG
GCGCTGGAGA AGTGCGAAGG CGATCCGGTC AAGCTGACCT GGGAGCTGTA CAAGGACACG
CTGATCGAGC AGGCCGAACA GGGCGTCGAT TACTTCACCA TCCACGCCGG CGTGCGGCTG
CAGTACATCC ACCTCACCGC CAGCCGGGTC ACCGGCATCG TGTCGCGCGG CGGCTCGATC
ATGGCGAAGT GGTGCCTGGC GCATCACAAG GAGAGCTTCC TCTACACGCA TTTCGACGAG
ATCTGCGACC TGATGCGGAA GTACGACGTG TCGTTCTCGC TCGGCGACGG CCTGCGGCCG
GGCTCGATCG CGGACGCCAA CGACCGCGCC CAGTTCGCCG AACTGGAGAC GCTCGGCGAG
CTCACCAAGA TCGCCTGGGC CAAGGGCTGC CAGGTGATGA TCGAAGGCCC CGGCCACGTG
CCGATGCACA AGATCAAGAT CAACATGGAC AAGCAGCTCA AGGAGTGCGG CGAGGCGCCG
TTCTACACCT TGGGCCCGCT GACCACCGAT ATCGCACCGG GCTATGATCA CATCACTTCC
GGCATCGGCG CCGCGATGAT CGGCTGGTTC GGCTGCGCGA TGCTGTGCTA CGTCACGCCG
AAGGAGCATC TCGGCCTGCC CGACCGCAAT GACGTCAAGA CCGGCGTGAT CACCTACAAG
ATCGCCGCCC ACGCCGCCGA CCTCGCCAAG GGCCACCCCG CCGCCCAGCT CCGCGACGAC
GCACTCTCCC GTGCAAGGTT CGAATTCCGC TGGCAGGACC AGTTCAATCT CGGCCTCGAT
CCGGACACGG CGCAGGCCTT CCACGACGAG ACCCTACCGA AGGACGCCCA CAAGGTCGCG
CATTTCTGCT CGATGTGCGG CCCGAAATTC TGCTCGATGA AGATCACGCA GGACGTCCGC
GACTACGCCG CCGGCCTCGG CGACAACGAG AAAGCCGCCC TCTACCCGGT CGGCCACGCC
GGCATGACCA TCTCCGGCAC CATCGAAGAC GGCATGGCCC AGATGAGCGC CAAGTTCAAA
GAGATGGGAA GCAGCGTGTA TCTCGATGCC GACAAGGTGA AAGAGAGCAA CAAGGCGCTG
TCGTAA
 
Protein sequence
MNIRSNPETT RAAVTTGALP SSKKIYATPA SAPDLRVPLR EIILSEGAGE PNLPVYDTSG 
PYTDPTVVID VNKGLPRPRT EWVKQRGGVE QYEGRDIKPE DNGNVGAAHA AKAFTAHHQP
LRGISDAPIT QYEFARRGII TKEMIYVAER ENLGRKQQLE RAEAALADGE SFGAAVPAFI
TPEFVRDEIA RGRAIIPANI NHGELEPMII GRNFLTKINA NIGNSAVTSS VEEEVDKMVW
AIRWGADTVM DLSTGRNIHT TREWILRNSP VPIGTVPIYQ ALEKCEGDPV KLTWELYKDT
LIEQAEQGVD YFTIHAGVRL QYIHLTASRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFDE
ICDLMRKYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTKIAWAKGC QVMIEGPGHV
PMHKIKINMD KQLKECGEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP
KEHLGLPDRN DVKTGVITYK IAAHAADLAK GHPAAQLRDD ALSRARFEFR WQDQFNLGLD
PDTAQAFHDE TLPKDAHKVA HFCSMCGPKF CSMKITQDVR DYAAGLGDNE KAALYPVGHA
GMTISGTIED GMAQMSAKFK EMGSSVYLDA DKVKESNKAL S