Gene Rmar_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_0078 
Symbol 
ID8566703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp84474 
End bp86357 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content66% 
IMG OID 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003289375 
Protein GI268315656 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCCCA AGCCACCTGC TGCGATTCGG CGCGTGTACG TATCGGGCAA CCGCTATCCG 
GAGCTGCGCG TCCCGTTCAA GGAGGTTCAG CTTTCGCCCA CGCGCCTGCC GGACGGAAGC
CTGGAGCCCA ACGATCCGGT CCTGCTCTAC GACACGCGCG GGCCGTGGGG CGATCCCGAC
TTCCACGGCG ACATCCGCCA GGGCCTGCCG CCGCTGCGGC GTCCCTGGAT CGAGGCCCGT
GGCGACGTCG AGGAATACGA CGGACGGCCG GTGCAGCCCA TCGACGACGG CTATCGGAGC
GAGGAAGAGC GTCGGCGTGC CGAGCAGCGC GGCAAGCTGC ACCGCTGGCC CGGTCCGCGT
CGGCGCCCGC TCCGCGGCAA AAACGGCGCG GCCGTCACAC AAATGCACTA TGCGCGCCAG
GGTATCATCA CGCCCGAGAT GGAGTTTGTC GCCATCCGCG AAAACCAGCG GCGCGAGCGG
CTCATCGAAC TCCAGAAAGA GCTGGGCGAC CGCTGGTCGC TTGCCTTCCA GCATCCGGGC
CAACCCTGGG GCGCCCAGAT TCCGCCCGTC ATCACGCCCG AGTTCGTGCG CGACGAGGTG
GCCCGCGGCC GCGCCATCAT CCCGTGCAAC GTCAACCACC CGGAGTGCGA GCCGATGATC
ATCGGCCGCA ACTTCCTGGT CAAGATCAAC GCAAACATCG GCACCTCGGC CGTCTCCAGC
TCCATCGACG AGGAAGTCGA AAAGCTGCTG TGGGCCATCT ACTGGGGCGC CGACACGGTA
ATGGACCTCT CGACGGGCAA AAACATCCAC GAAACGCGCG AGTGGATCAT CCGCAACAGT
CCGGTGCCCA TCGGAACGGT GCCCATTTAC CAGGCGCTCG AAAAGGTGGG CGGCAAGCCC
GAAGAGCTGA CCTGGGAGAT CTTCCGCGAC ACGCTCATCG AACAGTGCGA GCAGGGCGTC
GACTACATGA CGATCCATGC GGGCGTCCGG CTCGCCTACA TCCCGCTGAC GGCCAACCGG
CGCACGGGCA TCGTCTCGCG CGGCGGCTCG ATCATCGCCA AATGGTGCCT GGCCCACCAC
AGGGAAAACT TCCTCTACAC GCACTTTGAG GAAATCTGCG AGATTCTGCG TCAGTACGAC
GTGTCGATCA GTCTGGGCGA CGGGCTGCGG CCCGGCTCCA TCCAGGACGC CAACGACGAG
GCCCAGTTTG CCGAGCTCAA GACGCTCGGC GAGCTGACCC GGATCGCCTG GAAGTACGAC
GTGCAGGTGA TGATCGAAGG CCCCGGCCAC ATCCCCATGC ATCTGATCAA GGAAAACGTG
GACCGGGAGC TGGAAGACTG CTACGAGGCG CCGTTCTACA CGCTCGGGCC GCTGGTGACC
GACATCGCCC CGGCCTACGA CCACATCACG TCGGCCATCG GCGCGGCAAT GATCGGCTGG
TTCGGCGCGG CCATGCTCTG CTACGTCACG CCCAAGGAGC ACCTGGGCCT GCCCAACAAG
AACGACGTGC GCGAGGGCGT GATCGCCTAC AAGATCGCCG CGCACGCGGC CGACCTGGCC
AAGGGGCACC CCGGCGCCCA GTACTGGGAC AACGCGCTCT CGAAGGCCCG CTTCGAATTC
CGCTGGGAGG ACCAGTTCAA CCTGTCGCTC GATCCGGAGC GGGCCCGCGA GTACCACGAC
GAAACGCTCC CGGCCGAAGG GGCCAAGCTG GCGCACTTCT GCTCGATGTG CGGGCCGAAG
TTCTGCTCCA TGAAGATCAC CGAGGAAATC CGGGCGATGG CCGCCGAAAA AGGCGTCGAT
GCCCGCCAGG TGATCGAAGA AGGGCTGGAG GAAAAGGCCC GCGAGTTCCG CGAAAAAGGC
GCCGAAATCT ACACGGCCCC CTGA
 
Protein sequence
MIPKPPAAIR RVYVSGNRYP ELRVPFKEVQ LSPTRLPDGS LEPNDPVLLY DTRGPWGDPD 
FHGDIRQGLP PLRRPWIEAR GDVEEYDGRP VQPIDDGYRS EEERRRAEQR GKLHRWPGPR
RRPLRGKNGA AVTQMHYARQ GIITPEMEFV AIRENQRRER LIELQKELGD RWSLAFQHPG
QPWGAQIPPV ITPEFVRDEV ARGRAIIPCN VNHPECEPMI IGRNFLVKIN ANIGTSAVSS
SIDEEVEKLL WAIYWGADTV MDLSTGKNIH ETREWIIRNS PVPIGTVPIY QALEKVGGKP
EELTWEIFRD TLIEQCEQGV DYMTIHAGVR LAYIPLTANR RTGIVSRGGS IIAKWCLAHH
RENFLYTHFE EICEILRQYD VSISLGDGLR PGSIQDANDE AQFAELKTLG ELTRIAWKYD
VQVMIEGPGH IPMHLIKENV DRELEDCYEA PFYTLGPLVT DIAPAYDHIT SAIGAAMIGW
FGAAMLCYVT PKEHLGLPNK NDVREGVIAY KIAAHAADLA KGHPGAQYWD NALSKARFEF
RWEDQFNLSL DPERAREYHD ETLPAEGAKL AHFCSMCGPK FCSMKITEEI RAMAAEKGVD
ARQVIEEGLE EKAREFREKG AEIYTAP