Gene Rmar_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1601 
Symbol 
ID8568253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp1855096 
End bp1856823 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content64% 
IMG OID 
ProductSqualene/phytoene synthase 
Protein accessionYP_003290875 
Protein GI268317156 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0484297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCT TCGATCGGCT GCTGCTTTAC GGGTTATATG GTTTCACGAT CATTGCCGTC 
GCCGGCTTTG GGATTTTCGG ACGCCACCCG GAATTGCTTG TCCGCTGGCC GGAGCTTGCT
GCGTTCTACG CCCGGTCTTT TGCGCTTTTT GCGCGGGTGC ATGTGCTGCT GACGGCTTTT
GTCCTGTTCG CGTACATGGG GCGTCGTGTC GGTGGGCGCT GGGTGCCGGC CGGACTGCTC
GTCTACGGCG TGAGCCTACT CAGCGAGACG CTGGGGACCA CGTATGGCGT GCCTTTCGGG
ACCTATGGTT ACACCACCCT GCTCGGAGGA AAGTGGTTCG GCCGCGTGCC CTATCTGATT
CCGCTCAGCT GGTTTGTGAT GGCGGTGCCC TGTTATGTGC TGGCCCGCGC CGCCTTTTCG
GAGCGTCGGC AGTGGCCGGC TCGGCTGCTG CTGGCAACCT ATCTGCTGGT GGCCTGGGAT
CTGAGTCTGG ATCCGGCCAT GAGCTACCTG ACGTCGTACT GGACCTGGGG GGAGACCGGC
CCTTACTACG GGATGCCGCT GATCAATCTG GCCGGATGGG CGCTGACCGG TCTGGTGATT
ATGGGCGTAC TGGAGGCGAT GCGCGCGTTT CGCTGGACCG AAGCGTTCAG CGTGCAGTGG
ATGGCGGTGT TCTATGGAGC GGTGTTGCTG ATGCCGCTCG GTATGGTGGC TGTGGCCGGT
CTCTGGGGGG CTGTCGCTGC TACGGTCGCC GCACTGGGCC TGGCGGGAAG TGTCGTCTGG
CTGATCCGGC GCAGGCGGCC GCGCATGGAT ACAAAGGGCG CGCTTCCCGC GCGGGATGCC
TTCGAGGAAG ATGGCACGCG CTTTTTCGCA GCACATGCCC GTTCGTTTTC TTTTGCCGCG
CGGCTGTTTC CGAAGGACTT TCGCCGGGAA GTTGTCCTGC TCTATGGTTT CTGTCGGCTT
ACGGACGATC TGGTAGACGG CGCATCGACG CAGGTTGCGC CCGAGTTGCT GCAGAAGCGT
CTGGATCGGT GGCAACGCCA GGTACGGATG GCCTACGAGG GGCGTCCTTC CGGACTCCCC
TGGCTCGATC GGCTTATGCA ACGCTCGCGC CAGGCCGGAT TGCCCTGGGA AGTCGTGCAG
GCGCTGCTGG ACGGCGTGCG CCAGGACATC GGGCCGGTCC GGGTGGCTTC CTATGAAGAA
CTGGATCGCT ACGCCTACCG CGTGGGTTCG ACGGTGGGCG TCTGGATGTG CTATCTGATG
GGGGTGCGCA TGCCCCGATT GCTTGCGCGC GCCGAAGCGC TCGGCCGCGC CATGCAGTAC
ACGAACATCG TGCGCGACGT GGGGGAAGAT CTGCAGCGCG ATCGCCTCTA TCTGCCAGCG
GATCGGATGG CCGCCTATGG ACTGGACCTC GCGGATCTGC TACGTATGCA GCAGACCGGC
GTGCTCGATC CTTCCTATGT GGCGCTGCTG GAAGAACTCA TGCAGCAGGC CGAGCGCGAC
TATGAGGCCG CCTGGGAGGC CATTCCGGCC CTGCCACCGC GCGTTCGCGG TGCCATTGCC
GTGGCCGCCG AGGTCTATCG GGGGATTCAT GCAGTGCTTC GCCAGAACCA CTACGACAAT
CTGACGCGTC GCGCCTACAC GACGCTCCCC GAAAAAATCG GTCTTTCGGT GGCTGCACTG
CGTCGCCTGC GTCGGGCGGT TTTGATAACA GGTATGCAGG CCCTATGA
 
Protein sequence
MSRFDRLLLY GLYGFTIIAV AGFGIFGRHP ELLVRWPELA AFYARSFALF ARVHVLLTAF 
VLFAYMGRRV GGRWVPAGLL VYGVSLLSET LGTTYGVPFG TYGYTTLLGG KWFGRVPYLI
PLSWFVMAVP CYVLARAAFS ERRQWPARLL LATYLLVAWD LSLDPAMSYL TSYWTWGETG
PYYGMPLINL AGWALTGLVI MGVLEAMRAF RWTEAFSVQW MAVFYGAVLL MPLGMVAVAG
LWGAVAATVA ALGLAGSVVW LIRRRRPRMD TKGALPARDA FEEDGTRFFA AHARSFSFAA
RLFPKDFRRE VVLLYGFCRL TDDLVDGAST QVAPELLQKR LDRWQRQVRM AYEGRPSGLP
WLDRLMQRSR QAGLPWEVVQ ALLDGVRQDI GPVRVASYEE LDRYAYRVGS TVGVWMCYLM
GVRMPRLLAR AEALGRAMQY TNIVRDVGED LQRDRLYLPA DRMAAYGLDL ADLLRMQQTG
VLDPSYVALL EELMQQAERD YEAAWEAIPA LPPRVRGAIA VAAEVYRGIH AVLRQNHYDN
LTRRAYTTLP EKIGLSVAAL RRLRRAVLIT GMQAL