Gene Rmar_1049 details


Gene Information

Locus tag: Rmar_1049
Symbol:
ID: 8567690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodothermus marinus DSM 4252
Kingdom: Bacteria
Replicon accession: NC_013501
Strand:
Start bp: 1198585
End bp: 1199793
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: tryptophan synthase, beta subunit
Protein accession: YP_003290329
Protein GI: 268316610
COG category:
COG ID:
TIGRFAM ID:
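
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon excluded from the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1198585, 1199793)
print(length)                  # 1209, matching "Gene Length"
print(protein_length(length))  # 402, matching "Protein Length"
```

Under these assumptions both values agree with the fields listed in the record.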


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.644397
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACCG CCGAGACCGA ACTGCTGACC TACGAGGCGC CGGACGCCAC CGGACACTTC 
GGCCCCTACG GGGGCGCATT TGTGCCCGAG ACGCTGGTGC CGGCGCTGGA AGCGTTGAAG
GCGGCCTACG CCGAGGCGCG TCAGGATCCG GGCTTCTGGG AGGAATACCA CGCCCTGCTC
CGGGAATATG TGGGTCGGCC CACGCCGCTC ACGTTCGCAC CGCGCCTCAG CGAAGCGCTG
GGCGGGCTGC AGATCTACCT GAAGCGGGAG GACCTGTGCC ACACGGGTGC CCACAAGATC
AACAACACGA TCGGCCAGAT CCTGCTGGCC CGGCGCATGG GCAAGACGCG CATCATCGCC
GAGACGGGCG CCGGACAGCA CGGCGTGGCG ACGGCCACGG TGTGCGCCCG CTTTGGAATG
CAGTGCGTCG TTTACATGGG CGCCGAAGAT GTGGAGCGCC AGCACCTGAA CGTGCTGCGC
ATGCAGTTGC TGGGCGCCGA GGTGCGACCC GTCGAGAGCG GGAGCCGCAC GCTCAAAGAC
GCCACGAACG AGGCCATCCG CGACTGGGTG ACGAACGTCC ACGACACGTT CTACCTGATC
GGCTCGGTGG TGGGACCGCA CCCGTACCCG ATGCTCGTGC GCGACTTTCA GCGCGTGATC
GGCGACGAGG TGCGGCGGCA ACTGGCCGAA CGCATCGGCC GGGAGACGCC CGACGCACTG
GTGGCCTGCG TGGGCGGCGG CTCGAACGCC ATGGGCTTGT TCTATCCGTT CCTGAACGAC
CGCCATGTGC GCATGTACGG CGTGGAGGCG GCCGGCGAGG GGCTTGACCG CCGTCATGCC
GCCACGCTCA CCTGCGGGCG GCCCGGCATC CTGCACGGCG CCATGAGCTA TCTGTTGCAG
GACGACGACG GTCAGGTGCA GCTGGCCCAT TCCATTTCGG CGGGGCTGGA TTACCCGGGG
GTGGGTCCCG AGCATGCCTA CCTGAAGGAT CTGGGGCGCG TCACCTACGT GACGGCCACC
GACGAGGAGG CGCTGGAAGG CGTGCGGCTA TTGGCCCGCA CCGAAGGGAT TATTCCGGCG
CTGGAAACGG CGCACGCCAT CGCGTTTCTG CCCCTCCTGG CCCGCGAGCT GGGGCCGGAC
GCCGTCGTGG TGGTCAACCT GTCCGGCCGC GGCGACAAAG ACATGGGCAC CATTGCACGG
TATATGTAA
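
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is measured or analysed. The sketch below is one possible way to do that in Python and to compute a GC fraction. It is not the method the database used for its "GC content" field, and the function names are hypothetical. Only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGTCGACCG CCGAGACCGA ACTGCTGACC TACGAGGCGC CGGACGCCAC CGGACACTTC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 (six blocks of 10 bases)
print(f"{gc_fraction(seq):.1%}")  # GC percentage of this 60-base fragment only
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any run of whitespace, including line breaks.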
 
Protein sequence
MSTAETELLT YEAPDATGHF GPYGGAFVPE TLVPALEALK AAYAEARQDP GFWEEYHALL 
REYVGRPTPL TFAPRLSEAL GGLQIYLKRE DLCHTGAHKI NNTIGQILLA RRMGKTRIIA
ETGAGQHGVA TATVCARFGM QCVVYMGAED VERQHLNVLR MQLLGAEVRP VESGSRTLKD
ATNEAIRDWV TNVHDTFYLI GSVVGPHPYP MLVRDFQRVI GDEVRRQLAE RIGRETPDAL
VACVGGGSNA MGLFYPFLND RHVRMYGVEA AGEGLDRRHA ATLTCGRPGI LHGAMSYLLQ
DDDGQVQLAH SISAGLDYPG VGPEHAYLKD LGRVTYVTAT DEEALEGVRL LARTEGIIPA
LETAHAIAFL PLLARELGPD AVVVVNLSGR GDKDMGTIAR YM