Gene Rmar_1723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmar_1723 
Symbol 
ID8568375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodothermus marinus DSM 4252 
KingdomBacteria 
Replicon accessionNC_013501 
Strand
Start bp1998316 
End bp1999998 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content67% 
IMG OID 
ProductPHP domain protein 
Protein accessionYP_003290995 
Protein GI268317276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00699818 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACC GAGACGTTGC CCGCCTGCTG CGTGAGACGG CCCGCCTGCT GGAGCTTCGC 
GGCGAAAATC CGTTTCGCGT GCGGGCCTAC GAGCAGGCCG CCGAAGCCAT CGAGCAACTG
GACGAACCCG TCGCCGAGCG GGTGCGACAG GGCACGCTCA CCGAGGTGCC CGGCATCGGT
CGGGGTCTGG CGGCCCAGAT TCAGGAACTG GTCGAACGGG GCACTTCGGA GATGCTGGAG
CGCCTCCGGC AAGAACTGCC GCCGGGGCTT CCGGAGCTGC TCACGCTGAA AGGTCTGGGT
CCCCAGCGTG TGCGCCAACT CTGGCAGACG CTGGGCATCG CCTCGCTGGA TGACCTGGAG
GATGCACTTC GCGACGGGCG CCTGAACCAG CTCAAAGGCT TTGGTCCACG CCTGCACGAA
CAACTGCTCC ATGCGCTGTC GCTGCGCCGG CGCTACCGTG CGCTTCGCCT GCTGGCCCAG
GTACTGCCCG AAGCCGAAGC GCTCCGCGAA CGGCTTCGGC AGCAGCCCGG CGTGATCCGC
GTCGAGCTGG CCGGGGCCGT CCGGCGTCTG ATGGAGGTGG TGGACCGCGT GGAACTGGTC
GTGGCCGGAT CGGCGGAAGC CGTGCAGCAG GTACTTCCGC AGCTCCGGCA ACAATCCGGC
CTCCACGGCG GAATGCTACT CGAAGGCGCC CTGCCGGATG GTTTTCCCGT CCGGGTGGCG
CTGACCACGC CGGACGCCTT CGGCACCGTG CTCTGGTGGC ATACCGGCTC GGAAGCCCAC
TGCCGGACGT TCGTCCGCAC CTACGGCGCC CCGGAGCCCT GTCCGGAGGA AGCCACCATC
TACGAACAGG CGGGTCTGCC CTTCATTCCA GCCGAGCTGC GCGAAGACCG CGGCGAACTG
GAAGCAGCGG CCCACCACGC GCTGCCCGCG TTGATCGAAC TGGAAGACCT CCGGGGCGTG
CTGCACAACC ATTCCACCTA CAGCGACGGC CGCAATTCCC TTCGCGAAAT GGCCGAGGCC
GCCTGCAACC GGGGCTTCCG CTATTTCGGA ACGGGCGATC ACAGCCAGTC GCTCACCATC
GCCCGTGGGC TTTCGATCGC CGAAGTGCGC CGCCAGCAGG AAGAGATCCA GACCCTGAAC
GAGCAGTTCG CCGCACGGGG CTTTCGGATC CTGAGCGGCA CCGAGTGCGA CATCCTGCCC
GACGGATCGC TGGACTACCC CGACGACGTG CTGGCCAGCT TCGATTATGT GGTGGCCAGC
GTGCATACCC GGCTGAACAT GGACGAAAAG ACGGCCACCG AGCGCATCCT GCGTGCCCTG
CGCAATCCAC ATGTTACGAT ACTGGGCCAC CCGACCGGCC GCCTGCTGCT GCGACGTGAG
GGCTATCCGC TGGACTGGCC TCGAATCATC GACGCCTGCG CCACCTATCG CGTCGCCCTC
GAACTGAACG CCAACCCGTA TCGGCTCGAC ATCGACTGGC GGCGCGTTCG CGATGCCACG
GCTGCCGGCG TGCCCATCGT GATCAATCCG GACGCCCACG CCATCGACGA ACTGGACCAC
GTGCGCTGGG GCGTGGCCGC CGCCCGCAAA GGCTGGCTCA CGCCTGAGGC CTGCCTGAAC
GCCCGGGATC TGGACGAACT GCTCGCCTGG CTCCACCAGC GTCGCCAATC CGTTCAGCCA
TGA
 
Protein sequence
MENRDVARLL RETARLLELR GENPFRVRAY EQAAEAIEQL DEPVAERVRQ GTLTEVPGIG 
RGLAAQIQEL VERGTSEMLE RLRQELPPGL PELLTLKGLG PQRVRQLWQT LGIASLDDLE
DALRDGRLNQ LKGFGPRLHE QLLHALSLRR RYRALRLLAQ VLPEAEALRE RLRQQPGVIR
VELAGAVRRL MEVVDRVELV VAGSAEAVQQ VLPQLRQQSG LHGGMLLEGA LPDGFPVRVA
LTTPDAFGTV LWWHTGSEAH CRTFVRTYGA PEPCPEEATI YEQAGLPFIP AELREDRGEL
EAAAHHALPA LIELEDLRGV LHNHSTYSDG RNSLREMAEA ACNRGFRYFG TGDHSQSLTI
ARGLSIAEVR RQQEEIQTLN EQFAARGFRI LSGTECDILP DGSLDYPDDV LASFDYVVAS
VHTRLNMDEK TATERILRAL RNPHVTILGH PTGRLLLRRE GYPLDWPRII DACATYRVAL
ELNANPYRLD IDWRRVRDAT AAGVPIVINP DAHAIDELDH VRWGVAAARK GWLTPEACLN
ARDLDELLAW LHQRRQSVQP