Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmar_1723 |
Symbol | |
ID | 8568375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodothermus marinus DSM 4252 |
Kingdom | Bacteria |
Replicon accession | NC_013501 |
Strand | + |
Start bp | 1998316 |
End bp | 1999998 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | PHP domain protein |
Protein accession | YP_003290995 |
Protein GI | 268317276 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00699818 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACC GAGACGTTGC CCGCCTGCTG CGTGAGACGG CCCGCCTGCT GGAGCTTCGC GGCGAAAATC CGTTTCGCGT GCGGGCCTAC GAGCAGGCCG CCGAAGCCAT CGAGCAACTG GACGAACCCG TCGCCGAGCG GGTGCGACAG GGCACGCTCA CCGAGGTGCC CGGCATCGGT CGGGGTCTGG CGGCCCAGAT TCAGGAACTG GTCGAACGGG GCACTTCGGA GATGCTGGAG CGCCTCCGGC AAGAACTGCC GCCGGGGCTT CCGGAGCTGC TCACGCTGAA AGGTCTGGGT CCCCAGCGTG TGCGCCAACT CTGGCAGACG CTGGGCATCG CCTCGCTGGA TGACCTGGAG GATGCACTTC GCGACGGGCG CCTGAACCAG CTCAAAGGCT TTGGTCCACG CCTGCACGAA CAACTGCTCC ATGCGCTGTC GCTGCGCCGG CGCTACCGTG CGCTTCGCCT GCTGGCCCAG GTACTGCCCG AAGCCGAAGC GCTCCGCGAA CGGCTTCGGC AGCAGCCCGG CGTGATCCGC GTCGAGCTGG CCGGGGCCGT CCGGCGTCTG ATGGAGGTGG TGGACCGCGT GGAACTGGTC GTGGCCGGAT CGGCGGAAGC CGTGCAGCAG GTACTTCCGC AGCTCCGGCA ACAATCCGGC CTCCACGGCG GAATGCTACT CGAAGGCGCC CTGCCGGATG GTTTTCCCGT CCGGGTGGCG CTGACCACGC CGGACGCCTT CGGCACCGTG CTCTGGTGGC ATACCGGCTC GGAAGCCCAC TGCCGGACGT TCGTCCGCAC CTACGGCGCC CCGGAGCCCT GTCCGGAGGA AGCCACCATC TACGAACAGG CGGGTCTGCC CTTCATTCCA GCCGAGCTGC GCGAAGACCG CGGCGAACTG GAAGCAGCGG CCCACCACGC GCTGCCCGCG TTGATCGAAC TGGAAGACCT CCGGGGCGTG CTGCACAACC ATTCCACCTA CAGCGACGGC CGCAATTCCC TTCGCGAAAT GGCCGAGGCC GCCTGCAACC GGGGCTTCCG CTATTTCGGA ACGGGCGATC ACAGCCAGTC GCTCACCATC GCCCGTGGGC TTTCGATCGC CGAAGTGCGC CGCCAGCAGG AAGAGATCCA GACCCTGAAC GAGCAGTTCG CCGCACGGGG CTTTCGGATC CTGAGCGGCA CCGAGTGCGA CATCCTGCCC GACGGATCGC TGGACTACCC CGACGACGTG CTGGCCAGCT TCGATTATGT GGTGGCCAGC GTGCATACCC GGCTGAACAT GGACGAAAAG ACGGCCACCG AGCGCATCCT GCGTGCCCTG CGCAATCCAC ATGTTACGAT ACTGGGCCAC CCGACCGGCC GCCTGCTGCT GCGACGTGAG GGCTATCCGC TGGACTGGCC TCGAATCATC GACGCCTGCG CCACCTATCG CGTCGCCCTC GAACTGAACG CCAACCCGTA TCGGCTCGAC ATCGACTGGC GGCGCGTTCG CGATGCCACG GCTGCCGGCG TGCCCATCGT GATCAATCCG GACGCCCACG CCATCGACGA ACTGGACCAC GTGCGCTGGG GCGTGGCCGC CGCCCGCAAA GGCTGGCTCA CGCCTGAGGC CTGCCTGAAC GCCCGGGATC TGGACGAACT GCTCGCCTGG CTCCACCAGC GTCGCCAATC CGTTCAGCCA TGA
|
Protein sequence | MENRDVARLL RETARLLELR GENPFRVRAY EQAAEAIEQL DEPVAERVRQ GTLTEVPGIG RGLAAQIQEL VERGTSEMLE RLRQELPPGL PELLTLKGLG PQRVRQLWQT LGIASLDDLE DALRDGRLNQ LKGFGPRLHE QLLHALSLRR RYRALRLLAQ VLPEAEALRE RLRQQPGVIR VELAGAVRRL MEVVDRVELV VAGSAEAVQQ VLPQLRQQSG LHGGMLLEGA LPDGFPVRVA LTTPDAFGTV LWWHTGSEAH CRTFVRTYGA PEPCPEEATI YEQAGLPFIP AELREDRGEL EAAAHHALPA LIELEDLRGV LHNHSTYSDG RNSLREMAEA ACNRGFRYFG TGDHSQSLTI ARGLSIAEVR RQQEEIQTLN EQFAARGFRI LSGTECDILP DGSLDYPDDV LASFDYVVAS VHTRLNMDEK TATERILRAL RNPHVTILGH PTGRLLLRRE GYPLDWPRII DACATYRVAL ELNANPYRLD IDWRRVRDAT AAGVPIVINP DAHAIDELDH VRWGVAAARK GWLTPEACLN ARDLDELLAW LHQRRQSVQP
|
| |