Gene Information |
Locus tag | RoseRS_0330 |
Symbol | |
ID | 5207265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 419550 |
End bp | 421157 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640593956 |
Product | phosphodiesterase |
Protein accession | YP_001274712 |
Protein GI | 148654507 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG; [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
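The coordinate, gene length, and protein length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(419550, 421157)
print(length)                  # 1608
print(protein_length(length))  # 535
```

Strand does not affect the arithmetic: a gene on the minus strand spans the same interval, only read in the reverse-complement direction.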
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000340911 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00581399 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTTTGAGC CCGTGGCCAC AGCCCGGCAC TGGGCTGTAG CATGCTGCAC CTTTACAATC GAGCAGGAGG ATGTTGTGCA ATGGCTCATA TGGGCCTTGC CTGCCCTGGT GATCGGACTT GCCATTGGCG CAGGCATAGG CATTCTTATC TACAAAAAGA GTGTACAGAG CCAGATTCGA CAGATCGAGG CGGAAGCCCG ACTGCAACTG GAAGCGACGC GATCTGAGCA GAAAGATCTC ATTTTGCGCG CGACCGACGA GGCGCTTCGG CTCCGCGCTG AAGCCGAGGC GCAAATACGC GAGGCGCGCG CAGCGCTGGC GAAGCAGGAA GAACGGTTGC AACGGAAAGA GGAAAATCTT GACCGCAAAA TCGAGGGACT CGAACGGCGT GAACGTCAAC TCCAGCAACG CGAGCGCCAG ATGGAGCAAC TTCATCAGGA AGCTGAACAA CTGCGTCAGC AGCAACGCGC GGAACTCGAG CGGATTTCCG CGTTGAGCCA GGAAGAAGCC CGTGCGATTA TTCTGAAACG CGTCGAGGAT GAAACGCGCG ACGAGGCGGC ACGCCGTATT CGTGAAATCG AAAAGAACGT TCGCGAAGAG GCGGATAAAC TGGCACGCAA AGTGATCAGC ATGGCCATTC AGCGGTGCGC TTCGGAGTAT GTCGCTGAAG TGACCGTCTC CACCGTCGCG CTCCCGAGTG AAGAGTTGAA AGGACGAATC ATCGGGCGTG AGGGGCGCAA CATTCGCGCC TTCGAGCAAC TTACCGGCGT TGATATTATT GTCGATGACA CGCCGGAAGC GGTGACCCTC TCGTGCCACG ATCCGGTGCG GCGGGAAGTG GCGCGACTGG CGTTGATCAA GTTGCTCAAG GATGGGCGCA TCCATCCGAC GCGGATTGAA GAAGTCGTCA GCAAAACCCA GCAGGAAGTC GAACAGATCA TGCGCGAAGA GGGCGAACGA GTCGCCTACG AAGCGAACGT TCAGGGGTTG CACCCCGACC TGATCAAACT GCTGGGGCGA CTGAAATATC GCACGAGTTA CGGGCAGAAT GTTCTGCAGC ATTCGCTGGA GTGCGCTCTG CTCGCAGCGC ATATTGCTGC CGAGATCGGC GCAAACATCA ATGTGGCGAA AACTGCCGCG TTACTGCACG ATATCGGGAA AGCAGTCGAC CACGAAGTGC AGGGACCGCA CGCATTGATC GGTGCTGAGA TTGCGCGTCG CCTTGGAAAA TCGCCGGCAA TTGTGCACGC GATTGCCGCC CATCATAACG ATGAAGAACC GCAAACCGTC GAAGCCTGGT TGGTGCAGGC TGTCGACGCC ATCTCCGGCG GGCGCCCCGG AGCGCGCCGC GAAACGCTCG ACCTGTATAT CAAGCGCCTC GAAGCGCTCG AAACGGTCGC AACGTCATTT ACAGGCGTTC AACGCGCCTT TGCTGTTCAG GCCGGGCGCG AGGTTCGCGT GATGGTGCAA CCGGATGCTA TCGACGACCT CGGCAGTATT CACCTTGCCC GTGATGTTGC CAAAAAGATC GAGGAGAGTC TTCAGTATCC CGGCCAGATC AAGGTGACGG TCATCCGCGA GACGCGCGCA GTGGACTATG CACGTTGA
|
Protein sequence | MFEPVATARH WAVACCTFTI EQEDVVQWLI WALPALVIGL AIGAGIGILI YKKSVQSQIR QIEAEARLQL EATRSEQKDL ILRATDEALR LRAEAEAQIR EARAALAKQE ERLQRKEENL DRKIEGLERR ERQLQQRERQ MEQLHQEAEQ LRQQQRAELE RISALSQEEA RAIILKRVED ETRDEAARRI REIEKNVREE ADKLARKVIS MAIQRCASEY VAEVTVSTVA LPSEELKGRI IGREGRNIRA FEQLTGVDII VDDTPEAVTL SCHDPVRREV ARLALIKLLK DGRIHPTRIE EVVSKTQQEV EQIMREEGER VAYEANVQGL HPDLIKLLGR LKYRTSYGQN VLQHSLECAL LAAHIAAEIG ANINVAKTAA LLHDIGKAVD HEVQGPHALI GAEIARRLGK SPAIVHAIAA HHNDEEPQTV EAWLVQAVDA ISGGRPGARR ETLDLYIKRL EALETVATSF TGVQRAFAVQ AGREVRVMVQ PDAIDDLGSI HLARDVAKKI EESLQYPGQI KVTVIRETRA VDYAR
|
| |
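The GC content and protein sequence fields can be recomputed from the gene sequence. The following is a sketch in plain Python rather than the pipeline that produced this record: it assumes the sequence is given 5' to 3' in coding orientation (as displayed above) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. Table 11 differs in its permitted start codons; since this gene starts with ATG, the sketch does not handle alternative starts. The names `clean`, `gc_content`, and `translate` are illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group bases and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
prefix = "ATGTTTGAGC CCGTGGCCAC AGCCCGGCAC"
print(translate(prefix))  # MFEPVATARH
print(f"{gc_content(prefix):.0%}")
```

Passing the full gene sequence to `gc_content` and `translate` allows a comparison against the GC content and protein sequence rows of this record.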