Gene Information |
Locus tag | RoseRS_0887 |
Symbol | |
ID | 5207833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1101214 |
End bp | 1102404 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640594504 |
Product | PUA domain-containing protein |
Protein accession | YP_001275249 |
Protein GI | 148655044 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.201837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000193307 |
Fosmid hitchhiking | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGACAACCG TCACGTTGCA GCCAGGCAAA GAGCGGCCCG TGGTTCAACG CCATCCGTGG GTATTTTCCG GTGCAATCGC GCGTATTCAG GGTCGTCAGC CCGAACGCGG CGCGGTGGTC GATGTGCGTT CCGCCGAGGG CGAGTGGCTG GCGCGCGGGT GCTGGAGCGC CGGATCGCAG ATTCGCATCC GCCTGTTTAC ATGGGAGCCG GACGAACCGA TCGATGATGC GCTTATCCGT CGGCGCATCG AGCGGGCGAT TGATGGTCGC CGCAGACTCG GCATGCTCGC CAACGAAGGA GCATGCCGCC TGATCTATGC CGAGTCTGAC GGCATACCCG GACTGATCGT CGATTACTAT GCTGGCTTTC TGGTGGTGCA ACTGTTGACC CAGGCAATGG CGCAGCGCAG TGCAGCTGTG ACGCGCATCC TGGTGGAAAC CCTCGCGCCG CGCGGCATTT ATGAGCGCAG CGACGCCGAT GTGCGCGAGA AGGAGGGTCT CCCGCCAGCA TCCGGCGTTC TCTGGGGTGA AACGCCGCCC GCACGCCTGC GTATGCGCCT TCCGGGCGAC ATCTGGCACG TGGTAGACCT GGGCGCCGGG CAAAAAACCG GGGCCTATCT GGATCAGGCG TTCAATCGGT TGCGCGTTGC GGCGCACTGC AACGGTGCGG AGACGCTTGA CTGCTTTTGT TACACCGGCG GCTTCACGAT TGCAGCAGCG CGCGCAGGGG CGCGCCACAT CACGGCAGTC GACACCAGCG AGGCGGCACT GAGTATGCTT CGTGAGGGTC TGACCCTCAA CATGGTCGCA ACACCGGTGG AAACAGCGCC TGGCGATGCC TTCAAACTGC TGCGGCGCTA TCGCGAGGAA CAGCGTCGTT TCGATGTCGT CATTCTCGAT CCGCCCAAAT TCGCCACTTC GCAATCGCAG GTTGAACGTG CTACCCGCGG CTATAAGGAC ATTAACATGC AGGCGATGCA CCTGCTGCGT CCCGGCGGCA TTCTGGCGAC CTTCTCGTGC TCAGGGCTGG TGTCTGCCGA CCTGTTCCAG AAGGTGGTGT TCGGCGCAGC GCTCGACGCA CACCGCGATG TGCAGATCAT CGAACGGCTG TCCCAAAGCC CCGATCATCC GGTGCTGCTG ACCTTTCCCG AAGGAGAATA CCTGAAAGGT CTGATCTGTC GTGTGTGGTA G
|
Protein sequence | MTTVTLQPGK ERPVVQRHPW VFSGAIARIQ GRQPERGAVV DVRSAEGEWL ARGCWSAGSQ IRIRLFTWEP DEPIDDALIR RRIERAIDGR RRLGMLANEG ACRLIYAESD GIPGLIVDYY AGFLVVQLLT QAMAQRSAAV TRILVETLAP RGIYERSDAD VREKEGLPPA SGVLWGETPP ARLRMRLPGD IWHVVDLGAG QKTGAYLDQA FNRLRVAAHC NGAETLDCFC YTGGFTIAAA RAGARHITAV DTSEAALSML REGLTLNMVA TPVETAPGDA FKLLRRYREE QRRFDVVILD PPKFATSQSQ VERATRGYKD INMQAMHLLR PGGILATFSC SGLVSADLFQ KVVFGAALDA HRDVQIIERL SQSPDHPVLL TFPEGEYLKG LICRVW
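
The record's fields can be cross-checked against one another: the gene length should follow from the start and end coordinates, the protein length from the gene length, and the protein sequence from translating the gene sequence. The following is a minimal Python sketch of those checks, not part of the source database. The helper names (`clean`, `gene_length`, `gc_fraction`, `translate`) are hypothetical and introduced only for illustration. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAG, even though the gene is on the minus strand) and that internal codons in translation table 11 are read as in the standard genetic code.

```python
# Illustrative consistency checks for a gene record (hypothetical helpers).

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; translation
# table 11 uses the same codon assignments and differs only in start codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a gene given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Assumption: the sequence starts with ATG. An alternative start codon
    (e.g. GTG or TTG) would need to be mapped to M separately.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    length = gene_length(1101214, 1102404)
    print(length)           # 1191, matching the Gene Length field
    print(length // 3 - 1)  # 396 codons before the stop, matching Protein Length
    # First three 10-base blocks of the gene sequence field:
    print(translate("ATGACAACCG TCACGTTGCA GCCAGGCAAA"))  # MTTVTLQPGK
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence field, or passing it to `gc_fraction` and comparing with the GC content field, extends the same check to the whole record.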