Gene RoseRS_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0887 
Symbol 
ID5207833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1101214 
End bp1102404 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content62% 
IMG OID640594504 
ProductPUA domain-containing protein 
Protein accessionYP_001275249 
Protein GI148655044 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.201837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000193307 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACAACCG TCACGTTGCA GCCAGGCAAA GAGCGGCCCG TGGTTCAACG CCATCCGTGG 
GTATTTTCCG GTGCAATCGC GCGTATTCAG GGTCGTCAGC CCGAACGCGG CGCGGTGGTC
GATGTGCGTT CCGCCGAGGG CGAGTGGCTG GCGCGCGGGT GCTGGAGCGC CGGATCGCAG
ATTCGCATCC GCCTGTTTAC ATGGGAGCCG GACGAACCGA TCGATGATGC GCTTATCCGT
CGGCGCATCG AGCGGGCGAT TGATGGTCGC CGCAGACTCG GCATGCTCGC CAACGAAGGA
GCATGCCGCC TGATCTATGC CGAGTCTGAC GGCATACCCG GACTGATCGT CGATTACTAT
GCTGGCTTTC TGGTGGTGCA ACTGTTGACC CAGGCAATGG CGCAGCGCAG TGCAGCTGTG
ACGCGCATCC TGGTGGAAAC CCTCGCGCCG CGCGGCATTT ATGAGCGCAG CGACGCCGAT
GTGCGCGAGA AGGAGGGTCT CCCGCCAGCA TCCGGCGTTC TCTGGGGTGA AACGCCGCCC
GCACGCCTGC GTATGCGCCT TCCGGGCGAC ATCTGGCACG TGGTAGACCT GGGCGCCGGG
CAAAAAACCG GGGCCTATCT GGATCAGGCG TTCAATCGGT TGCGCGTTGC GGCGCACTGC
AACGGTGCGG AGACGCTTGA CTGCTTTTGT TACACCGGCG GCTTCACGAT TGCAGCAGCG
CGCGCAGGGG CGCGCCACAT CACGGCAGTC GACACCAGCG AGGCGGCACT GAGTATGCTT
CGTGAGGGTC TGACCCTCAA CATGGTCGCA ACACCGGTGG AAACAGCGCC TGGCGATGCC
TTCAAACTGC TGCGGCGCTA TCGCGAGGAA CAGCGTCGTT TCGATGTCGT CATTCTCGAT
CCGCCCAAAT TCGCCACTTC GCAATCGCAG GTTGAACGTG CTACCCGCGG CTATAAGGAC
ATTAACATGC AGGCGATGCA CCTGCTGCGT CCCGGCGGCA TTCTGGCGAC CTTCTCGTGC
TCAGGGCTGG TGTCTGCCGA CCTGTTCCAG AAGGTGGTGT TCGGCGCAGC GCTCGACGCA
CACCGCGATG TGCAGATCAT CGAACGGCTG TCCCAAAGCC CCGATCATCC GGTGCTGCTG
ACCTTTCCCG AAGGAGAATA CCTGAAAGGT CTGATCTGTC GTGTGTGGTA G
 
Protein sequence
MTTVTLQPGK ERPVVQRHPW VFSGAIARIQ GRQPERGAVV DVRSAEGEWL ARGCWSAGSQ 
IRIRLFTWEP DEPIDDALIR RRIERAIDGR RRLGMLANEG ACRLIYAESD GIPGLIVDYY
AGFLVVQLLT QAMAQRSAAV TRILVETLAP RGIYERSDAD VREKEGLPPA SGVLWGETPP
ARLRMRLPGD IWHVVDLGAG QKTGAYLDQA FNRLRVAAHC NGAETLDCFC YTGGFTIAAA
RAGARHITAV DTSEAALSML REGLTLNMVA TPVETAPGDA FKLLRRYREE QRRFDVVILD
PPKFATSQSQ VERATRGYKD INMQAMHLLR PGGILATFSC SGLVSADLFQ KVVFGAALDA
HRDVQIIERL SQSPDHPVLL TFPEGEYLKG LICRVW