Gene RoseRS_4184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4184 
Symbol 
ID5211168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5240126 
End bp5241178 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content47% 
IMG OID640597773 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001278478 
Protein GI148658273 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0040014 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000116244 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCGTC ACCATCCACC TATCCCTGTT GAAATCGAAC GGCTTCCCAA ACATTTACAA 
CCGACATTTC TCAAACTCTA TAGCGCAGAT CATCCAGAAA ATGGCGGCAT GTATTTCCTT
GAGCCAGACA ACATCTACTG TGGCGATGCC CGCCAACTTT TGCCCCAAAT AGAGCCGAAC
AGCATTGCCT TGAGTGTATG GTCGCCGCCA TACTTCGTCG GGAAAGAGTA TGAAGAGCAT
CTTTCCTTCG ATGAGTGGAA AGATCTTCTT CGTACTGTGA TACATTTACA CTTTCCAATT
ATCAAGCCCG GTGGTTTTCT GGTCATCAAC ATTGCAGATA TACTGGTGTT TAAGGATCCC
CACATGCCTC GCATCCAGGC AGAGGCAGTC AACAGGAAGC GATCTCCCGT TACCAGGGAA
GATATTCTGC GAGCAATTGA ACAACACCCT GACTTTAATC GTTATCAGCT CGCAGAACTT
TTAGGTTGCA GCGAGCAGAC AATAGACAGA CGACTCAACG GCAACAATAT CCGCGGCGGT
AAATATGACA TCCAAACCCG GGTCAAGATC GTGGGGGGTC TTGTTGAAGA ATGGGCGCTG
GATGCAGGCT TTTTCACTTA CGACCGGCGG ATATGGGTCA AGGATGCCGC ATGGGAAAAC
TCTCGCTGGG CAAGTCTTTC CTATCGCTCT GTTGACGAGT TTGAATATAT TTTCTTTTTC
TGGAAACCTG GTGTTACTAA GTTTGACCGG AGAAGATTAT CTTCTGATGA ATGGCGAGAT
TGGGGATCGA GAGGAGTATG GCGCATTCCC TCGGTCCGGT CAAACGATGA TCACGAGGCA
AAATTTCCGG TCGAATTGCC TTCCAGAGCC ATCAAACTTC TTACCGATCC GGGTGATATT
GTGCTGGATT GTTTTATTGG AAGCGGTACA ACAGCAATAG CAGCGATCCG TGCTGGTCGT
CGGTATATAG GCATCGATAT TCTCCAGAAG TATGTTGATC TGGCAAGAAA TAATATCAGG
AGGGAGTTAC AGCAAATTAG TATGGAGATA TAA
 
Protein sequence
MNRHHPPIPV EIERLPKHLQ PTFLKLYSAD HPENGGMYFL EPDNIYCGDA RQLLPQIEPN 
SIALSVWSPP YFVGKEYEEH LSFDEWKDLL RTVIHLHFPI IKPGGFLVIN IADILVFKDP
HMPRIQAEAV NRKRSPVTRE DILRAIEQHP DFNRYQLAEL LGCSEQTIDR RLNGNNIRGG
KYDIQTRVKI VGGLVEEWAL DAGFFTYDRR IWVKDAAWEN SRWASLSYRS VDEFEYIFFF
WKPGVTKFDR RRLSSDEWRD WGSRGVWRIP SVRSNDDHEA KFPVELPSRA IKLLTDPGDI
VLDCFIGSGT TAIAAIRAGR RYIGIDILQK YVDLARNNIR RELQQISMEI