Gene RoseRS_3562 details

Gene Information

Locus tag: RoseRS_3562
Symbol:
ID: 5210540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4457051
End bp: 4458418
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 65%
IMG OID: 640597157
Product: adenine-specific DNA methylase
Protein accession: YP_001277869
Protein GI: 148657664
COG category: [L] Replication, recombination and repair
COG ID: [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon
TIGRFAM ID:
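
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4457051
END_BP = 4458418
GENE_LENGTH_BP = 1368
PROTEIN_LENGTH_AA = 455


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1368
print(expected_protein_length(GENE_LENGTH_BP))  # 455
```

Under those assumptions both values agree with the table: the span is 1368 bp, and 456 codons minus the stop codon give 455 amino acids.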


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTTG CCCTACAAGC CGATGCCCAA ACCCAGACCA TCAGCGCGAA CAAACTCATC 
TCCACCGACC CGCCCTACTA CGACAACATC GGCTATGCCG ACCTGTCGGA CTTCTTCTAC
GTCTGGCTGC GCCGCACGCT CAGACCGATC TTTCCCGACC TCTACGTCAC CCTGTCCACC
CCCAAGGCTG AGGAGCTGGT CGCGACCCCC TACCGTCACG GCAGCAAACA GGCCGCCGAG
CGTTTCTTCA TGGAAGGCAT GACCCGCGCG CTGCACAACC TGGCTGTACA GGCCCACCCC
GCCTTTCCGG TCACCATCTA CTACGCCTTC AAGCAGCAAG AAGTGAGGGA AGAGAAGAGA
GAAGAGAGAG AAGAGAAAAG AGAAGAGAGA GCGACGGGCG AACCGCGTAA TCCCGACTCT
CACTCCTCAC TCCTCTCCAC TCACTTCTCT CACTCCTCAC TCCTCTCCAC TCACTCCTCC
ACCGGCTGGG AAACCTTCCT GGAAGCGGTC ATCCAGGCCG GCTTCGCTAT CACCGGCACC
TGGCCCATGC GCACGGAGTT GGGAAACCGT ATCCTTGGGC AAGGCGCCAA CGCCCTCGCC
TCCAGCATCG TGCTGGTCTG CCGACCACGC CCGGCAGATG CCCCCATCGC CACCCGCCGC
GAGTTTGTCG CCGCGCTCAA AGCCGAACTG CCGGCGGCGC TGGCGGCGTT GCAACGCGCT
AACATCGCGC CGGTCGACCT GGCGCAGGCG GCGATCGGTC CGGGGATGGC GGTCTATACC
CGCTATGCCC GCGTGGTGGA CGCGCAGGGC AATCCGGTGC GGGTGCGCGA GGCGCTGGCG
CTGATCAATC AGGTGCTCGA CGAGGCGCTG AGTGAGCAAG AGGGTGATTT CGACGCCGAC
ACCCGTTGGG CGCTGGCCTG GTTCGAGCAG TATGGCTTTG CCGAAGGCGA GTACGGCGTG
GCCGAGACGC TCTCGAAAGC CAGGAATACG AGCGTCGAGG GGCTGGTCGC CGCCGGGATG
GTCGAAGCGA AACGGGGCAA GGTGCGCCTG CTCACACCGG CGGAACTCCC GGCCGCCTGG
GACCCGGCCG GTGATAGCCG GGTCACGCAT TGGGAAGCGG TCCATCACCT GATCCGGGTG
CTGGAGACCG GCGGTGAAAT GCAGGCGGCG GATCTGGCGG CGAAGCTGGG CAGTCGGGCT
GATGTGGCCC GCGAGCTGGC GTACCGGCTC TACACCATCT GCGAGCGCAA GAAGCGCCCG
GATGAAGCCT TTGCCTACAA CGCCCTGGTG CAGAGCTGGG GGGAGATTGC GCGGCTGGCG
TGGGAGCGGC GCAGTGATGC GCCGGTTCAG ATGAGTTTTG AAGAGTGA
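
A GC content figure such as the one in the Gene Information section is the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, not necessarily how the database derived its value. The function name `gc_content` is hypothetical, and whitespace is stripped because the sequence is displayed in space-separated blocks of ten bases.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for display formatting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence, used only as a small example.
first_block = "ATGGGATTTG"
print(gc_content(first_block))  # 40.0
```

Passing the full gene sequence as one string (spaces and line breaks included) gives the value for the whole CDS.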
 
Protein sequence
MGFALQADAQ TQTISANKLI STDPPYYDNI GYADLSDFFY VWLRRTLRPI FPDLYVTLST 
PKAEELVATP YRHGSKQAAE RFFMEGMTRA LHNLAVQAHP AFPVTIYYAF KQQEVREEKR
EEREEKREER ATGEPRNPDS HSSLLSTHFS HSSLLSTHSS TGWETFLEAV IQAGFAITGT
WPMRTELGNR ILGQGANALA SSIVLVCRPR PADAPIATRR EFVAALKAEL PAALAALQRA
NIAPVDLAQA AIGPGMAVYT RYARVVDAQG NPVRVREALA LINQVLDEAL SEQEGDFDAD
TRWALAWFEQ YGFAEGEYGV AETLSKARNT SVEGLVAAGM VEAKRGKVRL LTPAELPAAW
DPAGDSRVTH WEAVHHLIRV LETGGEMQAA DLAAKLGSRA DVARELAYRL YTICERKKRP
DEAFAYNALV QSWGEIARLA WERRSDAPVQ MSFEE