Gene RoseRS_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2008 
Symbol 
ID5208970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2489678 
End bp2491396 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content62% 
IMG OID640595615 
Productsingle-stranded-DNA-specific exonuclease RecJ 
Protein accessionYP_001276344 
Protein GI148656139 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00291083 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00122076 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCACAT CTCGATTGTC CGCACGCAAC AAACGCTGGC ACATTCTCGA AACGCCGCCG 
GAGTTCATCG CCGCCTGTCG CGCATTGCCG CCGTTGATCG CGGCGTTGCT GTATCAGCGC
GGGGTGCGCA GCGATGCTGC GATGCGCGAG TTCTTCAGCG CCGACTATCG CCTGAACGAT
CCTTTCAGCA TGCGCGGGAT GGAGACCGCG GTGCAACGTA TCGCCGCAGC CATTCACGAT
CACGAGTTGA TGGCGGTGTA CGGCGATTAC GATACCGACG GCGTGACCGC AGTGACGCTG
CTGGTGCAGG CGATCAGCGC CATGGGCGGG TTGATTCTGC CCTATGTTCC ACACCGCATC
CGTGAGGGGT ATGGCTTGAA TATCGAGGCC ATCGATGCGC TTGCCCGTGA AGGGGTTCGC
CTGCTGATCA CCGTCGATTG CGGCATCAGC AATGTACGCG AAGCCGCCCA TGCGCGCGCG
GTTGGCATCG ATCTGATCAT TACCGATCAT CACCATCCCC CCGCTCGCCT GCCGGATGCG
GTGGCGATCA TCAATCCCCG CCAGCCGGGG TGTTCCTATC CGTTCAAACA TCTGGTCGGC
GTCGGGATCG CCTTCCAGCT GGTGCGCGCA CTGGCGCGAC GCGGGTTCCG TTCTACCCTG
CAAAAGGACG ATCTCCTCGA TGTGGTCGCC CTCGGCACGG TGACCGATAT GGGTCCGCTG
ATCGGCGAGA ATCGCGTGCT GGTGACGCAC GGGCTGAAGG CGCTCAATGC GGCGCAGCGA
CCAGGGGTGC GCGCACTGAT CCAGGCTGCC GGCATGACTC CCGGCAGGGT GACCTCGACC
GACATCAGTT TCGGGTTGGG ACCGCGGTTG AATGCGTCCG GTCGCATCGA TAACGCGCGA
TTGAGTTATG AACTGTTGCT TGCAGAAGAG TTCGAGACGG CGCAACGCCT GGCGCACGAA
CTGAATCTTC AGAACCGTCA GCGCCAGGAG TTGTCGAAGA CGGTTCACGA ACAGGCCAGC
GCCCAGATTC AGGCGCTCGG CAAACAGAAC CAGCGGCTGA TCATCCTCGA TGGAACCGAT
TACCCCGCCG GTGTGGTCGG TCTGGTGGCA GCGCGTCTGG TCGAGGAGTA TGGGCGCCCG
ACGGTGCTGA TTGAACGCGG TGAGCAGTGG TCGCGCGGTT CGGCGCGCTC GGTGCCCGGA
TTCAGCATTA TCGATGCGTT GACCGACTGC GCCGATCTGT TCGAGCGTTT CGGCGGGCAT
ACCGAAGCAG CCGGGTTTAC CATCGCCACC GATCGGTTGC CGGCGCTTGA AGACCGCCTG
CTCCGCTATG CCGCAGAGCA TCTGAGCGAT GATCTGTTGA TGCCAGGGAT CACTATCGAC
GCCGAAGTTC CGCCAGCGTC GTTGTCGTAT GAATTGCTGG GGGAACTGGC AAAACTCGAA
CCGTTTGGGC ACGGGAATCC GCAACCGGTG TTGATGAGCC GACGCTTGCA GGTCACCGGT
GCCTGGCCGC GCGGCAGCGA GGGACAGCAC CTGAAACTCC GGTTGGTCGG CGCCGATGGC
AGTGGACCCT TCGACGCAAT TGCGTTCCGT TTCGGGCATC TGGCGCGCTA CTTCGAGCAG
CCACGCTGGA TAGATATTGT CTACACGCTC GAAGCCGATG AGTGGAACGG AAGCCCTTCA
CTTCAGTTGA ATGTCAAAGA TTTCAGGAGC GCGCGGTAA
 
Protein sequence
MSTSRLSARN KRWHILETPP EFIAACRALP PLIAALLYQR GVRSDAAMRE FFSADYRLND 
PFSMRGMETA VQRIAAAIHD HELMAVYGDY DTDGVTAVTL LVQAISAMGG LILPYVPHRI
REGYGLNIEA IDALAREGVR LLITVDCGIS NVREAAHARA VGIDLIITDH HHPPARLPDA
VAIINPRQPG CSYPFKHLVG VGIAFQLVRA LARRGFRSTL QKDDLLDVVA LGTVTDMGPL
IGENRVLVTH GLKALNAAQR PGVRALIQAA GMTPGRVTST DISFGLGPRL NASGRIDNAR
LSYELLLAEE FETAQRLAHE LNLQNRQRQE LSKTVHEQAS AQIQALGKQN QRLIILDGTD
YPAGVVGLVA ARLVEEYGRP TVLIERGEQW SRGSARSVPG FSIIDALTDC ADLFERFGGH
TEAAGFTIAT DRLPALEDRL LRYAAEHLSD DLLMPGITID AEVPPASLSY ELLGELAKLE
PFGHGNPQPV LMSRRLQVTG AWPRGSEGQH LKLRLVGADG SGPFDAIAFR FGHLARYFEQ
PRWIDIVYTL EADEWNGSPS LQLNVKDFRS AR