Gene RoseRS_4245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4245 
Symbol 
ID5211230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5326315 
End bp5327421 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content59% 
IMG OID640597834 
Productrestriction endonuclease 
Protein accessionYP_001278538 
Protein GI148658333 
COG category[V] Defense mechanisms 
COG ID[COG1715] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00114082 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.143836 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGTC GTCGTTCACG TCACTCGCCC GATGCCGGGT CTGCCATCGG CGCTCTTTTC 
CTGCTCGGAT TGATCGGATT TCGTCCTTTC TGGCAAACCG TAACGAACCT TGCCCTCCCC
TGGCAGATCG CTGCTGCGGT GCTGATCCTC TCGGTTGCGT TTGTGATCCT GTGGTTTGTG
CGTCTTTTGA TACGTCATGC GCGCCAGCGG AGCCTGGTGC GTAAGGAACT GTACGCGCTC
ACGCCGACCG AATTTGAAGA ACGGGTGCTG CTCTTGTTGA AAGACCTGGG CTGGAGTCAT
CTCAGATTGC GCGGCGGCAG TGGAGATCGC GGCGTTGATC TGGAAGGCGA GTTTCAGGGT
ACACGGTATG TCGTCCAGTG TAAGCGCTAC CATCACAATA AGTCGGTTTC TCCCTCAGCG
GTGCGCGATC TCGTCGGAGC GTTGCACATT CAGAAAGCCG ACCGCGCATT GCTGGTGACG
ACAAGTTCGT TTACGCCGCA GGGGTATGCC GAGGCGCGCG ATCAGGCAGT GGAACTGTGG
GATGGCGCTA TTCTGGAGCA GAAGATAAGC GAAGCTGCCA GGTTGCGTGA AGACCCGACA
CGTAGACAGG CGGTGCAACG GCGACGCCTT GCAACATTCA TTACACTGGT GGTGATCAAC
GGGTTAAGCG TTCTGTCAGC ATTTGCGATT GCAGGTCCGC CATCGTCTGC GCCACCAACT
ATTCGCACAG CGCCGACGCC CTCTCCTGAA AGTATTGCCG GATCACCTCT GGGAAGAACC
GCATCTTCTC TTCCCTTGCC TACCCAGACT CCTTCTTCGG AAGAACCTCA ACCGACAGCG
CTGCCGACAC CGACGGTGGC GCCGACAGAA CCCCCCGTCC CGACCACAAC CGTTTTCAAT
GGCGGGAATG TGCGCGCTGC GCCGAATATG CGGGGCACGG TGCTCGATCA GGTGCACGCT
GGCGAGATTG TCGAACTGCT CGGTCGTTCG GCGGACGGAA ACTGGCTCTA TATCCGCAAT
CCGCGCGGTC AGGTTGGCTG GACGCACCGC ACCCTGCTGA CTCTCGAAGC AGACATCAGT
GAGCGTCTGG AGGTGGTAGC GCCGTGA
 
Protein sequence
MSRRRSRHSP DAGSAIGALF LLGLIGFRPF WQTVTNLALP WQIAAAVLIL SVAFVILWFV 
RLLIRHARQR SLVRKELYAL TPTEFEERVL LLLKDLGWSH LRLRGGSGDR GVDLEGEFQG
TRYVVQCKRY HHNKSVSPSA VRDLVGALHI QKADRALLVT TSSFTPQGYA EARDQAVELW
DGAILEQKIS EAARLREDPT RRQAVQRRRL ATFITLVVIN GLSVLSAFAI AGPPSSAPPT
IRTAPTPSPE SIAGSPLGRT ASSLPLPTQT PSSEEPQPTA LPTPTVAPTE PPVPTTTVFN
GGNVRAAPNM RGTVLDQVHA GEIVELLGRS ADGNWLYIRN PRGQVGWTHR TLLTLEADIS
ERLEVVAP