Gene RoseRS_3154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3154 
Symbol 
ID5210124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3969287 
End bp3971296 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content61% 
IMG OID640596745 
Productexcinuclease ABC, C subunit 
Protein accessionYP_001277465 
Protein GI148657260 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACC TGACGCATAC CGCAATCGCG GATCCGACCG CCTTCGAAGA ACGGTTGCGC 
GCTGTTCCGC TCGCGCCCGG CGTCTATCTC TGGAAGAACG CCCAGGGGAA GATCATCTAC
GTTGGCAAGA GCAAGCGTCT GCGCGATCGG ATGCGATCCT ACTTCAACCA GACTGGCGAT
CCTTATGGGA AGACGGCACG ACTGGTCGAA CAGATCGCCG ATTTTCAGGT GATCGTCACG
TCCACCGAAC TCGAAGCGCT GTTGCTCGAA ATGACCCTCA TCAAGCAGCA CCGTCCGCGC
TTCAATGTGC TGTTGAAGGA CGACAAGAGT TACCCGTACA TCAAAGTAAC GTTGCACGAA
ACCTGGCCCC GCATCTTTGC CACGCGCAAC CCGCGCTGGG AAGAGGGCGC GCGCTACTTC
GGACCCTACT CCAGCGCTGG AGCGGTCTAC CGCACGCTCG ACCAGCTCAA CCGGCTGTTC
GCCTTTCGCC CGCCGTCGCG CTGCCCCGAC GACAAGTTCA ACCGCCATCG GCGGATGGGT
AAGCCATGCC TGTACTACGA CATCAAGCGA TGCCTGGGTC CGTGCGTGCC GGGGCTGGTG
AACCAGGAAG AATACCGCGC CACCATTGAG TCGGTGTGTC GTTTTCTGGA AGGGAAGAGC
GACCTGGTGG TGAAAACGTT GCGCCGCCAG ATGGAAGAAG CGGCGGAACG GCTCGATTTC
GAGCGCGCCG CCCGGTTGCG CGACAGCATC CGCGACATCG AGTTGATCAG CCAGCGCCAG
CAGGTGCTGC GCCACGACGA TGCCGATCAG GACGTCATTG GCCTGGCGCG CGAGGAAGGG
ATGGCGGTCG TTCAGGTCTT GTGCATTCGC GCCGGGAAAC TGATCAGCGC CGAATCGTTT
CCGCTGCAAA ATGCCGAAGG GGAGCGCAAC GAAGACCTGC TTGCCTCATT CCTGACCCAG
TTCTACGATA CGGCTGCTGA ACTCCCCGCA GCGTTACTGT TGCCGATGCC GCTCGATGAA
CCGGCAGTGA TCGAGCAATG GCTGGCGCAG AAAGCGGGGC GCAAAATCAC ACTGCACACG
CCACAGCGCG GCGAAAAGCG TCGTCTCGTC GAACTGGCGG AACAGAACGC GCGCCAGAAA
CTCGAAGAAC TGCGGTTGCA ATGGCTCAAC AGCGAGCAAC GCGCCGTGGC AGGGTTGAGC
GAGGTGCGCG ATCTGCTTGG CTTGAGCGCA CTGCCGACGC GCATCGAATG CTTCGATGTC
TCGAACATCC AGGGAAGCCA CGCCGTCGGA GCAATGGTCG TCTTCGAGCG CGGCGAGCCA
AAGAAGAGCC GCTACCGCAA ATTCAGGATC AAAACCGTTC AGGGCGCCAA CGATGTCGCG
TCGATCCAGG AAATCCTGCG CCGCCGCTTC CGTCGCGCTG CGATGGTCAT CGGGGAAGAA
GAACAGCAAT CGGAAGCGCA AAGGATGAAC GGCAGCGTCA ACGGTCCGGC GGAAGCGGAC
GACCAGGAAG AAGCGGAGAA GACCGACACG CCGGGGTCGC ACGCCGACAT TGAACGCCAG
GAAACCTGGG CGGAACTGCC CGATCTGATC CTGATCGATG GCGGCGTCGG ACAACTGAAC
GGAGCGATCC AGGCGCTGCG CGATCTGCGC TTCGACCACA TTCCTGTGGT TGGGGTGGTC
AAAGGACCGA ACCGTGATCG CTTCGACCTG CTGCTCCCCG GTGCGAGTGA TGTCATCGTG
CTTGAACGCG ACAGCGCTGC GTTGCGCCTT ATCCGCATGA TCGACGAAGA AGCCGACCGT
TTCGCAAAAG ACTACCACCG CAAACTGCGC AGCAAATCGG CGACATCGTC GCGGCTCGAA
GAAATCCCTG GCATCGGTCC GAAGCGGCGA CAGTTGCTGC TGAAGCGCTT TGGGTCACTC
GAAGGCATTC GCAACGCCAC CGTCGATGAA CTCGCCGCCG TGCCGGGCAT GACGCGCAAA
GCCGCCGAAG AGTTGAAGAG TATGCTGTAG
 
Protein sequence
MPDLTHTAIA DPTAFEERLR AVPLAPGVYL WKNAQGKIIY VGKSKRLRDR MRSYFNQTGD 
PYGKTARLVE QIADFQVIVT STELEALLLE MTLIKQHRPR FNVLLKDDKS YPYIKVTLHE
TWPRIFATRN PRWEEGARYF GPYSSAGAVY RTLDQLNRLF AFRPPSRCPD DKFNRHRRMG
KPCLYYDIKR CLGPCVPGLV NQEEYRATIE SVCRFLEGKS DLVVKTLRRQ MEEAAERLDF
ERAARLRDSI RDIELISQRQ QVLRHDDADQ DVIGLAREEG MAVVQVLCIR AGKLISAESF
PLQNAEGERN EDLLASFLTQ FYDTAAELPA ALLLPMPLDE PAVIEQWLAQ KAGRKITLHT
PQRGEKRRLV ELAEQNARQK LEELRLQWLN SEQRAVAGLS EVRDLLGLSA LPTRIECFDV
SNIQGSHAVG AMVVFERGEP KKSRYRKFRI KTVQGANDVA SIQEILRRRF RRAAMVIGEE
EQQSEAQRMN GSVNGPAEAD DQEEAEKTDT PGSHADIERQ ETWAELPDLI LIDGGVGQLN
GAIQALRDLR FDHIPVVGVV KGPNRDRFDL LLPGASDVIV LERDSAALRL IRMIDEEADR
FAKDYHRKLR SKSATSSRLE EIPGIGPKRR QLLLKRFGSL EGIRNATVDE LAAVPGMTRK
AAEELKSML