Gene Rcas_4093 details


Gene Information

Locus tag: Rcas_4093
Symbol:
ID: 5541604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5303902
End bp: 5306970
Gene Length: 3069 bp
Protein Length: 1022 aa
Translation table: 11
GC content: 63%
IMG OID: 640896205
Product: SMC domain-containing protein
Protein accession: YP_001434143
Protein GI: 156744014
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00618] exonuclease SbcC
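
The coordinates and lengths listed above are mutually consistent. The short Python sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 5303902
end_bp = 5306970

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 3069 1022
```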


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.240758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.708538
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCTC GAACGCTGAC ACTCCAGAAC TTCATGTGCT ACCGCGAAGG GTTGCCGCCG 
CTGGTGCTCG ACGGCATCTC GATTGCGTGT CTGGCGGGTG ACAATGGCGC CGGCAAATCG
GCGCTGCTCG ATGCGATCAC CTGGGCATTA TGGGGTGAAG CGCGTCTGAA GAGCGACGAT
GATCTGGTAG CGCTTGGCGC CACCGAGATG ATGGTGGACC TGGAGTTCAC CCTCGATGGG
CAGGATTATC GGGTGATCCG CCGCCGTATC CGCGGCAAGC GCGGCGGTCA GAGCCAGCTC
GACTTCCAGG TGCGCGATGA GAACGGCTGG CGCTCGCTGA CCCCAGGTGG CATTCGTGAA
ACGCAGCAGT TGATCATCCG CACGCTGCGC ATGGATTATG AGGTCTTCGC CAATTCGGCG
TATCTCCGCC AGGGACACGC CGATGAGTTC ACCCGCAAAG AACCGGCGAA ACGGAAGCAG
GTGCTGGCGG ATATTCTTGG TTTGAGCGTG TATGAGGACC TGGAAAGTCG GGCGAAGGAG
CGCGCGCGCG CCATCGAAGG GCAGATCCGC GGTCTCGAAG GTCAGATCGG CGAATTGCGC
CGCCAGGCGG AACGACGCGA CGTGCTCGTG GGGTTCGTGC GCGACGCTGA ACAGCGTGTC
GCAGATGCGC GGCAGCGCAT CACAGAAGCG GAACAGGCGT TCCAAGCGGC GGTTGCCAAA
GTGCAGGAAC TCGAAACAGT GCGCACTATC CGCGATAACC GTCAGGAGCA GATTCATCAG
CGCCGCGCCG AAAGGGATGC GCAGAAGCAG TGGCTGGATC GGCAGATGGA CATCCGGGAA
CGCGCTGAGG GCTGGATTGC GCGCCGTGCC GAGATTGAGG AAGGCATTCG CGCGCTGCGC
GCTGCCGAAG CAGAGCGTGA TCGCCTCGCC GCGCTGCGCG ATGAGTATGA TCGGCTTCAG
CAGCGTCGCG CGACCCTCGT GCAGGCGCTT GCCGAAGCCG AACACGCTCT CCGTGCCGAT
CTGCGCGTTG CTGAGACGCA GGTGCAGACG ATGCGCGAAC GTGCTGCTCG CCGCCCGAAA
CTGGTGGCGG AACGGGAGCG CCTGGCGGCA CAGTTGACGG ATCAGACGCC GATGATAGAG
GCGTTGTCGG CTGCGCGCAC CCGCCGTACT GACCTGACCG ACCGCCTCCG TCGCGTCAAT
GAGCTGCTGC GCCGCCGCAC GGAGCTGGAA GGGGATATCA AACTGAAACA CGACTCGCTG
GTCGCCACGC GCGAGGAGCA GAAGCGAATT CTGCGCACGC TGGCAGATCA ACTGAAACAC
GAAACGCGCT GGCGCGCCGA ACTTGCCGAA GCGCTCGCGG AACGCACCCG GATCAACGAA
GAAAAAGACT GCCTTGAAAT ACTCCGCAAC GATGAGCGCG TCCTTGCCGA GCAGGTTGGC
GGCATCCGCG CTGAATGCGA GACGGTTCGA CGACAGGGTG AGCAGATCAA CGAAAAACTA
CGCTTGCTTG GTCCTGATGT GACGGTGTGC CCGCTCTGCA AGAGCGAACT CGGTCACGAC
GGCATTGTCC ACATCCAGGC GGAATACGAG CGTGAGCGTC AGGCGCTGCG CCAGTGGTAT
GCCGCTGCCA AACGCGATGC CGATCAGCTC GAAGCGCAGC TCAAACGCCT GCGCAACGAC
ATTCGTGCTG CCGAGAACCG TATTGCTGCG CTTCCCGATC TTCAGGGGCG CATTGCGCGT
CTGGAAAGCG ACCTTGCCAG ATGCGACACA CTTCGCCAGC AACAGATCGA GGCGCAGCGC
CTGCATGATG ATGTGGCGAT GCGCCTGATG AAGAACGATT ATGAGTTGGC GGCGCGCGAA
GAGTTGAAGC GCATCGATGC CGAGATGACG GCGCTTGGCG CCATCGAGAC GCTCGAACGT
GAAATCGGCG CGCTTGATCG CCAGGTTGCC GCCCTCGAGA ATCGCAGCCG CGAACAGGCG
ACGCTTCAGG CGCAGGTCGA TGCGCTGCAC CGTGAAATCC GGCAGATCGA CGACGACGAC
CCGGCGTTGC ACGAACAGGA GCAGATTGTC GCGGAATTGA GCAGACGCCT GGCGCAGAAC
GATTTTGCGC ACGACGAGCG CGCAGCGCTT GCCACGCTCG ACGAGCAGAT TGCGGCGTTG
GGGTATAGCC GCGAACGGTA TGATCAGGCG AAGGCTGAGG CACAGGCGTT GACTCGTTGG
GAGGAAGACC TGACGCGCCT GCAACACGCT GAAGAGTGGA TTGCCGAGAA CGACGATGAC
ATCGCGCGCG CGGCGGAGCG CCTCCGGCAA CTTGACGCGC AGATCGCCTC CGACGAAGCC
GAGGTGCAAC GCCTCGATGA ACGCCTGCGC GACCTGGCGC CCGCCGCGCG CGCGCGCGAC
GCCGCCAGAG CCAGGCTCGA CGACCTCCAC CGCGAATTGC TGGCGTTCCA GAAGGACCTT
GGCGAACACG AGGCGAACCT GCGCCGCGCT GAGGAAGCTG CGCGCGGCCT CGCTGATGCC
GAAGCGCTCC GCCTGGCGCT TCTCGAACGG AAAAGTCTGT TCGATGAATT GACCCTGGCG
TTTGGCAAAA AGGGCGTTCA GGCGATGCTG ATTGAAACCG CTCTTCCCGA ACTGGAGCGT
GAAGCCAATC GCCTCCTCGA CCGGATGACC GATAATCAGT TGCATCTGAC GTTCGAGACG
CAGCGCGATA CGAAGAAGGG CGATGTCGTC GAAACGCTGG AGATCAAAAT CGCTGATGCG
CTCGGTACGC GCGTCTACGA CGCCTATAGC GGTGGTGAAG CGTTCCGTCT CGACTTCGCC
ATCCGGATCG CGCTCTCGAA ACTGCTGGCG CGCCGCGCTG GCGCGCGCCT CGAGACCCTG
ATCATCGATG AAGGGTTCGG CTCGCAGGAC GCGCGCGGAC GCGAACGCCT GGTTGAGGCG
ATCATCTCGG TTCAGCACGA CTTTCGCCGT GTGCTGGTGA TCACGCACAT TCAGGAATTG
AAGGATATGT TTCCTGTGCA GATCGAAATT GTCAAAACGC CGCACGGCAG CGTCTGGAGT
CTCGCGTGA
 
Protein sequence
MIPRTLTLQN FMCYREGLPP LVLDGISIAC LAGDNGAGKS ALLDAITWAL WGEARLKSDD 
DLVALGATEM MVDLEFTLDG QDYRVIRRRI RGKRGGQSQL DFQVRDENGW RSLTPGGIRE
TQQLIIRTLR MDYEVFANSA YLRQGHADEF TRKEPAKRKQ VLADILGLSV YEDLESRAKE
RARAIEGQIR GLEGQIGELR RQAERRDVLV GFVRDAEQRV ADARQRITEA EQAFQAAVAK
VQELETVRTI RDNRQEQIHQ RRAERDAQKQ WLDRQMDIRE RAEGWIARRA EIEEGIRALR
AAEAERDRLA ALRDEYDRLQ QRRATLVQAL AEAEHALRAD LRVAETQVQT MRERAARRPK
LVAERERLAA QLTDQTPMIE ALSAARTRRT DLTDRLRRVN ELLRRRTELE GDIKLKHDSL
VATREEQKRI LRTLADQLKH ETRWRAELAE ALAERTRINE EKDCLEILRN DERVLAEQVG
GIRAECETVR RQGEQINEKL RLLGPDVTVC PLCKSELGHD GIVHIQAEYE RERQALRQWY
AAAKRDADQL EAQLKRLRND IRAAENRIAA LPDLQGRIAR LESDLARCDT LRQQQIEAQR
LHDDVAMRLM KNDYELAARE ELKRIDAEMT ALGAIETLER EIGALDRQVA ALENRSREQA
TLQAQVDALH REIRQIDDDD PALHEQEQIV AELSRRLAQN DFAHDERAAL ATLDEQIAAL
GYSRERYDQA KAEAQALTRW EEDLTRLQHA EEWIAENDDD IARAAERLRQ LDAQIASDEA
EVQRLDERLR DLAPAARARD AARARLDDLH RELLAFQKDL GEHEANLRRA EEAARGLADA
EALRLALLER KSLFDELTLA FGKKGVQAML IETALPELER EANRLLDRMT DNQLHLTFET
QRDTKKGDVV ETLEIKIADA LGTRVYDAYS GGEAFRLDFA IRIALSKLLA RRAGARLETL
IIDEGFGSQD ARGRERLVEA IISVQHDFRR VLVITHIQEL KDMFPVQIEI VKTPHGSVWS
LA
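
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The following is a minimal Python sketch, not the database's own method: it builds the standard codon-to-amino-acid assignments (translation table 11 uses the same assignments and differs only in which codons may act as starts) and translates the first 60 and last 9 nucleotides of the gene sequence. The function name `translate` and the constants are hypothetical helpers introduced for illustration.

```python
# Build the codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nucleotides of the gene sequence (first sequence line).
head = "ATGATCCCTC GAACGCTGAC ACTCCAGAAC TTCATGTGCT ACCGCGAAGG GTTGCCGCCG"
# Last 9 nucleotides of the gene sequence, ending in the stop codon TGA.
tail = "CTCGCGTGA"

print(translate(head))  # MIPRTLTLQNFMCYREGLPP
print(translate(tail))  # LA
```

The two outputs match the first 20 residues and the final 2 residues of the protein sequence listed above.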