Gene Rcas_3530 details


Gene Information

Locus tag: Rcas_3530
Symbol:
ID: 5541029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4598986
End bp: 4600998
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 60%
IMG OID: 640895648
Product: excinuclease ABC subunit B
Protein accession: YP_001433598
Protein GI: 156743469
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
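
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is an untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4598986
end_bp = 4600998
stated_gene_length = 2013     # bp
stated_protein_length = 670   # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, plus a final stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, gene_length == stated_gene_length)           # 2013 True
print(protein_length, protein_length == stated_protein_length)  # 670 True
```

Under these assumptions the stated gene length and protein length agree with the coordinates.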


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.220409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATTCC GCATCGAAGC GCCCTTTCAA CCGGCCGGCG ATCAGCCGAA GGCGATCGCG 
CAACTGGTCG AGGGGATTCG CGCAGGATAT AAACATCAGA CATTGCTCGG CGCGACCGGC
ACCGGCAAGA CGCTCACCAT GGCGCATGTC TTCAGTCAGT TGGAGCGTCC GGCGCTGGTC
ATGGCGCACA ACAAAACACT GGTCAGTCAG TTGTACGCCG AGTTTCGCGA ACTGCTGCCC
GACGCGGCGG TGGAAATGTT CATTTCGTAC TACGACATGT ACACCCCCGA AGCGTATGTG
CCGAGCAAAG ACCTGTACAT TGAGAAGGAA GCCGAGATCA ACGAAGAGAT CGACCGCCTG
CGCCACGCGG CGACGCAGGC TCTCTTCACC CGGCGCGATG TCCTGATTGT CGCGTCGGTG
TCGGCGATCT ACGGTCTCGG TTCGCCCCAT GAGTATGGTC AGGTCGTCAT TCCTATCCGC
GTCGGCGAGG TGCGCAACCG CGATAAACTG CTGCGTCAGT TGCTCGACCT GCAATTCGAG
CGCAACGATA TGGACTTTCA TCGCGGCACG TTCCGTGTGC GTGGCGATAC CCTCGACATC
TTCCCCGCCA ATCAGGAGAT TGCGCTTCGC GTCGAGTTCT GGGGCGATGA TGTCGAGCGT
ATTACCGAGT TCGACCCACT GACCGGCGAG GTGCTGATCG AGCGCACGGC GGTGAATATC
TATCCGGCGA AGCACTTCAT TACAACCGCC GAAACGCTCA AACTGGCGAT CACCGACATT
CAGGCGGAAC TGGCAGTGCG TCTTGCCGAA CTCGAGCAGC AGGGGAAACT GCTCGAAGCA
GCGCGCCTCA AGCAGCGCAC CAACTACGAC CTCGAAATGC TCTCCGAGGT CGGCTACTGT
TCCGGTATCG AGAACTACTC GCGGCACCTC GACCGTCGCG CGCCCGGGCA GACGCCGTGG
ACGCTGCTGG ACTATTTTCC CGATGATTTC ATTCTCTTCA TCGATGAGTC GCACATAACG
CTGCCGCAGA TTCGCGGCAT GTACGCCGGC GACCGATCGC GCAAGGAGAC GCTCGTCGAC
TACGGCTTCC GCCTCCCCTC CGCGCTCGAC AACCGACCGC TGCGATTCGA TGAGTTCGAG
CGCCATATCC ATCAGGTGAT CTATGTCTCG GCGACGCCGG GACCCTATGA ATATGAACAT
TCGCAGCAGA TCGTCGAGCA GATCATTCGC CCGACCGGTC TGCTCGATCC GACGGTCGAG
GTGCGCCCGA CGCGCGGACA GATTGACGAT CTGGTCGGCG AGATCAAGCG GCGGGTGCAG
AAAGGGCAGC GCGCACTGGT CACGACGCTG ACCAAGCGCA TGGCGGAGGA CCTGGCGGAT
TACCTGAAGG AAATGGGCAT CCGCACCAGT TATCTGCACT CCGACATCGA GACGCTCGAA
CGGGTCGAAA TTCTGCGCGA CCTGCGGCTT GGCGTGTACG ATGTGGTTGT TGGCATCAAC
CTGCTGCGCG AAGGTCTCGA CCTGCCGGAA GTATCGCTGG TCGCTATTCT CGACGCCGAC
AAGGAAGGAT ACCTGCGCAG CGGATCGTCG CTCATTCAGA TTATCGGGCG CGCCGCGCGG
CACATCGAGG GCGCGGTGAT TATGTACGCC GATACGATCA CGCCATCGAT GAAGTTCGCC
ATCGACGAAA CCAACCGTCG TCGTGCCATC CAGGAAGCGT ACAACCGCGA GCACAACATC
ACGCCGGTCG GCATCTCGAA AGCCGTGCGC GACCTGACCG ACCGGGTGCG TAAAGTCGCC
GAGGAGCGTG GTGTCTACCA GGCTGCCGTA CCAGGGGAAG AATTGCCCAT TCCGAAGGAC
GAGATCGTCA AACTGATCAA GGAACTGGAG AAGCAGATGA AGCAGGCGGC GAAGGAGTTG
GCGTTCGAGA AGGCTGCTGC CCTGCGTGAT CAGATCATCG AACTGCGGCG CACCCTGGCG
CTTGATGAGG AGCAGGCGCC GGCGCACTCG TAG
 
Protein sequence
MSFRIEAPFQ PAGDQPKAIA QLVEGIRAGY KHQTLLGATG TGKTLTMAHV FSQLERPALV 
MAHNKTLVSQ LYAEFRELLP DAAVEMFISY YDMYTPEAYV PSKDLYIEKE AEINEEIDRL
RHAATQALFT RRDVLIVASV SAIYGLGSPH EYGQVVIPIR VGEVRNRDKL LRQLLDLQFE
RNDMDFHRGT FRVRGDTLDI FPANQEIALR VEFWGDDVER ITEFDPLTGE VLIERTAVNI
YPAKHFITTA ETLKLAITDI QAELAVRLAE LEQQGKLLEA ARLKQRTNYD LEMLSEVGYC
SGIENYSRHL DRRAPGQTPW TLLDYFPDDF ILFIDESHIT LPQIRGMYAG DRSRKETLVD
YGFRLPSALD NRPLRFDEFE RHIHQVIYVS ATPGPYEYEH SQQIVEQIIR PTGLLDPTVE
VRPTRGQIDD LVGEIKRRVQ KGQRALVTTL TKRMAEDLAD YLKEMGIRTS YLHSDIETLE
RVEILRDLRL GVYDVVVGIN LLREGLDLPE VSLVAILDAD KEGYLRSGSS LIQIIGRAAR
HIEGAVIMYA DTITPSMKFA IDETNRRRAI QEAYNREHNI TPVGISKAVR DLTDRVRKVA
EERGVYQAAV PGEELPIPKD EIVKLIKELE KQMKQAAKEL AFEKAAALRD QIIELRRTLA
LDEEQAPAHS
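
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons directly. The following is a minimal sketch, not the database's own procedure: the `translate` helper and `CODON_TABLE` are hypothetical names, and the table uses the standard amino-acid assignments, on the understanding that translation table 11 differs from the standard code only in its permitted start codons. Only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

# Standard codon ordering (T, C, A, G at each position) and the matching
# amino-acid string; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 33 bases of the gene sequence in this record.
first_block = "ATGTCATTCC GCATCGAAGC GCCCTTTCAA"
last_block = "CTTGATGAGG AGCAGGCGCC GGCGCACTCG TAG"

print(translate(first_block))  # MSFRIEAPFQ
print(translate(last_block))   # LDEEQAPAHS
```

Both results match the first and last ten residues of the protein sequence listed in this record.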