Gene Rcas_1304 details

Gene Information

Locus tag: Rcas_1304
Symbol:
ID: 5538776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1682693
End bp: 1683949
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 62%
IMG OID: 640893442
Product: nuclease SbcCD, D subunit
Protein accession: YP_001431419
Protein GI: 156741290
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00619] exonuclease SbcD
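
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; the function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions: coordinates are 1-based and inclusive, and the gene length
    counts the terminal stop codon while the protein length does not.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(1682693, 1683949)
print(gene_len, protein_len)  # 1257 418
```

Under these assumptions the result matches the reported Gene Length (1257 bp) and Protein Length (418 aa).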


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.611964
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.601138
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTGC TTCATCTGGC AGACCTGCAC ATTGGCATCG AAAATTATGG TCGCGTTGAT 
CCATCTACCG GCTTGCACAG CCGTCTGCGC GACTACCTGG AGCGCCTGGA TGAGGCGATT
GACGTAGGAC TTGCCGAAGG CGTAGATGCG GTCCTTATCG CCGGTGATGT GTACAAGAAC
CGCACCCCCA ATCCCACACA GCAACGCGAG TTTGCGCGGC GCATCCATCG CCTGCGGGCG
TGCGGTCTGC CGGTCTTTAT CCTCATCGGC AACCATGATG TGTCGCCTGC CGCCGGACGC
GCACACGCCG TCGAAATCTT CGATACGCTG GCGGTCGATG GGGTGACGAT CGCCGATCGA
CCGCGCATCC ATACGCTACA GACACGATCC GGTCCACTTC AGGTCATTGC GCTTCCTTGG
GTGACGCGCC ACGCCCTGCT GACGAAAGAG GAATTGCGCA TGGCGTCGTT CCTGGAAATT
GAAACGATGC TGATCGAGCG GGTAGAACGC TTTCTGCGCC AGGCTGCCGA CGACCTTGAT
CCGGCGCTGC CGGCAGTGTT GACCGTGCAC GGCACGATTG ACGGCGCCAC GTTCGGCGCC
GAACGGCAGG TGCTGCTCGG GCGCGACCTG ATCTATCCGC GCAGCCTGAT GGCGCTGCCG
AACGTCGATT ATGTGGCGAT GGGGCATATC CACCGTCATC AGGCGCTTGG CGACCATCCG
CCCGTCGTCT ACCCCGGAAG CATCGAGCGG ATCGATTTTG GCGAGGAGGA CGAAGACAAG
GGATGTGTCA TCGTTGATCT GAAGGCGAAG GGTGAGGCGC ATTGGCGTTT TCACAAACTG
GCGGCGCGTC CATTCGTCAC AATTGCGGTC GATGTGCGCA GTATAAACGA CCCGATGCAG
CGCGTGCTGG CTGCCATCGA GCGGCGCTCG CTGCGCGGCG CAGTGGTGCG GGTGAAGATC
GACGCGCGTC CCGAACAGGC GGATGCGCTT CAGACTGAGG CGATCCGGCG TGCGCTCGAC
GATGCCGGCG CCTATGTCAT CGCTGCCGTG ACGGTCGAGG TCGAACGGAG CACCCGCGGA
CGGTTGGGAA ATAGTGACGC AAGCATCCTC GACGGATTGA CCCCGCGCCG CGCACTGGAA
TTGTACTTGC GCCAGAAAAC ACCGCCGCTC TCGGAAGAAC GTATCGCCGC GCTCCTCGCC
GCTGCCGATG AACTGCTGGC AGAAGGCGCT CAGGATCGCG TGGAACTGTT GTCCTGA
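
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any computation. A minimal sketch of that cleanup, together with a GC-content calculation, is shown below; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


first_line = "ATGCGCCTGC TTCATCTGGC AGACCTGCAC ATTGGCATCG AAAATTATGG TCGCGTTGAT"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the full gene sequence to `gc_percent` in the same way gives a value that can be compared with the GC content reported in the Gene Information section.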
 
Protein sequence
MRLLHLADLH IGIENYGRVD PSTGLHSRLR DYLERLDEAI DVGLAEGVDA VLIAGDVYKN 
RTPNPTQQRE FARRIHRLRA CGLPVFILIG NHDVSPAAGR AHAVEIFDTL AVDGVTIADR
PRIHTLQTRS GPLQVIALPW VTRHALLTKE ELRMASFLEI ETMLIERVER FLRQAADDLD
PALPAVLTVH GTIDGATFGA ERQVLLGRDL IYPRSLMALP NVDYVAMGHI HRHQALGDHP
PVVYPGSIER IDFGEEDEDK GCVIVDLKAK GEAHWRFHKL AARPFVTIAV DVRSINDPMQ
RVLAAIERRS LRGAVVRVKI DARPEQADAL QTEAIRRALD DAGAYVIAAV TVEVERSTRG
RLGNSDASIL DGLTPRRALE LYLRQKTPPL SEERIAALLA AADELLAEGA QDRVELLS
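
The protein sequence above can be reproduced by translating the gene sequence codon by codon. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table (it differs only in which codons may act as starts), and this gene begins with ATG. The function name `translate` is hypothetical, and only the first 30 bases and the last line of the gene are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGCGCCTGC TTCATCTGGC AGACCTGCAC"
gene_end = "GCTGCCGATG AACTGCTGGC AGAAGGCGCT CAGGATCGCG TGGAACTGTT GTCCTGA"
print(translate(gene_start))  # MRLLHLADLH
print(translate(gene_end))    # AADELLAEGAQDRVELLS
```

Both fragments agree with the corresponding ends of the protein sequence, and the final codon TGA is the stop codon that is counted in the gene length but not in the protein length.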