Gene Rcas_1545 details

Gene Information

Locus tag: Rcas_1545
Symbol:
ID: 5539021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1972431
End bp: 1975265
Gene Length: 2835 bp
Protein Length: 944 aa
Translation table: 11
GC content: 61%
IMG OID: 640893683
Product: DNA polymerase III, epsilon subunit
Protein accession: YP_001431656
Protein GI: 156741527
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases; [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family; [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative
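
The start, end, and length fields in this record are related by simple arithmetic. The following minimal Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length_bp = gene_length(1972431, 1975265)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the result agrees with the listed Gene Length (2835 bp) and Protein Length (944 aa).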


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.676592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000191845
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGATCAAA TCTATGTAGC AATCGATGTC GAAACCACCG GTCTCGAGGC GGGAGTCGAT 
GAAATTATCG AAATTGCAGC GGTCAAGTTT CGCACCGGTG AGGTGATCGA AACGTTCGAC
ACCCTCGTGC AACCGCGTCA TTCTCTGCCC CTTAATTCCA GCCGTCTGAC GGGCATCACC
GCTGAGATGC TTGCCGGTGC GCCGCGCTTT TCGGAGGTCG CGCCGCGCTT CGCTGCGTTC
CTCAAGAACT ATCCGCTCGT CGGGCACAAT GTCCGCTTCG ATATCAATAT GCTTCAGGCG
CAGGGTATGC GCCTGCCGCA ACCGGCGTTC GACACCTTTG AACTGGCGAC GCTCCTGATG
CCGCGCACAC CCGCCTATCG CCTCAGCGCA CTGGCGGAAA CGCTTGGCAT CGTTCACGAT
GAGGCGCATC GCGCCCTTAG CGACGCCGAT GTGACCCGGC AGGTCTTTCT GCATCTGCTC
CGGCGTATCG ATGCTCTCAG CCTGAACGAC CTGAATGAGA TTGTGCGCCT GACATCGCGT
GTCGATTGGA CGCTGCGACC GCTCTTCGAA GCAGCGCAGC GCGCCAAGGC TCTGCGCGTC
TTTGTGGACG AAACGCCGAT CAGCGACACT GATTCGCGGG AGTCAGACGA GAAACTGACA
CCGCTCAAAC CAACCGGGAA TGACCGCCCA ATCGACCTGG CGGAAATCCG GTGGTTCTTC
AGCCCTGCCG GTGCGCTTGG GCGCGCTTTC GAGGGGTATG AGCAGCGCAA TCAGCAGGTG
CGGATGTCCG AAGCGGTCGC CGACACCCTT AATCAGGGCG GGACGCTGAT CGCCGAAGCC
GGCACCGGCA CCGGCAAAGG TCTGGCGTAT CTGGTTCCGG CGGCGCTACA CGCCGCGCGC
CGCGGCGAGC GCGTCGTCAT TTCGACCAAT ACGATCAATT TGCAGGATCA ACTCTTCTTC
AAGGATATTC CGGCGCTTCA GAGGGTGATG TCCAACGGCG TGGACGACAA ACCGCCGTTT
ACTGCGGCGT TGCTCAAGGG GCGCAGCAAT TATTTGTGCC TCAAACGGTA CCACGATCTG
CGCCGTGATC GCGATCAGCG GCTGATGTCG GACGATGTGC GCGCGCTGCT CAAGGTGCAG
TTGTGGCTGC ATGCGACCGA GAGCGGCGAC CGCGCGGAAT TGCCGCTTCA GGAAGGTGAA
CATGCGACAT GGAGCAAATT GAGCGCCGCC TGGGATCAGT GCACCGGTCC GCGGTGCAGT
GAGTTCCATC GTTGCTTCTT CTTCAAGGCG CGCCGGCAGG CGGAGGCCGC GCACCTGGTG
ATTGTCAATC ACGCGCTTCT CGTGGCAGAC CTCGCAGCCG AAAATGATGT CATTCCGCCC
TATGATTATC TCATTATCGA CGAGGCACAT AATCTGGAAG ATGTCGCCAC CGATCAGTTG
AGTTTCAATG TTGATCGGGA AGGGCTGCTT GCGTTCCTCG ATGATATTTT TGTCGAAGAC
CAGGCGCAGA TCGTCGGCGG GTTGCTGAGC GAACTGCCGA ACCATTTCCG CGAAAGCATG
GTTACCCGGA TCGATATTGA CCGCGCCGAC ACGATCACGG CGGCGCTGCG TCCGGCGGTG
GCGCGCGCGC GCGATGCGGT CTACGGGTGC TTCAACACGT TGATCGCGTT CGTCCGACGC
GATGCCGAAC TGTCGGCTGC CGATGCACGC CTGCGCATCT CCAGCGCGCT GCGCCGCAAA
CCGGCTTGGG CAGAGGTCGA ACGCGCCTGG GACATGCTCA ACAACGCGCT TACCGCCATC
GGTGAGGGAT TGGGACAACT GGAAACGCTC CTGATCGACC TGAAGGACGC CGAATTGCCG
GAGTATGATG CGCTGATGCT GCGGGTGCAG ACGCTCAAGC GGTATGCGAC CGAGGTGCGC
ATTAATATCG GGCATATTCT GACCGGCGGC GCTGAGGAAA AAGTCACCTG GCTGACCCAC
GACCGTCTGC GTGACACGTT GACCCTTTCC GCTGCACCCC TCTCCGTTGC CGAGATTTTG
CGCACCAACC TGTTCGAGCG CAAAAGCGCT ACAGTACTGA CCTCGGCGAC GCTGTCGGTC
GGCGGCGATT TCCGCTTCGT CCGCGAGCGC ATCGGCCTGG ATGAAGCCGA AGAACTGGCG
CTCGAATCGC CGTTCGATTA CACCCGTCAG GCGCTCCTCT ATATTCCGAA CGATATTCCT
GAGCCGTCAC ATCCGGGGTA TCAGCGCGCA ATGGAGCAGG CGATCATCGA CCTGGCGCGT
GCGACGAACG GGCGCATGCT GGTGTTGTTT ACTGCCATCA ATGCGCTGCG GCAGACGTAT
CGCGCCATTC AGGAACCGCT GGAAGACGCC GGGATTGCCG TGCTCGGTCA GGGGATCGAC
GGCTCGCGCC GCAGTCTGCT CGAACGCTTC AAGGAGTTTC CCGGCACCGT GCTGCTCGGC
ACATCAAGTT TCTGGGAAGG GGTCGATGTG GTCGGCGATG CGCTCTCGGT GCTGGTGATC
GCCAAACTCC CCTTCAGCGT GCCGACCGAC CCGATCTTCG CCGCGCGGTC GGAGCAGTTC
GACGATGCGT TCAATCAGTA CGCCGTTCCA CAGTCGATCC TGCGCTTCAA GCAGGGGTTC
GGGCGCCTGA TCCGCTCAAA GGACGACCGC GGTATCGTGG CAGTGCTCGA CCGCCGCCTC
CTGACGAAAA AATATGGGCA GACGTTTCTC GACTCATTGC CGCACACCAC CGTGCGCAGT
GGTCCGTTGC AGCGCCTTCC CGACCTGGCA AAGCGTTTCC TGGCTGCGAC GAATGGTATG
TCGGGGACGG CGTAG
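
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. Below is a small Python sketch of that cleanup together with a GC-percentage calculation that can be compared with the GC content field. The helper names are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, used only as sample input.
first_line = "ATGGATCAAA TCTATGTAGC AATCGATGTC GAAACCACCG GTCTCGAGGC GGGAGTCGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole gene sequence to `clean_sequence` and then `gc_percent` gives the figure for the full CDS.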
 
Protein sequence
MDQIYVAIDV ETTGLEAGVD EIIEIAAVKF RTGEVIETFD TLVQPRHSLP LNSSRLTGIT 
AEMLAGAPRF SEVAPRFAAF LKNYPLVGHN VRFDINMLQA QGMRLPQPAF DTFELATLLM
PRTPAYRLSA LAETLGIVHD EAHRALSDAD VTRQVFLHLL RRIDALSLND LNEIVRLTSR
VDWTLRPLFE AAQRAKALRV FVDETPISDT DSRESDEKLT PLKPTGNDRP IDLAEIRWFF
SPAGALGRAF EGYEQRNQQV RMSEAVADTL NQGGTLIAEA GTGTGKGLAY LVPAALHAAR
RGERVVISTN TINLQDQLFF KDIPALQRVM SNGVDDKPPF TAALLKGRSN YLCLKRYHDL
RRDRDQRLMS DDVRALLKVQ LWLHATESGD RAELPLQEGE HATWSKLSAA WDQCTGPRCS
EFHRCFFFKA RRQAEAAHLV IVNHALLVAD LAAENDVIPP YDYLIIDEAH NLEDVATDQL
SFNVDREGLL AFLDDIFVED QAQIVGGLLS ELPNHFRESM VTRIDIDRAD TITAALRPAV
ARARDAVYGC FNTLIAFVRR DAELSAADAR LRISSALRRK PAWAEVERAW DMLNNALTAI
GEGLGQLETL LIDLKDAELP EYDALMLRVQ TLKRYATEVR INIGHILTGG AEEKVTWLTH
DRLRDTLTLS AAPLSVAEIL RTNLFERKSA TVLTSATLSV GGDFRFVRER IGLDEAEELA
LESPFDYTRQ ALLYIPNDIP EPSHPGYQRA MEQAIIDLAR ATNGRMLVLF TAINALRQTY
RAIQEPLEDA GIAVLGQGID GSRRSLLERF KEFPGTVLLG TSSFWEGVDV VGDALSVLVI
AKLPFSVPTD PIFAARSEQF DDAFNQYAVP QSILRFKQGF GRLIRSKDDR GIVAVLDRRL
LTKKYGQTFL DSLPHTTVRS GPLQRLPDLA KRFLAATNGM SGTA
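
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation in plain Python. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which codons may act as start codons) and assumes the sequence is already in frame; the `translate` helper is illustrative, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in this record.
print(translate("ATGGATCAAA TCTATGTAGC AATCGATGTC"))  # prints MDQIYVAIDV
```

The output matches the first ten residues of the protein sequence. A dedicated library such as Biopython would normally be preferable for real work, since it also handles ambiguity codes and alternative start codons.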