Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1545 |
Symbol | |
ID | 5539021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1972431 |
End bp | 1975265 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640893683 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_001431656 |
Protein GI | 156741527 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.676592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000191845 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGATCAAA TCTATGTAGC AATCGATGTC GAAACCACCG GTCTCGAGGC GGGAGTCGAT GAAATTATCG AAATTGCAGC GGTCAAGTTT CGCACCGGTG AGGTGATCGA AACGTTCGAC ACCCTCGTGC AACCGCGTCA TTCTCTGCCC CTTAATTCCA GCCGTCTGAC GGGCATCACC GCTGAGATGC TTGCCGGTGC GCCGCGCTTT TCGGAGGTCG CGCCGCGCTT CGCTGCGTTC CTCAAGAACT ATCCGCTCGT CGGGCACAAT GTCCGCTTCG ATATCAATAT GCTTCAGGCG CAGGGTATGC GCCTGCCGCA ACCGGCGTTC GACACCTTTG AACTGGCGAC GCTCCTGATG CCGCGCACAC CCGCCTATCG CCTCAGCGCA CTGGCGGAAA CGCTTGGCAT CGTTCACGAT GAGGCGCATC GCGCCCTTAG CGACGCCGAT GTGACCCGGC AGGTCTTTCT GCATCTGCTC CGGCGTATCG ATGCTCTCAG CCTGAACGAC CTGAATGAGA TTGTGCGCCT GACATCGCGT GTCGATTGGA CGCTGCGACC GCTCTTCGAA GCAGCGCAGC GCGCCAAGGC TCTGCGCGTC TTTGTGGACG AAACGCCGAT CAGCGACACT GATTCGCGGG AGTCAGACGA GAAACTGACA CCGCTCAAAC CAACCGGGAA TGACCGCCCA ATCGACCTGG CGGAAATCCG GTGGTTCTTC AGCCCTGCCG GTGCGCTTGG GCGCGCTTTC GAGGGGTATG AGCAGCGCAA TCAGCAGGTG CGGATGTCCG AAGCGGTCGC CGACACCCTT AATCAGGGCG GGACGCTGAT CGCCGAAGCC GGCACCGGCA CCGGCAAAGG TCTGGCGTAT CTGGTTCCGG CGGCGCTACA CGCCGCGCGC CGCGGCGAGC GCGTCGTCAT TTCGACCAAT ACGATCAATT TGCAGGATCA ACTCTTCTTC AAGGATATTC CGGCGCTTCA GAGGGTGATG TCCAACGGCG TGGACGACAA ACCGCCGTTT ACTGCGGCGT TGCTCAAGGG GCGCAGCAAT TATTTGTGCC TCAAACGGTA CCACGATCTG CGCCGTGATC GCGATCAGCG GCTGATGTCG GACGATGTGC GCGCGCTGCT CAAGGTGCAG TTGTGGCTGC ATGCGACCGA GAGCGGCGAC CGCGCGGAAT TGCCGCTTCA GGAAGGTGAA CATGCGACAT GGAGCAAATT GAGCGCCGCC TGGGATCAGT GCACCGGTCC GCGGTGCAGT GAGTTCCATC GTTGCTTCTT CTTCAAGGCG CGCCGGCAGG CGGAGGCCGC GCACCTGGTG ATTGTCAATC ACGCGCTTCT CGTGGCAGAC CTCGCAGCCG AAAATGATGT CATTCCGCCC TATGATTATC TCATTATCGA CGAGGCACAT AATCTGGAAG ATGTCGCCAC CGATCAGTTG AGTTTCAATG TTGATCGGGA AGGGCTGCTT GCGTTCCTCG ATGATATTTT TGTCGAAGAC CAGGCGCAGA TCGTCGGCGG GTTGCTGAGC GAACTGCCGA ACCATTTCCG CGAAAGCATG GTTACCCGGA TCGATATTGA CCGCGCCGAC ACGATCACGG CGGCGCTGCG TCCGGCGGTG GCGCGCGCGC GCGATGCGGT CTACGGGTGC TTCAACACGT TGATCGCGTT CGTCCGACGC GATGCCGAAC TGTCGGCTGC CGATGCACGC CTGCGCATCT CCAGCGCGCT GCGCCGCAAA CCGGCTTGGG CAGAGGTCGA ACGCGCCTGG GACATGCTCA ACAACGCGCT TACCGCCATC GGTGAGGGAT TGGGACAACT GGAAACGCTC CTGATCGACC TGAAGGACGC CGAATTGCCG GAGTATGATG CGCTGATGCT GCGGGTGCAG ACGCTCAAGC GGTATGCGAC CGAGGTGCGC ATTAATATCG GGCATATTCT GACCGGCGGC GCTGAGGAAA AAGTCACCTG GCTGACCCAC GACCGTCTGC GTGACACGTT GACCCTTTCC GCTGCACCCC TCTCCGTTGC CGAGATTTTG CGCACCAACC TGTTCGAGCG CAAAAGCGCT ACAGTACTGA CCTCGGCGAC GCTGTCGGTC GGCGGCGATT TCCGCTTCGT CCGCGAGCGC ATCGGCCTGG ATGAAGCCGA AGAACTGGCG CTCGAATCGC CGTTCGATTA CACCCGTCAG GCGCTCCTCT ATATTCCGAA CGATATTCCT GAGCCGTCAC ATCCGGGGTA TCAGCGCGCA ATGGAGCAGG CGATCATCGA CCTGGCGCGT GCGACGAACG GGCGCATGCT GGTGTTGTTT ACTGCCATCA ATGCGCTGCG GCAGACGTAT CGCGCCATTC AGGAACCGCT GGAAGACGCC GGGATTGCCG TGCTCGGTCA GGGGATCGAC GGCTCGCGCC GCAGTCTGCT CGAACGCTTC AAGGAGTTTC CCGGCACCGT GCTGCTCGGC ACATCAAGTT TCTGGGAAGG GGTCGATGTG GTCGGCGATG CGCTCTCGGT GCTGGTGATC GCCAAACTCC CCTTCAGCGT GCCGACCGAC CCGATCTTCG CCGCGCGGTC GGAGCAGTTC GACGATGCGT TCAATCAGTA CGCCGTTCCA CAGTCGATCC TGCGCTTCAA GCAGGGGTTC GGGCGCCTGA TCCGCTCAAA GGACGACCGC GGTATCGTGG CAGTGCTCGA CCGCCGCCTC CTGACGAAAA AATATGGGCA GACGTTTCTC GACTCATTGC CGCACACCAC CGTGCGCAGT GGTCCGTTGC AGCGCCTTCC CGACCTGGCA AAGCGTTTCC TGGCTGCGAC GAATGGTATG TCGGGGACGG CGTAG
|
Protein sequence | MDQIYVAIDV ETTGLEAGVD EIIEIAAVKF RTGEVIETFD TLVQPRHSLP LNSSRLTGIT AEMLAGAPRF SEVAPRFAAF LKNYPLVGHN VRFDINMLQA QGMRLPQPAF DTFELATLLM PRTPAYRLSA LAETLGIVHD EAHRALSDAD VTRQVFLHLL RRIDALSLND LNEIVRLTSR VDWTLRPLFE AAQRAKALRV FVDETPISDT DSRESDEKLT PLKPTGNDRP IDLAEIRWFF SPAGALGRAF EGYEQRNQQV RMSEAVADTL NQGGTLIAEA GTGTGKGLAY LVPAALHAAR RGERVVISTN TINLQDQLFF KDIPALQRVM SNGVDDKPPF TAALLKGRSN YLCLKRYHDL RRDRDQRLMS DDVRALLKVQ LWLHATESGD RAELPLQEGE HATWSKLSAA WDQCTGPRCS EFHRCFFFKA RRQAEAAHLV IVNHALLVAD LAAENDVIPP YDYLIIDEAH NLEDVATDQL SFNVDREGLL AFLDDIFVED QAQIVGGLLS ELPNHFRESM VTRIDIDRAD TITAALRPAV ARARDAVYGC FNTLIAFVRR DAELSAADAR LRISSALRRK PAWAEVERAW DMLNNALTAI GEGLGQLETL LIDLKDAELP EYDALMLRVQ TLKRYATEVR INIGHILTGG AEEKVTWLTH DRLRDTLTLS AAPLSVAEIL RTNLFERKSA TVLTSATLSV GGDFRFVRER IGLDEAEELA LESPFDYTRQ ALLYIPNDIP EPSHPGYQRA MEQAIIDLAR ATNGRMLVLF TAINALRQTY RAIQEPLEDA GIAVLGQGID GSRRSLLERF KEFPGTVLLG TSSFWEGVDV VGDALSVLVI AKLPFSVPTD PIFAARSEQF DDAFNQYAVP QSILRFKQGF GRLIRSKDDR GIVAVLDRRL LTKKYGQTFL DSLPHTTVRS GPLQRLPDLA KRFLAATNGM SGTA
|
| |