Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1899 |
Symbol | |
ID | 5208860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 2356599 |
End bp | 2359469 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640595508 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_001276238 |
Protein GI | 148656033 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0589249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGA TCTACATCGC AATCGATGTC GAAACGACCG GGCTTGAGGC GGGGGTTGAT GAGATTATCG AGATCGCTGC GGTCAAGTTC AACGCAGATG AGGTGCTCGA AACGTTCAGC ACCCTGGTGC AACCGGTCCA TTCACTCCCC CTTAATTCCA GCCGGTTGAC AGGCATCACT GCAGATATGC TCGCCAGCGC CCCGCGCTTT GCCGAAGTTG CGCCACGTTT TGCCGCGTTC CTCAAGAATT ACCCGCTCGT CGGGCACAAT GTCGAGTTCG ACCTGCGTAT GCTGCGCGCT CAGGGGATGC GACTGCCCCA ACCAGCATTC GATACGTTTG AACTGGCAAC GTTGCTGATG CCGCGCACGC CAGCGTACCG CCTTAGCGCG CTCGCGGCAA CCCTGGGTAT TCATCACGAT GAAGCTCACC GCGCGCTGAG CGACGCAGAT GTAACCCGTC AGATCTTCCT CTACCTGCTG CGCCGGATTG AGGCGCTCCC CCTGGACGAT CTCAATGAGA TCATGCGCCT GACCAGCCAG ATTGAGTGGG GTCTGCGGAG CCTGTTCGAG GAAGCGCAAC GCGCAAAAGC GCGACGCGTC TTTGTGGACG CAATACCCAT CCAGGCAGAC CGTTACGACG ATCGTGATGA GAAGATCGTG CCGCTGAAAC CGACCGGTGA TGAGCGCCCG ATCGATCTGG CGGACATCCG TCGGTTCTTC AGTCCTGATG GCGCGCTCGG TCGCGCGTTT GCGGGGTATG AACAGCGCGA TCAGCAGGTG CGTATGTCCG AAGCGGTCGC CGATACGTTC AACCAGGGAG GGGCGCTGAT CGCCGAAGCC GGCACCGGCA CCGGCAAAGG GCTGGCGTAC CTGGTTCCTG CGGCGCTGCA CGCTGCGCGC CGTGGCGAAC GGGTGGTGAT TTCGACCAAT ACGATCAATC TGCAAGATCA ACTCTTCTTC AAAGACATCC CGGCGTTGCA ACAGGTGATG TCCAACGGCG CCGACGAAGC GCCGTTCACT GCGGCGCTGC TCAAAGGGCG CAGCAACTAT CTCTGTCTCA AGCGCTATAA AGACCTGCGC CGCGATCAGC GGTTGATGTC GGACGATGTG CGCGCGCTGC TCAAGGTGCA GTTGTGGCTG CCGACGACCG AAAGCGGGGA TCGGGCGGAA CTGCCGCTTC ACGAGGGCGA GAATGCGTCC TGGAACAGGA TGAGCGCCGC CTGGGAACAG TGCACCGGTC CACGGTGCAG CGAGTTTCAG CGCTGCTTCT TCTTCAAGGC GCGCAAACAG GCGGAAGCGG CGCACCTGGT GATCGTCAAC CACGCGCTGC TCATGGCGGA TCTGGCGGTC GAAAACGATG TCATCCCACC CTACGACTAC CTCATCATCG ATGAGGCGCA CAATCTGGAA GACGTGGCTA CCGACCAGTT GAGTTTCAAT GTTGATCGTG AAGGATTGCT TGCATTTCTC GACGATATTT TCACCGAAGG GCAGGCGCAG GTGGTCGGCG GTCTGCTGAG CGAACTGCCC AACCACTTCC GCGAAAGTAT GGCAACCCGC ACCGACATCG ACCGCGCCGA TGCGATTGCC GATACGTTGC GCCCGGCAGT GGCGCGCGCG CGCGATGCGG TGTATGGGTG CTTCAATGTG CTGATGACAT TCATCAAGCG CGAAGCCGAA CTGACCGCCT ATGACTCGCG GTTGCGCATC ACCGATTCGC TGCGACGCAA ACCGTCGTGG GCAGAGGTCG AACGCGCCTG GGACGCGCTC AGCGTGGCGC TCCACGCTGT CGGCGAGGGG CTGGGGCGTC TGGAGACGTT GTTGATCGAC CTTAAGGATG CAGAATTACT GGAGTACGAC GCCCTGATGC TACGGGTGCA GGCGCTCAGG CGCTACGCCA CCGACGTGCG CGTCAATATC GGGCATATCC TGACCGGCGG CGCGGAAGAA AAGGTCACCT GGCTGACTCA CGATCGTATG CGCGATACGC TGACTCTTTC CGCAGCGCCG TTGTCGGTCG CTGACGTGCT GCGCACCAAC CTGTTCGAAG CGAAGACTGC GACCATCCTG ACCTCGGCGA CGCTGTCGGT CGGCGGCAAC TTTGCCTTCA TCCGTGAACG CATTGGTCTT GATCACGCCG AAGAGTTGAC GCTGGACTCG CCGTTCGACT ATACCCGGCA GGCGTTGCTC TACATCCCGC AGGATATTCC CGAACCAAAC CAGAATGGCT ACCAGCGGGC GCTGGAGCAG GCGATCATCG ACCTGGCGCG CGCGACCGGC GGTCGAATGC TGGTGCTGTT TACCGCTACA AATGCGCTAC GCCAGACGTA CCGCGCCATC CAGGAGCCGC TGGAAGATGC AGGGATCGCG GTGCTCGGTC AGGGGATCGA CGGTTCGCGC CGCAGTCTGC TCGAACGTTT CAAGGAGTTT CCCGGTACCG TGCTGCTCGG CACGTCGAGT TTCTGGGAAG GGGTTGATGT CGTCGGCGAT GCGCTCTCGG TGCTGGTGAT CGCCAAACTC CCTTTCAGCG TGCCCACCGA CCCGATCTTT GCAGCGCGGT CGGAACAGTT CGACGATCCC TTCAACCAGT ATGCAGTGCC ACAGTCGATC CTGCGCTTCA AGCAGGGGTT CGGGCGCCTG ATCCGTTCCA GAGAGGATCG CGGCGTGGTG GCAGTCCTCG ACCGTCGCCT GCTGACGAAG AAGTATGGAC AGATGTTCCT TGAGTCGCTG CCGCATACCA CCGTGCGCAG CGGACCGTTG CAACGCCTGC CCGATCTTGC GAAACGCTTC CTGGCGGTCG GCAACGGTGC GATCAACGGC GCTCCCGGCG CAACGGCAAC CGCTTCTGAA ATGAAACGCA CGCAGCGATA A
|
Protein sequence | MNQIYIAIDV ETTGLEAGVD EIIEIAAVKF NADEVLETFS TLVQPVHSLP LNSSRLTGIT ADMLASAPRF AEVAPRFAAF LKNYPLVGHN VEFDLRMLRA QGMRLPQPAF DTFELATLLM PRTPAYRLSA LAATLGIHHD EAHRALSDAD VTRQIFLYLL RRIEALPLDD LNEIMRLTSQ IEWGLRSLFE EAQRAKARRV FVDAIPIQAD RYDDRDEKIV PLKPTGDERP IDLADIRRFF SPDGALGRAF AGYEQRDQQV RMSEAVADTF NQGGALIAEA GTGTGKGLAY LVPAALHAAR RGERVVISTN TINLQDQLFF KDIPALQQVM SNGADEAPFT AALLKGRSNY LCLKRYKDLR RDQRLMSDDV RALLKVQLWL PTTESGDRAE LPLHEGENAS WNRMSAAWEQ CTGPRCSEFQ RCFFFKARKQ AEAAHLVIVN HALLMADLAV ENDVIPPYDY LIIDEAHNLE DVATDQLSFN VDREGLLAFL DDIFTEGQAQ VVGGLLSELP NHFRESMATR TDIDRADAIA DTLRPAVARA RDAVYGCFNV LMTFIKREAE LTAYDSRLRI TDSLRRKPSW AEVERAWDAL SVALHAVGEG LGRLETLLID LKDAELLEYD ALMLRVQALR RYATDVRVNI GHILTGGAEE KVTWLTHDRM RDTLTLSAAP LSVADVLRTN LFEAKTATIL TSATLSVGGN FAFIRERIGL DHAEELTLDS PFDYTRQALL YIPQDIPEPN QNGYQRALEQ AIIDLARATG GRMLVLFTAT NALRQTYRAI QEPLEDAGIA VLGQGIDGSR RSLLERFKEF PGTVLLGTSS FWEGVDVVGD ALSVLVIAKL PFSVPTDPIF AARSEQFDDP FNQYAVPQSI LRFKQGFGRL IRSREDRGVV AVLDRRLLTK KYGQMFLESL PHTTVRSGPL QRLPDLAKRF LAVGNGAING APGATATASE MKRTQR
|
| |