Gene Information |
Locus tag | Dshi_3998 |
Symbol | |
ID | 5714527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009957 |
Strand | + |
Start bp | 60274 |
End bp | 63624 |
Gene Length | 3351 bp |
Protein Length | 1116 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641276910 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_001542206 |
Protein GI | 159046536 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
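The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Dshi_3998).
length = gene_length(60274, 63624)
print(length)                  # 3351
print(protein_length(length))  # 1116
```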
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGTT ATGCGGAGCT CTGCGTGACG AGCAATTTCA CCTTCCTGAC CGGCGCCTCG CATCCTGAGG AGCTGGTCAC GCGGGCTGCC GAACTGGGTC TGAAGGCCAT CGCCATCACC GACCGCAACT CGGTGGCAGG CGTGGTGCGG GCCTTCTCGG CCCTGAAAGA GCTGGCCCGA CTGCGCGAGG AGGCCCGTGC AGCGAGCGAC GGGGCCGAGG CCGGACCAGT GGTCCGATCC CGGCAAGTGA CCGATCATTC CAGCCGCCAG ACGATGCAGC ACATGCCGGC AGGCGATGCG CCCCGAATTC CGCCGGACAT GGTGTTGCCG AAGCTGATCC CTGGCGCCCG GATCGTGCTG ACCGACAGCC CGGTCGACTG GCTCGCCCTA CCCGCCGACA TCGCGGCCTG GTCGCGGCTG ACCCGGCTCC TGTCCCTCGG CAAGCGGCGG GCCACGAAAG GCGAATGCCA TCTGACGCGC AAGGATCTGT TGGATTGGGG CGAGGGCATG GTGCTGATCG CGCTGCCGCC TGACCCAATG GAGCACCCGG CCCGGGCATC TTTAAGCGAC CTGCGGCATA TGCTGCGCAT CTTCCCCGGC CAGTGCTTTC TTGGCGCGGC CCCCCGCTAT GACGGGCGAG ACCCAACCCG CCTCGACCAG CTGGCCCGGA TCGCCCAGGA CACCGGCCTG CCGCTCGTGG CCCTCGGCGA GGTGATGATG CATCGCTCGT CCAGGCGTCC GCTCGCCGAT GTCCTGACCT GCCTGCGCGA GGGCTGCACG ATCGACACGA TCGGGGAGCG CCGGCTGACC AATGGCGAGC ACCGGCTCAA ATCCCCGGCC GAGATGGCGC GGATGTTCCA TCGCTATCCG GCGGCGATCC GGCGGACACT GGAGATCGCC GACCGCTGCG CTTTCCGGCT GGACGATCTG CGCTATCAGT ATCCCGACGA GGCGCGAGAC GGCGAACCCG CGCAGGCGCG GCTCGAACGA TTGTCGCGGG AGGGGCTGCA CTGGCGCTAT CCCGAGGGGC CGCCGGCCCG GATCGTCACG CGGGTCGACA AGGAACTGAA GCTCATCGGC GAGATGGGCT ATGCACCGTA TTTCCTGACG GTCCATGACA TCGTGGCTTT CGCGCGCTCG AAGGGCATTC TCTGCCAGGG CCGCGGGTCG GCCGCCAACT CGGTCGTCTG CTACCTGCTC GGCGTCACCG AAGTGCCCCC CGAGAGCATC ACCCTGATCT TCGAACGCTT CATCTCGAAG GAACGCGGTG AGCCGCCTGA TATCGACGTG GATTTCGAGC ACGAGCGGCG CGAGGAAGTG ATCCAATGGA TCTACGAGCA ATATGGCCGG CATCGCGCCG GGCTGACTGC GACCGTGATC CATTTCCGTT CACGCGCCGC GATCCGCGAG GTCGGCAAGG TGATGGGACT GAGCCAGGAC GTAGTCGCCC GGCTGTCGGG GCAGATCTGG GGCTGGTCGT CCAGCGCACC AGGAGAAGAT CGGATGCGCG ACGCCGGAAT CGATCCGGCA GACGGGCGCG TGGCGCTGGC GGCGAAACTG ATCGGTGAGA TCATCGGGTT TCCCCGGCAT CTGAGCCAAC ATGTCGGCGG CTTTGTCATC ACCCATGGCC GACTCGACGA GCTCTGCCCC ATCGAGAACG CGGCAATGGA AGATCGCACG GTCATCGAAT GGGACAAGGA CGATATCGAC GCGCTTGGGC TCCTGAAGGT CGATGTGTTG GCGCTCGGCA TGTTGACTTG CATCCGGAAG GCCTTCGGGC TGCTGGACGA TCACCGTCAC 
CTGCAGCTGA CCCTCGCCAA TGTGCCGCCT GAGGACCCGG TGGTCTATGA CATGCTCTGC AAGGCGGATG CGGTCGGGGT CTTCCAGGTC GAGAGCCGGG CACAGCTGAA CTTCTTGCCG CGCATGCAGC CGCGAAAATT CTACGATCTG GTCTGCGAGG TCGCGATCGT CCGCCCCGGT CCGATCCAGG GCGGCATGGT TCATCCCTTC ATCAACCGCC GTCAGGGCAA GGAGAAGGTC GAAGACCTCG GTCCTGCCAT GATGGAAGTA CTAGGCCGGA CCTATGGTGT CCCCCTCTTC CAGGAGCAGG CCATGCAGAT CGCGGTGGTC GCGGCCGGCT TTTCGGCGGC CGAGGCCGAC CGGCTGCGCC GGTCGCTCGC AACCTTCAAG CGGATGGGCA CGATCGGCGC GTTCCGCGAA CGGTTCATAT CGGGCATGTT GGCCCGCAGC TATGAGGCGG GGTTCGCAGA ACGTTGCTTC GCCCAGATCG AAGGCTTCGG CAGCTATGGC TTCCCCGAAA GCCATGCGGC GAGTTTCGCG CGGCTGGTCT ATATCTCGGC CTGGCTGAAG CGGCACCATC CGGCCGTGTT CACCTGCGCC CTGCTGAACA GCCAGCCGAT GGGCTTCTAC GCCCCGGCTC AGCTGGTTCG CGACGCGCGT GAGCATGGCG TCGAGATCCG GGCCATCTCG GTCAATCATT CGGCCTGGGA CTGCACCCTG GAGCCGCGCG GTGACGGGGC GCTGGCGCTG CGGCTCGGGT TTCGGCAGAT CAAGGGGATG CGGGAGGAGG ATGCGAACTG GATTGAGGCC GCCCGCGGCA ATGGCTACCC CGATGTTGAG GGCCTCTGGC GCCGGGCCGG GGTCAGGCCC GATGCACTGG AACGGTTGGC AGAAGGGGAT GCCTTCGCCT CGCTCGGTCT CAATCGGCGC GATGCGCTAT GGGCCGCCCG GGCATTGCGA GGGCCAAATC CCCTGCCCCT TTTCGGGGCC GATGGTGAAG GCGGGGCGGA ACCGGAGGTT GCGCTGCCGG CGATGACGCT GGGCCAGGAG GTGATTGAGG ACTACCTGGC ACTGCGGCTT TCTCTGCGCG CCCATCCTAT GGAACTGCTC CGACCGCGCC TGCCGGAAAG CCTGCCGCAT GAACGGCTGG ACCGGGCGAC CGGACGGGTC ACCGTAACGG GTCTCGTGAT CACGCGGCAA CGGCCGGGTA CCGCGTCTGG TGTCATCTTT CTGACGCTGG AGGATGAGAC CGGCGTGTCC AACGTGGTGG TCTGGAGCCG GGTTTACGAG GCCTTCCGCA AGGCGGTGAT CGCAGGGCGT CTGCTACGGG TCACCGGCAG GATTGAACGC GAGGGGCAGG TCGTTCATGT GATTGCCGAG CGGATCGAAG ACATCTCGCC GATGCTCTCA AGCCTCGGGC GGCCGGCCAC CGACGCAGGT GATCGTCAGG CTGAGGCAGC GCACCAACCC GCAAGTAGTG GCGGCGGATC AACCGCCCGG CATCCGCGTG AGCAGGCCAA GAAGCTGTTT CCGAGCCGGG ATTTTCACTA G
|
Protein sequence | MTGYAELCVT SNFTFLTGAS HPEELVTRAA ELGLKAIAIT DRNSVAGVVR AFSALKELAR LREEARAASD GAEAGPVVRS RQVTDHSSRQ TMQHMPAGDA PRIPPDMVLP KLIPGARIVL TDSPVDWLAL PADIAAWSRL TRLLSLGKRR ATKGECHLTR KDLLDWGEGM VLIALPPDPM EHPARASLSD LRHMLRIFPG QCFLGAAPRY DGRDPTRLDQ LARIAQDTGL PLVALGEVMM HRSSRRPLAD VLTCLREGCT IDTIGERRLT NGEHRLKSPA EMARMFHRYP AAIRRTLEIA DRCAFRLDDL RYQYPDEARD GEPAQARLER LSREGLHWRY PEGPPARIVT RVDKELKLIG EMGYAPYFLT VHDIVAFARS KGILCQGRGS AANSVVCYLL GVTEVPPESI TLIFERFISK ERGEPPDIDV DFEHERREEV IQWIYEQYGR HRAGLTATVI HFRSRAAIRE VGKVMGLSQD VVARLSGQIW GWSSSAPGED RMRDAGIDPA DGRVALAAKL IGEIIGFPRH LSQHVGGFVI THGRLDELCP IENAAMEDRT VIEWDKDDID ALGLLKVDVL ALGMLTCIRK AFGLLDDHRH LQLTLANVPP EDPVVYDMLC KADAVGVFQV ESRAQLNFLP RMQPRKFYDL VCEVAIVRPG PIQGGMVHPF INRRQGKEKV EDLGPAMMEV LGRTYGVPLF QEQAMQIAVV AAGFSAAEAD RLRRSLATFK RMGTIGAFRE RFISGMLARS YEAGFAERCF AQIEGFGSYG FPESHAASFA RLVYISAWLK RHHPAVFTCA LLNSQPMGFY APAQLVRDAR EHGVEIRAIS VNHSAWDCTL EPRGDGALAL RLGFRQIKGM REEDANWIEA ARGNGYPDVE GLWRRAGVRP DALERLAEGD AFASLGLNRR DALWAARALR GPNPLPLFGA DGEGGAEPEV ALPAMTLGQE VIEDYLALRL SLRAHPMELL RPRLPESLPH ERLDRATGRV TVTGLVITRQ RPGTASGVIF LTLEDETGVS NVVVWSRVYE AFRKAVIAGR LLRVTGRIER EGQVVHVIAE RIEDISPMLS SLGRPATDAG DRQAEAAHQP ASSGGGSTAR HPREQAKKLF PSRDFH
|
| |
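The gene and protein sequences above can be related with a minimal sketch that strips the 10-base spacing, translates codon by codon, and computes a GC fraction. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code for elongation; special handling of alternative start codons is not modelled. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
head = clean("ATGACCGGTT ATGCGGAGCT CTGCGTGACG")
print(translate(head))  # MTGYAELCVT
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence listed in the record, and `gc_fraction` to give a value near the reported 66%.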