Gene Dshi_3998 details

Gene Information

Locus tag: Dshi_3998
Symbol:
ID: 5714527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009957
Strand:
Start bp: 60274
End bp: 63624
Gene Length: 3351 bp
Protein Length: 1116 aa
Translation table: 11
GC content: 66%
IMG OID: 641276910
Product: DNA polymerase III, alpha subunit
Protein accession: YP_001542206
Protein GI: 159046536
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGTT ATGCGGAGCT CTGCGTGACG AGCAATTTCA CCTTCCTGAC CGGCGCCTCG 
CATCCTGAGG AGCTGGTCAC GCGGGCTGCC GAACTGGGTC TGAAGGCCAT CGCCATCACC
GACCGCAACT CGGTGGCAGG CGTGGTGCGG GCCTTCTCGG CCCTGAAAGA GCTGGCCCGA
CTGCGCGAGG AGGCCCGTGC AGCGAGCGAC GGGGCCGAGG CCGGACCAGT GGTCCGATCC
CGGCAAGTGA CCGATCATTC CAGCCGCCAG ACGATGCAGC ACATGCCGGC AGGCGATGCG
CCCCGAATTC CGCCGGACAT GGTGTTGCCG AAGCTGATCC CTGGCGCCCG GATCGTGCTG
ACCGACAGCC CGGTCGACTG GCTCGCCCTA CCCGCCGACA TCGCGGCCTG GTCGCGGCTG
ACCCGGCTCC TGTCCCTCGG CAAGCGGCGG GCCACGAAAG GCGAATGCCA TCTGACGCGC
AAGGATCTGT TGGATTGGGG CGAGGGCATG GTGCTGATCG CGCTGCCGCC TGACCCAATG
GAGCACCCGG CCCGGGCATC TTTAAGCGAC CTGCGGCATA TGCTGCGCAT CTTCCCCGGC
CAGTGCTTTC TTGGCGCGGC CCCCCGCTAT GACGGGCGAG ACCCAACCCG CCTCGACCAG
CTGGCCCGGA TCGCCCAGGA CACCGGCCTG CCGCTCGTGG CCCTCGGCGA GGTGATGATG
CATCGCTCGT CCAGGCGTCC GCTCGCCGAT GTCCTGACCT GCCTGCGCGA GGGCTGCACG
ATCGACACGA TCGGGGAGCG CCGGCTGACC AATGGCGAGC ACCGGCTCAA ATCCCCGGCC
GAGATGGCGC GGATGTTCCA TCGCTATCCG GCGGCGATCC GGCGGACACT GGAGATCGCC
GACCGCTGCG CTTTCCGGCT GGACGATCTG CGCTATCAGT ATCCCGACGA GGCGCGAGAC
GGCGAACCCG CGCAGGCGCG GCTCGAACGA TTGTCGCGGG AGGGGCTGCA CTGGCGCTAT
CCCGAGGGGC CGCCGGCCCG GATCGTCACG CGGGTCGACA AGGAACTGAA GCTCATCGGC
GAGATGGGCT ATGCACCGTA TTTCCTGACG GTCCATGACA TCGTGGCTTT CGCGCGCTCG
AAGGGCATTC TCTGCCAGGG CCGCGGGTCG GCCGCCAACT CGGTCGTCTG CTACCTGCTC
GGCGTCACCG AAGTGCCCCC CGAGAGCATC ACCCTGATCT TCGAACGCTT CATCTCGAAG
GAACGCGGTG AGCCGCCTGA TATCGACGTG GATTTCGAGC ACGAGCGGCG CGAGGAAGTG
ATCCAATGGA TCTACGAGCA ATATGGCCGG CATCGCGCCG GGCTGACTGC GACCGTGATC
CATTTCCGTT CACGCGCCGC GATCCGCGAG GTCGGCAAGG TGATGGGACT GAGCCAGGAC
GTAGTCGCCC GGCTGTCGGG GCAGATCTGG GGCTGGTCGT CCAGCGCACC AGGAGAAGAT
CGGATGCGCG ACGCCGGAAT CGATCCGGCA GACGGGCGCG TGGCGCTGGC GGCGAAACTG
ATCGGTGAGA TCATCGGGTT TCCCCGGCAT CTGAGCCAAC ATGTCGGCGG CTTTGTCATC
ACCCATGGCC GACTCGACGA GCTCTGCCCC ATCGAGAACG CGGCAATGGA AGATCGCACG
GTCATCGAAT GGGACAAGGA CGATATCGAC GCGCTTGGGC TCCTGAAGGT CGATGTGTTG
GCGCTCGGCA TGTTGACTTG CATCCGGAAG GCCTTCGGGC TGCTGGACGA TCACCGTCAC
CTGCAGCTGA CCCTCGCCAA TGTGCCGCCT GAGGACCCGG TGGTCTATGA CATGCTCTGC
AAGGCGGATG CGGTCGGGGT CTTCCAGGTC GAGAGCCGGG CACAGCTGAA CTTCTTGCCG
CGCATGCAGC CGCGAAAATT CTACGATCTG GTCTGCGAGG TCGCGATCGT CCGCCCCGGT
CCGATCCAGG GCGGCATGGT TCATCCCTTC ATCAACCGCC GTCAGGGCAA GGAGAAGGTC
GAAGACCTCG GTCCTGCCAT GATGGAAGTA CTAGGCCGGA CCTATGGTGT CCCCCTCTTC
CAGGAGCAGG CCATGCAGAT CGCGGTGGTC GCGGCCGGCT TTTCGGCGGC CGAGGCCGAC
CGGCTGCGCC GGTCGCTCGC AACCTTCAAG CGGATGGGCA CGATCGGCGC GTTCCGCGAA
CGGTTCATAT CGGGCATGTT GGCCCGCAGC TATGAGGCGG GGTTCGCAGA ACGTTGCTTC
GCCCAGATCG AAGGCTTCGG CAGCTATGGC TTCCCCGAAA GCCATGCGGC GAGTTTCGCG
CGGCTGGTCT ATATCTCGGC CTGGCTGAAG CGGCACCATC CGGCCGTGTT CACCTGCGCC
CTGCTGAACA GCCAGCCGAT GGGCTTCTAC GCCCCGGCTC AGCTGGTTCG CGACGCGCGT
GAGCATGGCG TCGAGATCCG GGCCATCTCG GTCAATCATT CGGCCTGGGA CTGCACCCTG
GAGCCGCGCG GTGACGGGGC GCTGGCGCTG CGGCTCGGGT TTCGGCAGAT CAAGGGGATG
CGGGAGGAGG ATGCGAACTG GATTGAGGCC GCCCGCGGCA ATGGCTACCC CGATGTTGAG
GGCCTCTGGC GCCGGGCCGG GGTCAGGCCC GATGCACTGG AACGGTTGGC AGAAGGGGAT
GCCTTCGCCT CGCTCGGTCT CAATCGGCGC GATGCGCTAT GGGCCGCCCG GGCATTGCGA
GGGCCAAATC CCCTGCCCCT TTTCGGGGCC GATGGTGAAG GCGGGGCGGA ACCGGAGGTT
GCGCTGCCGG CGATGACGCT GGGCCAGGAG GTGATTGAGG ACTACCTGGC ACTGCGGCTT
TCTCTGCGCG CCCATCCTAT GGAACTGCTC CGACCGCGCC TGCCGGAAAG CCTGCCGCAT
GAACGGCTGG ACCGGGCGAC CGGACGGGTC ACCGTAACGG GTCTCGTGAT CACGCGGCAA
CGGCCGGGTA CCGCGTCTGG TGTCATCTTT CTGACGCTGG AGGATGAGAC CGGCGTGTCC
AACGTGGTGG TCTGGAGCCG GGTTTACGAG GCCTTCCGCA AGGCGGTGAT CGCAGGGCGT
CTGCTACGGG TCACCGGCAG GATTGAACGC GAGGGGCAGG TCGTTCATGT GATTGCCGAG
CGGATCGAAG ACATCTCGCC GATGCTCTCA AGCCTCGGGC GGCCGGCCAC CGACGCAGGT
GATCGTCAGG CTGAGGCAGC GCACCAACCC GCAAGTAGTG GCGGCGGATC AACCGCCCGG
CATCCGCGTG AGCAGGCCAA GAAGCTGTTT CCGAGCCGGG ATTTTCACTA G
 
Protein sequence
MTGYAELCVT SNFTFLTGAS HPEELVTRAA ELGLKAIAIT DRNSVAGVVR AFSALKELAR 
LREEARAASD GAEAGPVVRS RQVTDHSSRQ TMQHMPAGDA PRIPPDMVLP KLIPGARIVL
TDSPVDWLAL PADIAAWSRL TRLLSLGKRR ATKGECHLTR KDLLDWGEGM VLIALPPDPM
EHPARASLSD LRHMLRIFPG QCFLGAAPRY DGRDPTRLDQ LARIAQDTGL PLVALGEVMM
HRSSRRPLAD VLTCLREGCT IDTIGERRLT NGEHRLKSPA EMARMFHRYP AAIRRTLEIA
DRCAFRLDDL RYQYPDEARD GEPAQARLER LSREGLHWRY PEGPPARIVT RVDKELKLIG
EMGYAPYFLT VHDIVAFARS KGILCQGRGS AANSVVCYLL GVTEVPPESI TLIFERFISK
ERGEPPDIDV DFEHERREEV IQWIYEQYGR HRAGLTATVI HFRSRAAIRE VGKVMGLSQD
VVARLSGQIW GWSSSAPGED RMRDAGIDPA DGRVALAAKL IGEIIGFPRH LSQHVGGFVI
THGRLDELCP IENAAMEDRT VIEWDKDDID ALGLLKVDVL ALGMLTCIRK AFGLLDDHRH
LQLTLANVPP EDPVVYDMLC KADAVGVFQV ESRAQLNFLP RMQPRKFYDL VCEVAIVRPG
PIQGGMVHPF INRRQGKEKV EDLGPAMMEV LGRTYGVPLF QEQAMQIAVV AAGFSAAEAD
RLRRSLATFK RMGTIGAFRE RFISGMLARS YEAGFAERCF AQIEGFGSYG FPESHAASFA
RLVYISAWLK RHHPAVFTCA LLNSQPMGFY APAQLVRDAR EHGVEIRAIS VNHSAWDCTL
EPRGDGALAL RLGFRQIKGM REEDANWIEA ARGNGYPDVE GLWRRAGVRP DALERLAEGD
AFASLGLNRR DALWAARALR GPNPLPLFGA DGEGGAEPEV ALPAMTLGQE VIEDYLALRL
SLRAHPMELL RPRLPESLPH ERLDRATGRV TVTGLVITRQ RPGTASGVIF LTLEDETGVS
NVVVWSRVYE AFRKAVIAGR LLRVTGRIER EGQVVHVIAE RIEDISPMLS SLGRPATDAG
DRQAEAAHQP ASSGGGSTAR HPREQAKKLF PSRDFH