Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sden_3620 |
Symbol | |
ID | 4020177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella denitrificans OS217 |
Kingdom | Bacteria |
Replicon accession | NC_007954 |
Strand | + |
Start bp | 4339054 |
End bp | 4341819 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637957681 |
Product | DNA polymerase I |
Protein accession | YP_564618 |
Protein GI | 91794967 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00307404 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATAA TCCCAGAAAA CCCCCTCGTC CTTGTGGATG GTTCTTCCTA TCTTTATCGC GCTTACTACG CACCACCACA TCTAACCAAT TCTAAGGGTG AAGCCACGGG AGCGGTATAC GGTGTAGTCA ACATGCTTCG CAGCTTAATG AGCCGTTATC ATCCGAGTCA CATTGCCGTG GTATTTGATG CCAAAGGTAA AACATTTAGA AATGATATGT ACAGCGAGTA CAAGGCGCAG CGCCCACCCA TGCCTGATGA CTTACGCAGT CAAATCGCCC CTTTGCATAG AATAATCCAC GCATTAGGCC TACCGCTTAT CAGTATCGAA GGGGTTGAAG CCGATGACGT CATAGGCACC ATAGCCAAGC GTGCCAGCGC CGAAGGCCGT GCCACCTTGA TAAGCACAGG TGATAAAGAC ATGGCGCAGC TGGTTAATGA ACATGTCACC CTGATAAATA CCATGACAGA CACCATAATG GGGCCTGAAG AAGTGGCCAC TAAATTTGGT GTGGGCCCCG AGCTTATCAT AGATTTCTTA GCCCTGATGG GCGATAAATC CGATAACATT CCAGGTCTAC CGGGTGTTGG TGAAAAAACC GCACTGGCCA TGTTAAACGG AGTTGGCAGT GTAGAAAAAC TATTAGCCGA TCCCCAAAGC GTCCTTGAAG TGGGCTTTAG GGGCGCTAAA ACCATGCCAG CAAAAATTAT CGACAATGCT GACATGCTTA AACTCTCCTA CCAACTGGCC ACCATCAAAA CCGATGTGGA GTTGGATCCC GACTGGGATC TTCTTGCGGT GAAGCCGCAT AATAAAGATG AGCTTATCGC CTGTTATGGT GAAATGGAGT TTAAACGCTG GCTAGCTGAA GTATTAGATA ATAAAGCGCC AACCCAAGTT AAAACATCAT CCAATGCTGA GTCTGACGAA GTGATGGCTC CCGTTGAGGC CATTAGCGCC AAGTATCACA CCATCTTAAC CCTCGATGAC TTAGATACTT GGATTGCAAA GTTAAGCCAA GCCAAACTCA TTGCCATAGA CACTGAGACC ACCAGCTTAA ATTACATGGA TGCTAAGCTC GTGGGGATTT CATTCGCCAT CGAAGCAGGC GAAGCCGCAT ATTTGCCGTT AGCCCATGAT TATTTAGATG CTCCGAGCCA AATAGACATG GCAACCGCCC TTGAGAAGCT GCGGCCGCTA CTGGAAAGTG ACAACCCAGC TAAAGTAGGC CAAAACTTAA AGTACGATAT CAGCATTTTC GCTAATGTGG GCATCAAGCT TAAAGGCGTG CACTTCGATA CCATGCTGGA GTCCTATGTG TTTAATTCGG TCGCCTCCCG CCATGATATG GATGGGCTTG CCTTAAAGTA CTTAGGCCAT AAGAACATTA GCTTCGAGGA CGTTGCCGGT AAAGGCGCGA AACAGCTGAC CTTTAATCAA ATCGATTTAG ACACCGCCGC GCCCTATGCC GCAGAAGATG CCGATATTAC CTTACGTCTG CATCAACATT TATGGCCAAG GCTTGAGAAA GAGCCAGAAC TGGCCCAAGT GTTCACCGAA TTAGAATTAC CACTTATTCA AGTTTTATCT GATATCGAGC GCCAAGGGGT ACTTATAGAC CCTATGCTAT TAAGTCAGCA AAGTGATGAA CTAGCGCAGA AGATTGATAA GTTAGAACTT GAGGCCTATG AGATAGCCGG TGAGAAATTT AACCTAGGGT CGCCAAAACA GTTGCAAGTC TTATTCTTTG AAAAATTGGC ATACCCTATC ATCAAGAAGA CCCCCAAAGG TGCGCCATCG ACGGCTGAAG AGGTGTTAGT CGAACTCGCC TTAGACTACC CCTTGCCCAA GATTATCTTA GCTCATCGCA GCTTAGCCAA ACTTAAGAGT ACCTACACGG ACAAACTGCC GTTAATGGTC AATGGCACCA CGGGGCGAGT GCATACCAGC TACCATCAAG CGAATGCCGC GACGGGGCGC TTATCATCGA GCGATCCAAA CCTTCAAAAC ATTCCAATCC GCACTGAAGA AGGTCGCCGC ATCAGACAAG CCTTTATCGC GCCGGCTGGG CGTAAAATTT TAGCGGCGGA TTATTCACAA ATTGAATTAA GGATCATGGC ACATTTATCC CAAGATAAAG GCCTACTCAC CGCCTTTGCC GAAGGTAAAG ATATTCATAG AGCCACCGCA GCGGAAGTCT TTGGCGCACA TTTTGAAGAA GTCACCACAG AGCAAAGGCG CCGTGCAAAA GCCGTCAACT TCGGTTTGAT TTATGGCATG TCGGCCTTTG GCTTGGCTAA GCAGCTGGAT ATTCCTCGCA ACGAAGCACA AACCTATATC GACACTTATT TCGCCCGCTA TCCTGGGGTA TTACAGTATA TGGAAGAGAC CCGTGCTAGC GCAGCCGAAT TAGGCTACGT ATCGACCCTG TTTGGCCGCC GCCTGTATCT ACCAGAAATT CGCGATCGTA ACGCCATGCG CCGCCAAGCT GCAGAAAGGG CTGCCATTAA TGCCCCAATG CAAGGCACGG CCGCTGACAT CATCAAGAAA GCCATGATCA ACATAGCGAA CTGGATCAAG ACCGAGACCC AAGGTGAAAT AACCATGATC ATGCAAGTAC ACGATGAACT GGTGTTTGAA GTGGATGAAG CTCAAGCTGA AACATTAAAA GCCAAAATTT GCTTACTCAT GGCACAAGCC GCCGATTTGG ATGTCACCCT ATTGGCAGAA GCGGGCATTG GTAATAATTG GGATGAAGCC CACTAG
|
Protein sequence | MPIIPENPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLM SRYHPSHIAV VFDAKGKTFR NDMYSEYKAQ RPPMPDDLRS QIAPLHRIIH ALGLPLISIE GVEADDVIGT IAKRASAEGR ATLISTGDKD MAQLVNEHVT LINTMTDTIM GPEEVATKFG VGPELIIDFL ALMGDKSDNI PGLPGVGEKT ALAMLNGVGS VEKLLADPQS VLEVGFRGAK TMPAKIIDNA DMLKLSYQLA TIKTDVELDP DWDLLAVKPH NKDELIACYG EMEFKRWLAE VLDNKAPTQV KTSSNAESDE VMAPVEAISA KYHTILTLDD LDTWIAKLSQ AKLIAIDTET TSLNYMDAKL VGISFAIEAG EAAYLPLAHD YLDAPSQIDM ATALEKLRPL LESDNPAKVG QNLKYDISIF ANVGIKLKGV HFDTMLESYV FNSVASRHDM DGLALKYLGH KNISFEDVAG KGAKQLTFNQ IDLDTAAPYA AEDADITLRL HQHLWPRLEK EPELAQVFTE LELPLIQVLS DIERQGVLID PMLLSQQSDE LAQKIDKLEL EAYEIAGEKF NLGSPKQLQV LFFEKLAYPI IKKTPKGAPS TAEEVLVELA LDYPLPKIIL AHRSLAKLKS TYTDKLPLMV NGTTGRVHTS YHQANAATGR LSSSDPNLQN IPIRTEEGRR IRQAFIAPAG RKILAADYSQ IELRIMAHLS QDKGLLTAFA EGKDIHRATA AEVFGAHFEE VTTEQRRRAK AVNFGLIYGM SAFGLAKQLD IPRNEAQTYI DTYFARYPGV LQYMEETRAS AAELGYVSTL FGRRLYLPEI RDRNAMRRQA AERAAINAPM QGTAADIIKK AMINIANWIK TETQGEITMI MQVHDELVFE VDEAQAETLK AKICLLMAQA ADLDVTLLAE AGIGNNWDEA H
|
| |