Gene Information |
Locus tag | Sde_0073 |
Symbol | |
ID | 3967157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 92145 |
End bp | 94925 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637919132 |
Product | DNA polymerase I |
Protein accession | YP_525549 |
Protein GI | 90019722 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
| |
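The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 92145
end_bp = 94925

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, leftover = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, leftover, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (2781 bp) and Protein Length (926 aa).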
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.800167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATT CGTCGCTACC GCTACTTCTT GTTGATGGCT CTTCGTATTT ATATCGAGCT TTCCATGCAA TGCCGCCACT TACCACCTCC AGTGGGCAGC CCACGGGCGC GGTGCGCGGA GTAGTTAATA TGTTGCGCAA GCTCGCCAAG GATTACCCCG AAAGCCCAAT AGCAGTAATT TTTGACGCCA AGGGCAAAAC CTTCCGCGAT GATATTTATA GCGATTACAA AGCCAATCGC CCGCCAATGC CCGATGATTT GCGCGCACAA ATTGAGCCGT TACACACCAT TATTCGCGCA ATGGGCTTGC CGCTTATTAT TCAAGATGGG GTAGAAGCCG ACGATGTAAT TGGTACTTAC GCCCAACAGG CCACCGAAAA AGGCATACCG GTTGTGGTAT CTACTGGCGA TAAGGACATG GCGCAGTTAG TCAGTGAGCA CGTAACACTT GTGAACACCA TGACCGAAAC CGTCATGGAT ATTGCCGGTG TTAACGAAAA ATTTGGTATT CCGCCTTCGC TAATTATCGA TTTCTTGGCG TTAATGGGCG ATAAGGTCGA CAATATTCCT GGCGTGCCCG GCGTAGGTGA GAAAACAGCC TTAGCGCTAC TGCAAAACTT AGGCAGTGTA GAAGATATTT ACGCCAATTT AGACAAGGTA CCCACCTTAG AGGTGCGCGG TGCAAAATCG TTGGCGAAAA AGCTCGAAGA TAATCGCGAC AAAGCGTTTT TATCGTACGA GCTGGCAACC ATCAAACTAG ATGTAGAGCT AGAGCACAGC CTTGAAGAGT TAATTCCCGT CGACCCAGAC AACGATGCGT TGCTGGCCTT GTTTACCGAA CTGGAATTTA AAGGTTGGGT TACCGAGCTC TCGAAAGGGG GCGGCACTAA AGCCAAAAAA GCTAGCCCCA ACGCTAAGCA GGCAGCAGAA GAGGCGCCAG CCGCCGCGCC CGAAGAAGCC GTAGAAATTA ACCGCGATAA CTACGAAACC ATCTTAACCG AAGCCAGCTT AGAGGCGTGG GTAGACCGCC TAAGCAAAGC AGAGCTGTTC GCGTTTGATA CCGAAACTAC CAGCCTTAAC TATATGGAAG CTAAAGTGGT GGGTGTTTCG TTTGCTGTGG CCCCGGGAGA GGCCGCCTAT GTGCCGTTTG GACACGATTA CTTGGGCGCG CCAGAGCAGC TAAGTGAAGA GCAAGTGCTG GGTAAGCTAA AGCCGTTACT TGAAGACGAA AATGTTAAGA AAGTCGGCCA AAATCTTAAA TATGACGCAA GTGTGTTGGC AAATCACGGT ATTGCCCTGC AGGGTATAGC GTTCGATACC ATGCTGGAAT CTTACGTGGT GAACTCCACC GCCAACCGTC ACGATATGGA TACCCTTGCG CTGGCGCATT TAGGTCACAC CAATATTTCA TTTGAAGAAA TTGCCGGTAA GGGCGCTAAG CAACTTACTT TTAACCAAAT TCAACTCGAC CAAGCCGCAC CTTACGCCGC CGAAGATGCC GACATAACTC TGCGTTTGCA CCAAGCGCTA TGGCCACAAG TGGAGGCCGT AGAAGGAATA AAGTCTTTGT TCGAAAATGT CGAGCTGCCG TTGGTGCCGG TGTTATCTAA GGTAGAGCGC ACAGGCGCAT TAATTGATGC CACTATTCTC GGCCAGCAAA GTGCCGAGCT GGCCGCCGAC TTAGAGCGCA TTCAACGCGA GGCGTGGGAA TTGGCCGGTG AAGAGTTTAA TTTAGCCTCG CCCAAACAAT TGGGTGTAAT TTTGTTCGAA AAGCTGGGCA TCCCCGTTAT TAAAAAGACG 
AAAACCGGCG CCCCCTCAAC CGCAGAAGAA GTGCTGCAAG AGCTGGCTTT AGATTACCCC TTGCCCGCAC TGTTACTGGA GCAGCGCGGC TTGGCGAAGT TGAAATCTAC TTATACCGAC AAGCTGCCAA CCATGATTAA CCCCCATACT GGGCGGGTGC ATACATCCTA TAACCAAGCA GTAACGGCGA CTGGGCGGTT AAGTTCTACC GACCCCAATT TGCAGAATAT TCCTATTAAA ACCGAAGCTG GGCGGCGTGT GCGCAAGGCG TTTATCGCCC CCGAGGGTTA CCGCATAGTC GCGGCTGACT ACTCGCAAAT TGAATTGCGC ATTATGGCGC ACCTATCGGG CGACGAAGGA TTAACCAACG CGTTTAACCA AGGGTTGGAT GTGCACAGTG CCACTGCCGC CGAAGTATTT GGCACCAGTG TAGAGAGCGT TACCCCAGAG CAGCGCCGCA GCGCCAAAGC CATTAACTTT GGGCTTATAT ACGGCATGTC GGCGTTTGGC TTGGCGCGCC AGCTGCACAT TGGTCGCAAT GATGCCCAGC GCTACATAGA TACCTATTTC GATCGCTACC CTGGTGTTGC ACGCTATATG GAAGATATTC GTGTATTCGC GAAGGAGCAT GGCTACGTAG AAACGTTAAT GGGCCGCAGG TTGTATTTGC CAGAAATTAA CGCCAGTAAC GGTATGCGCA GGCAGGCGGC AGAGCGCACC GCAATCAACG CGCCCATGCA AGGGTCTGCG GCAGATATCA TCAAAAAAGC GATGATTGAC GTCGATAAGT GGCTATCTAC CCTAAAAGTA GACGCGAAAA TGATAATGCA GGTACACGAT GAATTGGTGC TAGAGGTTGC CGAAGACCAA ATAGAGCCAG TAACCGCCAC CCTGTGCGAT ATTATGTCGG CAGCGCTTAA GTTAGATGTA CCGCTATTGG TTGAAGCCGG TGTGGGGATG AACTGGGATG AGGCGCACTA G
|
Protein sequence | MSNSSLPLLL VDGSSYLYRA FHAMPPLTTS SGQPTGAVRG VVNMLRKLAK DYPESPIAVI FDAKGKTFRD DIYSDYKANR PPMPDDLRAQ IEPLHTIIRA MGLPLIIQDG VEADDVIGTY AQQATEKGIP VVVSTGDKDM AQLVSEHVTL VNTMTETVMD IAGVNEKFGI PPSLIIDFLA LMGDKVDNIP GVPGVGEKTA LALLQNLGSV EDIYANLDKV PTLEVRGAKS LAKKLEDNRD KAFLSYELAT IKLDVELEHS LEELIPVDPD NDALLALFTE LEFKGWVTEL SKGGGTKAKK ASPNAKQAAE EAPAAAPEEA VEINRDNYET ILTEASLEAW VDRLSKAELF AFDTETTSLN YMEAKVVGVS FAVAPGEAAY VPFGHDYLGA PEQLSEEQVL GKLKPLLEDE NVKKVGQNLK YDASVLANHG IALQGIAFDT MLESYVVNST ANRHDMDTLA LAHLGHTNIS FEEIAGKGAK QLTFNQIQLD QAAPYAAEDA DITLRLHQAL WPQVEAVEGI KSLFENVELP LVPVLSKVER TGALIDATIL GQQSAELAAD LERIQREAWE LAGEEFNLAS PKQLGVILFE KLGIPVIKKT KTGAPSTAEE VLQELALDYP LPALLLEQRG LAKLKSTYTD KLPTMINPHT GRVHTSYNQA VTATGRLSST DPNLQNIPIK TEAGRRVRKA FIAPEGYRIV AADYSQIELR IMAHLSGDEG LTNAFNQGLD VHSATAAEVF GTSVESVTPE QRRSAKAINF GLIYGMSAFG LARQLHIGRN DAQRYIDTYF DRYPGVARYM EDIRVFAKEH GYVETLMGRR LYLPEINASN GMRRQAAERT AINAPMQGSA ADIIKKAMID VDKWLSTLKV DAKMIMQVHD ELVLEVAEDQ IEPVTATLCD IMSAALKLDV PLLVEAGVGM NWDEAH
|
| |
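To illustrate how the protein sequence relates to the gene sequence, here is a minimal sketch that translates the first and last few codons of the gene. It builds the codon table from the compact amino acid string NCBI uses for the standard code; translation table 11 uses the same amino acid assignments and differs only in which codons may act as starts. The function name `translate` and the constants are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon order follows the NCBI convention: first base varies slowest, third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # The record groups bases in blocks of ten, so strip whitespace first.
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 21 bases of the gene sequence, copied from the record.
first_bases = "ATGAGTAATT CGTCGCTACC GCTACTTCTT"
last_bases = "AACTGGGATG AGGCGCACTA G"

print(translate(first_bases))
print(translate(last_bases))
```

The first fragment yields the opening residues of the listed protein (MSNSSLPLLL), and the second yields its final residues followed by the stop codon. For a full record, an established library such as Biopython is a more robust choice than a hand-built table.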