Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_4290 |
Symbol | |
ID | 7089092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 5096132 |
End bp | 5098897 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643463164 |
Product | DNA polymerase I |
Protein accession | YP_002360179 |
Protein GI | 217975428 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000095151 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000000056057 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCAACCG TCGCTAAAAA CCCACTTGTG CTTGTGGATG GATCTTCCTA TTTATATCGC GCTTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCAACCGG TGCCGTTTAT GGCGTAGTGA ATATGTTACG CAGCTTGCTG ACTCGCTATC AGCCGAGCCA TATCGCAGTC GTGTTCGATG CCAAGGGCAA AACCTTCCGC AATGACATGT ACAGCGAGTA CAAAGCACAG CGCCCACCTA TGCCTGATGA CCTACGTTCG CAAATCGAAC CTTTACACCG CATTATTCGT GCACTGGGTT TACCACTGAT TTCGATTTCC GGCGTCGAAG CCGATGACGT GATCGGCACT ATTGCTCGCC AAGCCAGTTT AGAAAACCGT GCAGTGCTTA TCAGCACTGG CGATAAAGAC ATGGCGCAGT TAGTCGATGA GAACGTCACG CTCATCAACA CCATGACAGA CACCATAATG GGCCCAGAAG AAGTCGCGAT TAAATTTGGT GTTGGTCCAG ACCGTATCAT AGATTTGCTC GCGCTGATGG GCGACAAGGC CGATAACATT CCCGGCTTGC CCGGCGTTGG TGAGAAAACA GCGCTCGCTA TGCTCACAGG AGCCGGCAGC GTGAGTAACT TATTAGCGGA ACCCGAAAAA GTCGCCGAGC TCGGCTTTAG GGGTTCTAAA ACCATGGCGG CGAAGATCAT TGAAAATGCC GACATGCTCA AGCTTTCTTA CGAACTCGCC ACCATCAAAA CCGATGTCGA ACTTGAGCAA GACTGGCACG AACTCACCAT CAAGCCTGCG GACAAAGATG AACTGATCAA ATGCTACGGC GAGATGGAGT TTAAGCGTTG GTTAGCCGAA GTCTTAGATA ATAAAATCAC TGCAAATACC CCAATTGATG CGGCATCCGA GACACAGGAA GACTCAACTC CAGTCGAAGC GATTGCAACG CAGTACGACT GCATTCTCAC GGAAGCCGAA TTAGACGCTT GGATTGCTAA GCTTAAGCAA GCCGACTTGA TGGCGGTAGA CACTGAAACC ACCAGCTTAG ATTACATGGT GGCTGAATTA GTCGGGATCT CCTTCGCCGT TGAGGCAGGA AAAGCCGCTT ATCTGCCTTT GACCCATGAT TATGTTGGCG CACCGACTCA AATCGATAAA ACCGTCGCGC TAGAAAAACT GCGTCCACTG CTTGAAGATC CAAAACTTAA GAAAGTCGGT CAAAATCTTA AGTACGACAT CAGTATCTTA GCCAATGCGG GTATCAAACT GCAGGGCGTC GCTTTCGACA CTATGCTCGA ATCCTATGTT TTCAACTCAG TAGCTTCGCG CCATGATATG GATGGCTTAG CGCTCAAGTA CTTAGGCCAT AAGAATATCA GCTTCGAAGA AATCGCTGGC AAAGGTGCAA AACAGCTGAC CTTCAACCAA ATTCCACTAG AAACAGCTGC GCCTTATGCG GCCGAAGATG CCGACATCAC CCTGCGTTTA CACCAACATT TGTGGCCAAG ACTTGAAAAA GAAGCGGAAC TGGCCGCTAT GTTTACCGAA GTCGAATTGC CGCTGATCCA AGTATTGTCG GATATTGAAC GCCAAGGGGT ATTAATCGAC AGCATGTTAC TCGGCCAACA AAGCGACGAG CTGGCGCGTA AAATCGATAC CTTAGAAGAA AAAGCCTACG ACATTGCCGG TGAGAAATTT AACCTTGGCT CGCCTAAGCA ACTGCAAGTA TTGTTTTTTG AAAAGCTAGG TTATCCGATC ACCAAAAAGA CCCCCAAGGG CGCACCATCG ACCGCGGAAG AAGTGTTGGT CGAATTGGCG TTAGATTTCC CTTTGCCAAA GGTGATCCTC GAACATCGCA GCCTATCTAA ATTAAAGAGC ACTTACACAG ATAAACTGCC ACTAATGGTC AATGCTAAAA CGGGTCGCGT GCACACTAGC TATCATCAAG CTAATGCGGC AACAGGACGC TTATCATCGA GCGAGCCTAA CCTGCAGAAT ATTCCTATCC GCACCGAAGA AGGTCGCCGT ATTCGCCAAG CCTTTATCGC ACCTGATGGT CGTAAAATTT TGGCAGCCGA CTACTCGCAA ATCGAACTAC GGATCATGGC CCATTTATCC CAAGATGCCG GTTTACTCAA AGCCTTCGCC GAGGGCAAAG ACATTCACAG AGCTACGGCT GCCGAAGTGT TTGGCACGGA CTTTGATGAA GTAACCACAG AACAGCGTCG CCGGGCCAAA GCGGTTAACT TCGGCTTAAT TTACGGCATG TCAGCCTTTG GTTTAGCGCG TCAGCTCGAT ATCCCGCGCC ATGAAGCGCA AACCTATATC GACACCTACT TTGCCCGCTA TCCTGGCGTA TTACGTTATA TGGAAGAGAC GCGTGCGGGG GCTGCCGACC TAGGTTATGT ATCGACTCTG TTTGGCCGTC GCCTCTATTT ACCCGAAATC CGTGATCGTA ATGCTATGCG CCGCCAAGGT GCTGAACGTG CTGCGATTAA TGCCCCAATG CAAGGCACAG CTGCCGACAT CATCAAAAAG GCCATGATCA ATATCGCCCA GTGGATCAAG ACAGAAACCC AAGGCGAAAT CACTATGATC ATGCAGGTTC ACGATGAATT GGTGTTTGAA GTCGATGCAG ATAAAGCAGA AGCGCTTAAA AAGACAATCT GTACTTTAAT GGCACAAGCC GCCGATCTCG ATGTTGAACT GCTTGCCGAA GCGGGCATTG GTAATAACTG GGATGAAGCC CACTAA
|
Protein sequence | MPTVAKNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL TRYQPSHIAV VFDAKGKTFR NDMYSEYKAQ RPPMPDDLRS QIEPLHRIIR ALGLPLISIS GVEADDVIGT IARQASLENR AVLISTGDKD MAQLVDENVT LINTMTDTIM GPEEVAIKFG VGPDRIIDLL ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VSNLLAEPEK VAELGFRGSK TMAAKIIENA DMLKLSYELA TIKTDVELEQ DWHELTIKPA DKDELIKCYG EMEFKRWLAE VLDNKITANT PIDAASETQE DSTPVEAIAT QYDCILTEAE LDAWIAKLKQ ADLMAVDTET TSLDYMVAEL VGISFAVEAG KAAYLPLTHD YVGAPTQIDK TVALEKLRPL LEDPKLKKVG QNLKYDISIL ANAGIKLQGV AFDTMLESYV FNSVASRHDM DGLALKYLGH KNISFEEIAG KGAKQLTFNQ IPLETAAPYA AEDADITLRL HQHLWPRLEK EAELAAMFTE VELPLIQVLS DIERQGVLID SMLLGQQSDE LARKIDTLEE KAYDIAGEKF NLGSPKQLQV LFFEKLGYPI TKKTPKGAPS TAEEVLVELA LDFPLPKVIL EHRSLSKLKS TYTDKLPLMV NAKTGRVHTS YHQANAATGR LSSSEPNLQN IPIRTEEGRR IRQAFIAPDG RKILAADYSQ IELRIMAHLS QDAGLLKAFA EGKDIHRATA AEVFGTDFDE VTTEQRRRAK AVNFGLIYGM SAFGLARQLD IPRHEAQTYI DTYFARYPGV LRYMEETRAG AADLGYVSTL FGRRLYLPEI RDRNAMRRQG AERAAINAPM QGTAADIIKK AMINIAQWIK TETQGEITMI MQVHDELVFE VDADKAEALK KTICTLMAQA ADLDVELLAE AGIGNNWDEA H
|
| |