Gene Information |
Locus tag | Sbal195_4485 |
Symbol | |
ID | 5756316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 5296781 |
End bp | 5299546 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641290840 |
Product | DNA polymerase I |
Protein accession | YP_001556902 |
Protein GI | 160877586 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
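The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported gene length), and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 5296781
end_bp = 5299546
gene_length_bp = 2766
protein_length_aa = 921


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values with the ones reported in the record.
print(computed_length == gene_length_bp)       # gene length check
print(computed_protein == protein_length_aa)   # protein length check
```

Both comparisons hold for this record: 5299546 - 5296781 + 1 = 2766 bp, and 2766 / 3 - 1 = 921 aa.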
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00194681 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0107032 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCG TCGCTAAAAA CCCACTTGTG CTTGTGGATG GATCTTCTTA TTTATATCGC GCTTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCAACCGG TGCCGTTTAT GGCGTAGTGA ATATGTTACG CAGCTTGCTG ACTCGCTATC AGCCGAGCCA TATCGCAGTC GTGTTCGATG CCAAGGGCAA AACCTTCCGC AATGACATGT ACAGCGAGTA CAAAGCACAG CGCCCACCAA TGCCTGATGA CCTACGTTCG CAAATCGAAC CTTTACACCG CATTATTCGC GCACTGGGTT TACCACTGAT TTCGATTTCC GGCGTCGAAG CCGATGACGT GATCGGCACT ATTGCTCGCC AAGCCAGTTT AGAAAACCGT GCAGTGCTTA TCAGCACTGG CGATAAAGAC ATGGCGCAGT TAGTCGATGA GAACGTCACG CTCATCAACA CCATGACAGA CACCATAATG GGCCCAGAAG AAGTCGCGAT TAAATTTGGT GTTGGTCCAG ACCGTATCAT AGATTTGCTC GCGCTGATGG GCGACAAGGC CGACAACATT CCCGGCTTAC CCGGCGTTGG TGAGAAAACT GCGCTCGCTA TGCTCACAGG AGCCGGCAGC GTGAGTAATT TATTAGCGGA ACCCGAAAAA GTCGCCGAGC TCGGCTTTAG GGGTTCTAAA ACCATGGCGG CGAAGATCAT TGAAAATGCC GATATGCTCA AGCTTTCTTA CGAACTCGCC ACCATTAAAA CCGATGTCGA ACTTGAGCAA GACTGGCACG AACTCACCAT CAAGCCCGCG GACAAAGATG AACTGATCAA ATGCTACGGC GAGATGGAGT TTAAGCGTTG GTTAGCCGAA GTCTTAGATA ACAAAATCAC TGCAAATACC TCAATTGATG CGGCATCCGA GACACAAGAA GACTCAACTC CAGTCGAAGC GATTGCAACG CAGTACGACT GCATTCTCAC GGAAGCCGAA TTAGACGCTT GGATTGCGAA GCTTAAAGAA GCCGACTTGA TGGCGGTAGA TACCGAAACC ACCAGCTTAG ATTACATGGT AGCGGAATTA GTCGGGATCT CCTTCGCCGT TGAGGCAGGA AAAGCCGCTT ATCTGCCTTT GACCCATGAT TATGTTGGCG CACCGACTCA AATCGATAAA ACCGTCGCGC TAGAAAAACT GCGTCCACTG CTTGAAGATC CAAAACTGAA GAAAGTCGGT CAAAATCTTA AGTACGACAT AAGTATCTTA GCCAATGCGG GAATCAAACT GCAGGGCGTC GCTTTCGACA CTATGCTCGA ATCCTATGTC TTCAACTCAG TAGCTTCGCG CCATGATATG GATGGCTTAG CGCTCAAGTA CTTAGGCCAT AAGAATATCA GCTTTGAAGA AATCGCCGGT AAAGGTGCAA AACAGCTGAC CTTCAACCAA ATTCCACTGG AAACGGCTGC GCCTTATGCG GCCGAAGATG CCGACATCAC CCTACGTTTA CACCAACATT TGTGGCCAAG ACTCGAAAAA GAAGCAGAAC TGGCCGCTAT GTTTACCGAA GTCGAATTGC CGCTGATCCA AGTATTGTCG GATATTGAAC GCCAAGGGGT ATTAATCGAT AGCATGTTAC TCGGCCAACA AAGCGACGAG CTAGCGCGTA AAATCGATAC CTTAGAAGAA AAAGCCTACG ACATTGCCGG TGAGAAATTT AACCTTGGCT CGCCTAAGCA ACTGCAAGTA TTGTTTTTTG AAAAGCTAGG TTATCCGATC ACCAAAAAGA CTCCCAAGGG CGCACCATCA 
ACCGCAGAAG AAGTGTTGGT CGAATTGGCA TTAGATTTCC CTTTGCCAAA GGTGATCCTC GAACATCGCA GCCTATCTAA ATTAAAGAGC ACTTACACAG ATAAACTGCC ACTCATGGTC AATGCTAAAA CGGGTCGTGT GCACACTAAC TATCATCAGG CCAATGCGGC AACAGGACGC TTATCATCGA GCGAGCCTAA CCTGCAGAAT ATTCCTATCC GCACCGAAGA AGGTCGCCGT ATTCGCCAAG CCTTTATCGC ACCTGATGGT CGTAAAATTT TGGCAGCCGA CTACTCGCAA ATCGAACTGC GGATCATGGC ACATTTATCC CAAGATGCCG GCTTACTCAA AGCCTTCGCC GAGGGCAAAG ACATTCACAG AGCTACGGCT GCCGAAGTGT TTGGCACGGA CTTTGATGAA GTAACCACAG AACAGCGTCG CCGCGCCAAA GCGGTTAACT TCGGCTTAAT TTACGGCATG TCAGCCTTTG GTTTAGCGCG TCAGCTCGAT ATCCCGCGCC ATGAAGCGCA AACCTATATC GACACCTACT TTGCCCGTTA TCCAGGCGTA TTACGTTATA TGGAAGAGAC GCGTGCGGGG GCTGCCGACC TAGGTTATGT ATCGACTCTG TTTGGTCGTC GCCTCTACTT GCCCGAAATC CGTGATCGTA ACGCGATGCG CCGCCAAGGT GCTGAACGTG CTGCGATTAA TGCCCCAATG CAAGGCACGG CTGCCGACAT CATCAAAAAG GCCATGATCA ATATCGCCCA GTGGATAAAG ACAGAAACCC AAGGCGAAAT CACTATGATC ATGCAGGTTC ACGATGAATT GGTGTTTGAA GTCGATGCGG ATAAAGCAGA AGCGCTTAAA AAGACAATCT GCACTTTAAT GGCACAAGCC GCCGATCTCG ATGTTGAACT ACTTGCCGAA GCGGGCATTG GTAATAACTG GGATGAAGCT CACTAA
|
Protein sequence | MPTVAKNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL TRYQPSHIAV VFDAKGKTFR NDMYSEYKAQ RPPMPDDLRS QIEPLHRIIR ALGLPLISIS GVEADDVIGT IARQASLENR AVLISTGDKD MAQLVDENVT LINTMTDTIM GPEEVAIKFG VGPDRIIDLL ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VSNLLAEPEK VAELGFRGSK TMAAKIIENA DMLKLSYELA TIKTDVELEQ DWHELTIKPA DKDELIKCYG EMEFKRWLAE VLDNKITANT SIDAASETQE DSTPVEAIAT QYDCILTEAE LDAWIAKLKE ADLMAVDTET TSLDYMVAEL VGISFAVEAG KAAYLPLTHD YVGAPTQIDK TVALEKLRPL LEDPKLKKVG QNLKYDISIL ANAGIKLQGV AFDTMLESYV FNSVASRHDM DGLALKYLGH KNISFEEIAG KGAKQLTFNQ IPLETAAPYA AEDADITLRL HQHLWPRLEK EAELAAMFTE VELPLIQVLS DIERQGVLID SMLLGQQSDE LARKIDTLEE KAYDIAGEKF NLGSPKQLQV LFFEKLGYPI TKKTPKGAPS TAEEVLVELA LDFPLPKVIL EHRSLSKLKS TYTDKLPLMV NAKTGRVHTN YHQANAATGR LSSSEPNLQN IPIRTEEGRR IRQAFIAPDG RKILAADYSQ IELRIMAHLS QDAGLLKAFA EGKDIHRATA AEVFGTDFDE VTTEQRRRAK AVNFGLIYGM SAFGLARQLD IPRHEAQTYI DTYFARYPGV LRYMEETRAG AADLGYVSTL FGRRLYLPEI RDRNAMRRQG AERAAINAPM QGTAADIIKK AMINIAQWIK TETQGEITMI MQVHDELVFE VDADKAEALK KTICTLMAQA ADLDVELLAE AGIGNNWDEA H
|
| |
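The gene and protein sequences above can be spot-checked against each other by translating the ends of the gene sequence. The sketch below is a minimal translator, not the tool the database used. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is stripped first, so the 10-base blocks used in the
    record can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
first_30 = "ATGCCAACCG TCGCTAAAAA CCCACTTGTG"
last_15 = "GATGAAGCT CACTAA"

print(translate(first_30))  # MPTVAKNPLV
print(translate(last_15))   # DEAH
```

The first ten residues (MPTVAKNPLV) and the last four (DEAH) match the start and end of the protein sequence listed above. The final TAA codon is the stop and is not counted in the 921 aa protein length.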