Gene Sbal195_4485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4485 
Symbol 
ID5756316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp5296781 
End bp5299546 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content49% 
IMG OID641290840 
ProductDNA polymerase I 
Protein accessionYP_001556902 
Protein GI160877586 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00194681 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0107032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCG TCGCTAAAAA CCCACTTGTG CTTGTGGATG GATCTTCTTA TTTATATCGC 
GCTTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCAACCGG TGCCGTTTAT
GGCGTAGTGA ATATGTTACG CAGCTTGCTG ACTCGCTATC AGCCGAGCCA TATCGCAGTC
GTGTTCGATG CCAAGGGCAA AACCTTCCGC AATGACATGT ACAGCGAGTA CAAAGCACAG
CGCCCACCAA TGCCTGATGA CCTACGTTCG CAAATCGAAC CTTTACACCG CATTATTCGC
GCACTGGGTT TACCACTGAT TTCGATTTCC GGCGTCGAAG CCGATGACGT GATCGGCACT
ATTGCTCGCC AAGCCAGTTT AGAAAACCGT GCAGTGCTTA TCAGCACTGG CGATAAAGAC
ATGGCGCAGT TAGTCGATGA GAACGTCACG CTCATCAACA CCATGACAGA CACCATAATG
GGCCCAGAAG AAGTCGCGAT TAAATTTGGT GTTGGTCCAG ACCGTATCAT AGATTTGCTC
GCGCTGATGG GCGACAAGGC CGACAACATT CCCGGCTTAC CCGGCGTTGG TGAGAAAACT
GCGCTCGCTA TGCTCACAGG AGCCGGCAGC GTGAGTAATT TATTAGCGGA ACCCGAAAAA
GTCGCCGAGC TCGGCTTTAG GGGTTCTAAA ACCATGGCGG CGAAGATCAT TGAAAATGCC
GATATGCTCA AGCTTTCTTA CGAACTCGCC ACCATTAAAA CCGATGTCGA ACTTGAGCAA
GACTGGCACG AACTCACCAT CAAGCCCGCG GACAAAGATG AACTGATCAA ATGCTACGGC
GAGATGGAGT TTAAGCGTTG GTTAGCCGAA GTCTTAGATA ACAAAATCAC TGCAAATACC
TCAATTGATG CGGCATCCGA GACACAAGAA GACTCAACTC CAGTCGAAGC GATTGCAACG
CAGTACGACT GCATTCTCAC GGAAGCCGAA TTAGACGCTT GGATTGCGAA GCTTAAAGAA
GCCGACTTGA TGGCGGTAGA TACCGAAACC ACCAGCTTAG ATTACATGGT AGCGGAATTA
GTCGGGATCT CCTTCGCCGT TGAGGCAGGA AAAGCCGCTT ATCTGCCTTT GACCCATGAT
TATGTTGGCG CACCGACTCA AATCGATAAA ACCGTCGCGC TAGAAAAACT GCGTCCACTG
CTTGAAGATC CAAAACTGAA GAAAGTCGGT CAAAATCTTA AGTACGACAT AAGTATCTTA
GCCAATGCGG GAATCAAACT GCAGGGCGTC GCTTTCGACA CTATGCTCGA ATCCTATGTC
TTCAACTCAG TAGCTTCGCG CCATGATATG GATGGCTTAG CGCTCAAGTA CTTAGGCCAT
AAGAATATCA GCTTTGAAGA AATCGCCGGT AAAGGTGCAA AACAGCTGAC CTTCAACCAA
ATTCCACTGG AAACGGCTGC GCCTTATGCG GCCGAAGATG CCGACATCAC CCTACGTTTA
CACCAACATT TGTGGCCAAG ACTCGAAAAA GAAGCAGAAC TGGCCGCTAT GTTTACCGAA
GTCGAATTGC CGCTGATCCA AGTATTGTCG GATATTGAAC GCCAAGGGGT ATTAATCGAT
AGCATGTTAC TCGGCCAACA AAGCGACGAG CTAGCGCGTA AAATCGATAC CTTAGAAGAA
AAAGCCTACG ACATTGCCGG TGAGAAATTT AACCTTGGCT CGCCTAAGCA ACTGCAAGTA
TTGTTTTTTG AAAAGCTAGG TTATCCGATC ACCAAAAAGA CTCCCAAGGG CGCACCATCA
ACCGCAGAAG AAGTGTTGGT CGAATTGGCA TTAGATTTCC CTTTGCCAAA GGTGATCCTC
GAACATCGCA GCCTATCTAA ATTAAAGAGC ACTTACACAG ATAAACTGCC ACTCATGGTC
AATGCTAAAA CGGGTCGTGT GCACACTAAC TATCATCAGG CCAATGCGGC AACAGGACGC
TTATCATCGA GCGAGCCTAA CCTGCAGAAT ATTCCTATCC GCACCGAAGA AGGTCGCCGT
ATTCGCCAAG CCTTTATCGC ACCTGATGGT CGTAAAATTT TGGCAGCCGA CTACTCGCAA
ATCGAACTGC GGATCATGGC ACATTTATCC CAAGATGCCG GCTTACTCAA AGCCTTCGCC
GAGGGCAAAG ACATTCACAG AGCTACGGCT GCCGAAGTGT TTGGCACGGA CTTTGATGAA
GTAACCACAG AACAGCGTCG CCGCGCCAAA GCGGTTAACT TCGGCTTAAT TTACGGCATG
TCAGCCTTTG GTTTAGCGCG TCAGCTCGAT ATCCCGCGCC ATGAAGCGCA AACCTATATC
GACACCTACT TTGCCCGTTA TCCAGGCGTA TTACGTTATA TGGAAGAGAC GCGTGCGGGG
GCTGCCGACC TAGGTTATGT ATCGACTCTG TTTGGTCGTC GCCTCTACTT GCCCGAAATC
CGTGATCGTA ACGCGATGCG CCGCCAAGGT GCTGAACGTG CTGCGATTAA TGCCCCAATG
CAAGGCACGG CTGCCGACAT CATCAAAAAG GCCATGATCA ATATCGCCCA GTGGATAAAG
ACAGAAACCC AAGGCGAAAT CACTATGATC ATGCAGGTTC ACGATGAATT GGTGTTTGAA
GTCGATGCGG ATAAAGCAGA AGCGCTTAAA AAGACAATCT GCACTTTAAT GGCACAAGCC
GCCGATCTCG ATGTTGAACT ACTTGCCGAA GCGGGCATTG GTAATAACTG GGATGAAGCT
CACTAA
 
Protein sequence
MPTVAKNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL TRYQPSHIAV 
VFDAKGKTFR NDMYSEYKAQ RPPMPDDLRS QIEPLHRIIR ALGLPLISIS GVEADDVIGT
IARQASLENR AVLISTGDKD MAQLVDENVT LINTMTDTIM GPEEVAIKFG VGPDRIIDLL
ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VSNLLAEPEK VAELGFRGSK TMAAKIIENA
DMLKLSYELA TIKTDVELEQ DWHELTIKPA DKDELIKCYG EMEFKRWLAE VLDNKITANT
SIDAASETQE DSTPVEAIAT QYDCILTEAE LDAWIAKLKE ADLMAVDTET TSLDYMVAEL
VGISFAVEAG KAAYLPLTHD YVGAPTQIDK TVALEKLRPL LEDPKLKKVG QNLKYDISIL
ANAGIKLQGV AFDTMLESYV FNSVASRHDM DGLALKYLGH KNISFEEIAG KGAKQLTFNQ
IPLETAAPYA AEDADITLRL HQHLWPRLEK EAELAAMFTE VELPLIQVLS DIERQGVLID
SMLLGQQSDE LARKIDTLEE KAYDIAGEKF NLGSPKQLQV LFFEKLGYPI TKKTPKGAPS
TAEEVLVELA LDFPLPKVIL EHRSLSKLKS TYTDKLPLMV NAKTGRVHTN YHQANAATGR
LSSSEPNLQN IPIRTEEGRR IRQAFIAPDG RKILAADYSQ IELRIMAHLS QDAGLLKAFA
EGKDIHRATA AEVFGTDFDE VTTEQRRRAK AVNFGLIYGM SAFGLARQLD IPRHEAQTYI
DTYFARYPGV LRYMEETRAG AADLGYVSTL FGRRLYLPEI RDRNAMRRQG AERAAINAPM
QGTAADIIKK AMINIAQWIK TETQGEITMI MQVHDELVFE VDADKAEALK KTICTLMAQA
ADLDVELLAE AGIGNNWDEA H