Gene Sbal223_4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_4290 
Symbol 
ID7089092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp5096132 
End bp5098897 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content49% 
IMG OID643463164 
ProductDNA polymerase I 
Protein accessionYP_002360179 
Protein GI217975428 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000095151 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000056057 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAACCG TCGCTAAAAA CCCACTTGTG CTTGTGGATG GATCTTCCTA TTTATATCGC 
GCTTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCAACCGG TGCCGTTTAT
GGCGTAGTGA ATATGTTACG CAGCTTGCTG ACTCGCTATC AGCCGAGCCA TATCGCAGTC
GTGTTCGATG CCAAGGGCAA AACCTTCCGC AATGACATGT ACAGCGAGTA CAAAGCACAG
CGCCCACCTA TGCCTGATGA CCTACGTTCG CAAATCGAAC CTTTACACCG CATTATTCGT
GCACTGGGTT TACCACTGAT TTCGATTTCC GGCGTCGAAG CCGATGACGT GATCGGCACT
ATTGCTCGCC AAGCCAGTTT AGAAAACCGT GCAGTGCTTA TCAGCACTGG CGATAAAGAC
ATGGCGCAGT TAGTCGATGA GAACGTCACG CTCATCAACA CCATGACAGA CACCATAATG
GGCCCAGAAG AAGTCGCGAT TAAATTTGGT GTTGGTCCAG ACCGTATCAT AGATTTGCTC
GCGCTGATGG GCGACAAGGC CGATAACATT CCCGGCTTGC CCGGCGTTGG TGAGAAAACA
GCGCTCGCTA TGCTCACAGG AGCCGGCAGC GTGAGTAACT TATTAGCGGA ACCCGAAAAA
GTCGCCGAGC TCGGCTTTAG GGGTTCTAAA ACCATGGCGG CGAAGATCAT TGAAAATGCC
GACATGCTCA AGCTTTCTTA CGAACTCGCC ACCATCAAAA CCGATGTCGA ACTTGAGCAA
GACTGGCACG AACTCACCAT CAAGCCTGCG GACAAAGATG AACTGATCAA ATGCTACGGC
GAGATGGAGT TTAAGCGTTG GTTAGCCGAA GTCTTAGATA ATAAAATCAC TGCAAATACC
CCAATTGATG CGGCATCCGA GACACAGGAA GACTCAACTC CAGTCGAAGC GATTGCAACG
CAGTACGACT GCATTCTCAC GGAAGCCGAA TTAGACGCTT GGATTGCTAA GCTTAAGCAA
GCCGACTTGA TGGCGGTAGA CACTGAAACC ACCAGCTTAG ATTACATGGT GGCTGAATTA
GTCGGGATCT CCTTCGCCGT TGAGGCAGGA AAAGCCGCTT ATCTGCCTTT GACCCATGAT
TATGTTGGCG CACCGACTCA AATCGATAAA ACCGTCGCGC TAGAAAAACT GCGTCCACTG
CTTGAAGATC CAAAACTTAA GAAAGTCGGT CAAAATCTTA AGTACGACAT CAGTATCTTA
GCCAATGCGG GTATCAAACT GCAGGGCGTC GCTTTCGACA CTATGCTCGA ATCCTATGTT
TTCAACTCAG TAGCTTCGCG CCATGATATG GATGGCTTAG CGCTCAAGTA CTTAGGCCAT
AAGAATATCA GCTTCGAAGA AATCGCTGGC AAAGGTGCAA AACAGCTGAC CTTCAACCAA
ATTCCACTAG AAACAGCTGC GCCTTATGCG GCCGAAGATG CCGACATCAC CCTGCGTTTA
CACCAACATT TGTGGCCAAG ACTTGAAAAA GAAGCGGAAC TGGCCGCTAT GTTTACCGAA
GTCGAATTGC CGCTGATCCA AGTATTGTCG GATATTGAAC GCCAAGGGGT ATTAATCGAC
AGCATGTTAC TCGGCCAACA AAGCGACGAG CTGGCGCGTA AAATCGATAC CTTAGAAGAA
AAAGCCTACG ACATTGCCGG TGAGAAATTT AACCTTGGCT CGCCTAAGCA ACTGCAAGTA
TTGTTTTTTG AAAAGCTAGG TTATCCGATC ACCAAAAAGA CCCCCAAGGG CGCACCATCG
ACCGCGGAAG AAGTGTTGGT CGAATTGGCG TTAGATTTCC CTTTGCCAAA GGTGATCCTC
GAACATCGCA GCCTATCTAA ATTAAAGAGC ACTTACACAG ATAAACTGCC ACTAATGGTC
AATGCTAAAA CGGGTCGCGT GCACACTAGC TATCATCAAG CTAATGCGGC AACAGGACGC
TTATCATCGA GCGAGCCTAA CCTGCAGAAT ATTCCTATCC GCACCGAAGA AGGTCGCCGT
ATTCGCCAAG CCTTTATCGC ACCTGATGGT CGTAAAATTT TGGCAGCCGA CTACTCGCAA
ATCGAACTAC GGATCATGGC CCATTTATCC CAAGATGCCG GTTTACTCAA AGCCTTCGCC
GAGGGCAAAG ACATTCACAG AGCTACGGCT GCCGAAGTGT TTGGCACGGA CTTTGATGAA
GTAACCACAG AACAGCGTCG CCGGGCCAAA GCGGTTAACT TCGGCTTAAT TTACGGCATG
TCAGCCTTTG GTTTAGCGCG TCAGCTCGAT ATCCCGCGCC ATGAAGCGCA AACCTATATC
GACACCTACT TTGCCCGCTA TCCTGGCGTA TTACGTTATA TGGAAGAGAC GCGTGCGGGG
GCTGCCGACC TAGGTTATGT ATCGACTCTG TTTGGCCGTC GCCTCTATTT ACCCGAAATC
CGTGATCGTA ATGCTATGCG CCGCCAAGGT GCTGAACGTG CTGCGATTAA TGCCCCAATG
CAAGGCACAG CTGCCGACAT CATCAAAAAG GCCATGATCA ATATCGCCCA GTGGATCAAG
ACAGAAACCC AAGGCGAAAT CACTATGATC ATGCAGGTTC ACGATGAATT GGTGTTTGAA
GTCGATGCAG ATAAAGCAGA AGCGCTTAAA AAGACAATCT GTACTTTAAT GGCACAAGCC
GCCGATCTCG ATGTTGAACT GCTTGCCGAA GCGGGCATTG GTAATAACTG GGATGAAGCC
CACTAA
 
Protein sequence
MPTVAKNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL TRYQPSHIAV 
VFDAKGKTFR NDMYSEYKAQ RPPMPDDLRS QIEPLHRIIR ALGLPLISIS GVEADDVIGT
IARQASLENR AVLISTGDKD MAQLVDENVT LINTMTDTIM GPEEVAIKFG VGPDRIIDLL
ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VSNLLAEPEK VAELGFRGSK TMAAKIIENA
DMLKLSYELA TIKTDVELEQ DWHELTIKPA DKDELIKCYG EMEFKRWLAE VLDNKITANT
PIDAASETQE DSTPVEAIAT QYDCILTEAE LDAWIAKLKQ ADLMAVDTET TSLDYMVAEL
VGISFAVEAG KAAYLPLTHD YVGAPTQIDK TVALEKLRPL LEDPKLKKVG QNLKYDISIL
ANAGIKLQGV AFDTMLESYV FNSVASRHDM DGLALKYLGH KNISFEEIAG KGAKQLTFNQ
IPLETAAPYA AEDADITLRL HQHLWPRLEK EAELAAMFTE VELPLIQVLS DIERQGVLID
SMLLGQQSDE LARKIDTLEE KAYDIAGEKF NLGSPKQLQV LFFEKLGYPI TKKTPKGAPS
TAEEVLVELA LDFPLPKVIL EHRSLSKLKS TYTDKLPLMV NAKTGRVHTS YHQANAATGR
LSSSEPNLQN IPIRTEEGRR IRQAFIAPDG RKILAADYSQ IELRIMAHLS QDAGLLKAFA
EGKDIHRATA AEVFGTDFDE VTTEQRRRAK AVNFGLIYGM SAFGLARQLD IPRHEAQTYI
DTYFARYPGV LRYMEETRAG AADLGYVSTL FGRRLYLPEI RDRNAMRRQG AERAAINAPM
QGTAADIIKK AMINIAQWIK TETQGEITMI MQVHDELVFE VDADKAEALK KTICTLMAQA
ADLDVELLAE AGIGNNWDEA H