Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewana3_4109 |
Symbol | |
ID | 4480323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. ANA-3 |
Kingdom | Bacteria |
Replicon accession | NC_008577 |
Strand | + |
Start bp | 4931315 |
End bp | 4934083 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639728724 |
Product | DNA polymerase I |
Protein accession | YP_871732 |
Protein GI | 117922540 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000411128 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.224079 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTACCA TAGCCAATAA CCCACTTGTC CTTGTGGATG GATCTTCTTA TTTATATCGC GCCTATTATG CGCCTCCTCA CCTGACAAAC TCAAAGGGCG AAGCTACTGG TGCTGTTTAT GGCGTAGTGA ATATGCTCCG CAGCTTATTA AGCCGTTATC AACCTAGCCA TATCGCTGTG GTGTTCGATG CTAAAGGCAA AACCTTCCGC AATGACTTAT ATGAAGAATA CAAGGCACAT CGCCCGCCTA TGCCGGATGA CCTGCGCTCA CAAATTGAGC CACTACACCG TATTATCCGT GCCTTAGGCC TGCCCTTAAT CTCTATTCCT GGTGTTGAGG CGGACGATGT TATCGGCACA ATCGCTCGCC AAGCGAGCCG CGAAAACCGC GCTGTACTCA TCAGCACTGG TGATAAAGAC ATGGCGCAGC TGGTTGATGA AAATATCACG CTGATCAACA CCATGACAGA TACCATTATG GGCCCTGAAG AAGTTGCGGC TAAATATGGT GTAGGTCCAG ACAGAATTAT CGATTTCTTA GCGCTGATGG GCGATAAGGC GGATAACATT CCCGGTTTAC CTGGTGTTGG CGAAAAAACC GCATTAGCTA TGCTCACGGG GGCGGGTAGT GTCGCCAATT TGCTTGCAGA GCCCGAAAAA GTAACCGAAT TAGGCTTTAG GGGCGCAAAA ACCATGGCGG CGAAAATCAT CGACAATGCC GACATGCTAA AGCTGTCCTA TGAGCTTGCC ACCATTAAAA CCGATGTTGA ACTCGAACAA GATTGGCATG AGCTCACCGC CAAACCCGCT GACAGGGACG AACTGATCAA ATGCTACGGC GAGATGGAGT TTAAACGCTG GCTTGCCGAA GTCTTAGATA ATAAGGCGCC AGCGACGGTC GCAGCAAAAG CCGAAACAAC AGAGACCCAA GAAGAGTCAG CGCCCAGCGT CACGATTGAA ACCCAATACG ATACAATTCT GACCGAAGCT CAGCTTGATG AGTGGATTGC CAAACTCAAA CAAGCGCCAT TAATGGCCGT AGATACCGAG ACCACCAGCC TCGACTATAT GGTTGCGGAA TTGGTTGGCC TGTCCTTTGC TGTTGAAGCG GGTAAAGCCG CCTATCTGCC CTTAGCCCAC GATTATGTTG GCGCACCTCA ACAATTAGAT AAGCAGACTG CACTCGAAAA ACTGCGCCCC TTACTCGAAG ATGCCAAGAT TAAAAAAGTC GGTCAAAATC TGAAATATGA CATCAGCGTA TTAGCCAATG CAGGCATAAA ACTCCAAGGC GTGGTATTCG ACACTATGCT CGAATCCTAT GTGTTTAACT CGATCGCCTC ACGCCATGAT ATGGATGGGT TGGCGCTAAA ATATCTGGGC CATAAAAATA TCGCCTTTGA AGATATCGCA GGTAAAGGTG CTAAACAGCT GACCTTCAAC CAAATTCCGT TGGAAACAGC TGCGCCCTAT GCGGCGGAAG ATGCCGATAT TACCCTACGT CTACATCAAC ATTTGTGGCC AAGACTCGAA AAAGAGACCG AATTAGCCTC GGTCTTTACC GATATTGAAC TGCCGCTGAT CCAAATACTG TCCGATATTG AACGCCAAGG TGTGTTTATC GATAGTATGT TGCTCGGCCA ACAGAGTGAT GAACTTGCCC GCAAAATCGA TGAGTTAGAA ACAAAAGCTT ATGATATTGC AGGTGAAAAA TTCAATTTAA GCTCACCAAA GCAACTACAA GTGCTGTTTT TTGAAAAGCT GGGTTATCCG GTCATCAAAA AAACCCCTAA GGGCGCCCCC TCTACCGCGG AAGAAGTACT GGTTGAGTTG GCATTGGATT TCCCTCTGCC TAAAGTGATC CTTGAACATA GAAGCCTAAC CAAGCTAAAG AGTACTTACA CCGACAAGCT CCCTCTAATG GTGAACGCGA AAACGGGTCG GGTACACACA AGCTACCATC AGGCCAACGC CGCAACGGGG CGTTTGTCCT CGAGCGAACC AAACCTACAG AATATTCCTA TCCGCACCGA GGAAGGTCGT CGTATTCGCC AAGCCTTTAT TGCGCCGCAG GGACGTAAGA TTTTGGCCGC CGACTATTCG CAGATTGAAT TACGCATCAT GGCGCATTTA TCCCAAGATG CGGGCTTACT TAAAGCCTTC GCCGAAGGTA AAGACATTCA CAGAGCCACC GCCGCCGAAG TATTTGGCAC CGACTTTGAC AGTGTCACCT CGGAGCAGCG TCGCCGCGCC AAAGCCGTTA ACTTTGGCCT TATCTATGGC ATGTCCGCCT TTGGATTGGC GCGTCAGCTC GATATTCCCC GCAACGAGGC ACAAACTTAC ATCGACACTT ACTTCGCCCG CTATCCAGGC GTATTAAGGT ATATGGAAGA AACACGAGCC AGTGCAGCAG AACTTGGCTA TGTCTCTACG CTATTTGGGC GCCGTCTCTA TTTACCTGAA ATTCGCGATC GTAATGCAAT GCGCCGCCAA GCAGCAGAAA GAGCCGCGAT TAACGCCCCA ATGCAAGGCA CCGCCGCGGA TATTATTAAA AAAGCCATGA TCAGCATTGC CGATTGGATA AAAACCGATA CCCAAGGTGA AATCGCCATG ATCATGCAAG TCCACGACGA ATTAGTATTC GAAGTCGATG CCGATAAAGC CGAAACACTC AAGCTCAAGG TGTGTGAACT CATGGCAAAA GCAGCCAATC TGGACGTGGA ACTTCTGGCA GAAGCTGGTA TTGGCGATAA CTGGGATCAA GCCCACTAG
|
Protein sequence | MPTIANNPLV LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL SRYQPSHIAV VFDAKGKTFR NDLYEEYKAH RPPMPDDLRS QIEPLHRIIR ALGLPLISIP GVEADDVIGT IARQASRENR AVLISTGDKD MAQLVDENIT LINTMTDTIM GPEEVAAKYG VGPDRIIDFL ALMGDKADNI PGLPGVGEKT ALAMLTGAGS VANLLAEPEK VTELGFRGAK TMAAKIIDNA DMLKLSYELA TIKTDVELEQ DWHELTAKPA DRDELIKCYG EMEFKRWLAE VLDNKAPATV AAKAETTETQ EESAPSVTIE TQYDTILTEA QLDEWIAKLK QAPLMAVDTE TTSLDYMVAE LVGLSFAVEA GKAAYLPLAH DYVGAPQQLD KQTALEKLRP LLEDAKIKKV GQNLKYDISV LANAGIKLQG VVFDTMLESY VFNSIASRHD MDGLALKYLG HKNIAFEDIA GKGAKQLTFN QIPLETAAPY AAEDADITLR LHQHLWPRLE KETELASVFT DIELPLIQIL SDIERQGVFI DSMLLGQQSD ELARKIDELE TKAYDIAGEK FNLSSPKQLQ VLFFEKLGYP VIKKTPKGAP STAEEVLVEL ALDFPLPKVI LEHRSLTKLK STYTDKLPLM VNAKTGRVHT SYHQANAATG RLSSSEPNLQ NIPIRTEEGR RIRQAFIAPQ GRKILAADYS QIELRIMAHL SQDAGLLKAF AEGKDIHRAT AAEVFGTDFD SVTSEQRRRA KAVNFGLIYG MSAFGLARQL DIPRNEAQTY IDTYFARYPG VLRYMEETRA SAAELGYVST LFGRRLYLPE IRDRNAMRRQ AAERAAINAP MQGTAADIIK KAMISIADWI KTDTQGEIAM IMQVHDELVF EVDADKAETL KLKVCELMAK AANLDVELLA EAGIGDNWDQ AH
|
| |