Gene Information |
Locus tag | Sama_0093 |
Symbol | |
ID | 4602350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella amazonensis SB2B |
Kingdom | Bacteria |
Replicon accession | NC_008700 |
Strand | - |
Start bp | 104678 |
End bp | 107449 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639779405 |
Product | DNA-directed DNA polymerase |
Protein accession | YP_925975 |
Protein GI | 119773235 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
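The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(104678, 107449)
print(length, protein_length(length))  # prints "2772 923"
```

The result matches the Gene Length (2772 bp) and Protein Length (923 aa) fields.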
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00183207 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0262318 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGATA TTGCCAGCAA CCCGCTCATC CTTGTGGATG GTTCTTCCTA TCTTTATCGC GCCTATTATG CACCCCCACA TCTAACCAAT TCCAAGGGCG AAGCCACTGG TGCCGTTTAT GGTGTGGTTA ACATGCTGCG CAGTCTGCTG TCCCGCTACC GCCCAACGCA AATGGCGGTG GTATTTGATG CCAAGGGCCC CACATTCCGC AATGAAATGT ATCAGGAATA CAAAGCCCAT CGCCCTCCCA TGCCGGACGA CCTGCGCAGC CAGATTGCGC CGCTGCACCG TATTATCAAG GCTCTGGGCA TTCCCCTTAT CAGCATTCCC GGCGTAGAAG CCGACGACGT GATTGGTACT ATTGCCATTC AGGCCGGGAA CGAAGGCCGC AGCGTGCTCA TCAGCACCGG CGACAAAGAC ATGGCGCAGC TGGTTAATGA ACACATCACG CTTATTAACA CCATGACCGA CACCATACTT GGCCCAGAAG AAGTGGCCAC TAAGTTTGGT GTGGGGCCAG AGTTGATTAT CGATCTGCTG GCGATGATGG GCGATAAAGC AGATAACATT CCAGGCCTGC CAGGAGTGGG AGAGAAAACC GCACTCGCCA TGCTGACCGG TGCCGGTGGC GTAGCTAAAC TGCTCGCCGA GCCGGACTGC GTTACAGACC TGGGCTTTCG CGGCGCCAAA ACCATGGCAG CCAAAATCCG TGACAATGCC GAAATGCTGG AGCTGTCTTA CAAGCTCGCC ACCATTAAAA CGGATGTGAC ACTTGAGCAG GATTGGCATC AGTTGTCTCT CAGCGAGCCG AACCGCGACG AGCTCATCGC CTGCTACGGT GAGATGGAGT TCAAGCGCTG GCTGGCGGAA GTGCTGGACA ACAAGTCGCC TTTGGGCACC AAAGCAGCTG CGCCGCAGGC AAATCAGGAG ACTACCGAAT CACCAGCCGC TGCCGTCATC GAAACCCAAT ACCACACCCT GCTGACCGAA TCTGAACTGG ATGCCTGGCT CGATAAGCTG AGTAAAGCCG AGTTGATTGC TGTGGATACC GAAACTACCA GTCTTGACTA TATGCAAGCC GAGTTGGTGG GGCTGTCGTT TGCCATCGAA GCCGGTAAGG CCGCTTACTT GCCATTGGGG CACGACTATC CCGGCGCACC GTCCCAATTG CCGCGCGAAG CGACTCTCGC CAAGCTCAAG CCGCTGCTTG AAAATCCGGA CATTAAAAAA GTTGGTCAAA ACCTTAAGTA CGATATGAGC GTGCTGGCCA ATGCTGGTAT CCAGCTGAAA GGCGTCGCCT TCGATACCAT GCTTGAATCT TACGTATTCA ACTCAGTGGC CAGCCGCCAC GACATGGACG GACTGGCACT CAAGTATCTG GGGCATAAAA ATATCAGCTT TGAAGAGATC GCCGGCAAAG GTGCCAAACA ACTCACCTTC AACCAAATCG CACTTGAACA AGCTGCCCCC TATGCGGCGG AAGACGCCGA TATCACCTTA AGACTGCATC AACATCTATG GCCGCGCCTC AGCAAAGAAG AGGGTCTCGA GTCCGTGTTT ACCGAACTCG AGCTGCCGCT GATTGAGGTA CTGTCAGATA TTGAACGTCG CGGCGTACTC ATTGACAGTA TGCTGCTTGG GCAGCAGAGT GAAGAACTGG CGCGCAAGAT TGACGAGCTT GAACAACACG CACACGAAAT TGCCGGTGAA CCCTTTAACC TGTCGTCCAC CAAGCAGTTG CAGGAGCTGT TTTTCACCAA GCTGGGTTAT CCGGTCATTA AAAAAACTCC CAAGGGTGCC 
CCTTCCACCG CCGAAGAAGT GCTGGTTGAA TTGGCGCTTG ACTACCCACT GCCCAAAATC ATCCTTGAGC ACCGCAGTCT CACCAAGCTC AAGAGCACTT ATACCGACAA ATTGCCGCTG ATGATCAATG GCCGTACCGG TCGTGTGCAC ACCAGCTACC ATCAGGCCAA TGCCGCTACA GGTCGCCTGT CGTCCAGCGA CCCGAACCTG CAGAATATTC CGATTCGCAC CGAAGAAGGC CGTCGTATCC GTCAAGCCTT TGTGGCGCCC GAAGGCAAAC GCATTCTGGC GGCGGACTAT TCCCAGATTG AACTCAGGAT CATGGCGCAT CTGTCTCAGG ATGAGGGCTT GCTGCGCGCT TTTGCCGAAG GCAAAGACAT TCACCGCGCC ACTGCCGCCG AAGTATTTGA TGTGGCATTT GAAACAGTGA CATCAGAGCA ACGCCGCCGC GCCAAAGCGG TGAACTTCGG CCTTATATAC GGCATGTCGG CCTTCGGCCT GGCCCGTCAG CTGGATATTC CCCGCAACGA GGCCCAAAGC TATATCGATA CCTACTTCAA GCGCTACCCT GGCGTACTCA AATACATGGA GGAGACACGC GCCATGGCTT CTGAGCAGGG TTATGTCAGT ACCCTGTTTG GCCGTCGCCT CTATTTGCCG GAAATCCGTG ACCGTAATGC CATGCGCAGG CAGGCTGCAG AACGTGCCGC CATTAACGCG CCCATGCAGG GTACGGCTGC AGACATCATC AAGAAGGCGA TGATTGCGAT CCAGCATTGG ATTAAGACCG AGACTCAGGG CGAAATCCAG ATGATTATGC AGGTGCACGA TGAACTGGTA TTCGAGGTGG ATGCCGATAA GGCCGATGCC CTGAAAGAGA AGGTATGCGA GTTGATGGCT GCGGCGGCAA GCCTCGATGT GCCATTGCTG GCCGAAGCCG GTCTGGGTGA TAACTGGGAT CAGGCCCACT GA
|
Protein sequence | MPDIASNPLI LVDGSSYLYR AYYAPPHLTN SKGEATGAVY GVVNMLRSLL SRYRPTQMAV VFDAKGPTFR NEMYQEYKAH RPPMPDDLRS QIAPLHRIIK ALGIPLISIP GVEADDVIGT IAIQAGNEGR SVLISTGDKD MAQLVNEHIT LINTMTDTIL GPEEVATKFG VGPELIIDLL AMMGDKADNI PGLPGVGEKT ALAMLTGAGG VAKLLAEPDC VTDLGFRGAK TMAAKIRDNA EMLELSYKLA TIKTDVTLEQ DWHQLSLSEP NRDELIACYG EMEFKRWLAE VLDNKSPLGT KAAAPQANQE TTESPAAAVI ETQYHTLLTE SELDAWLDKL SKAELIAVDT ETTSLDYMQA ELVGLSFAIE AGKAAYLPLG HDYPGAPSQL PREATLAKLK PLLENPDIKK VGQNLKYDMS VLANAGIQLK GVAFDTMLES YVFNSVASRH DMDGLALKYL GHKNISFEEI AGKGAKQLTF NQIALEQAAP YAAEDADITL RLHQHLWPRL SKEEGLESVF TELELPLIEV LSDIERRGVL IDSMLLGQQS EELARKIDEL EQHAHEIAGE PFNLSSTKQL QELFFTKLGY PVIKKTPKGA PSTAEEVLVE LALDYPLPKI ILEHRSLTKL KSTYTDKLPL MINGRTGRVH TSYHQANAAT GRLSSSDPNL QNIPIRTEEG RRIRQAFVAP EGKRILAADY SQIELRIMAH LSQDEGLLRA FAEGKDIHRA TAAEVFDVAF ETVTSEQRRR AKAVNFGLIY GMSAFGLARQ LDIPRNEAQS YIDTYFKRYP GVLKYMEETR AMASEQGYVS TLFGRRLYLP EIRDRNAMRR QAAERAAINA PMQGTAADII KKAMIAIQHW IKTETQGEIQ MIMQVHDELV FEVDADKADA LKEKVCELMA AAASLDVPLL AEAGLGDNWD QAH
|
| |
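The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It assumes the gene sequence is already listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean_sequence`, `translate`, and `gc_percent` are hypothetical, and only the first and last blocks of the sequence are used here for brevity.

```python
from itertools import product

# Codon assignments for translation table 11, in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(seq: str) -> str:
    """Drop the spaces and line breaks between 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased; this
    record starts with ATG, so that does not matter here.
    """
    dna = clean_sequence(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGCCTGATA TTGCCAGCAA CCCGCTCATC"))  # prints "MPDIASNPLI"
print(translate("CAGGCCCACT GA"))                     # prints "QAH"
```

These fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.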