Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0548 |
Symbol | |
ID | 4027687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 606583 |
End bp | 609369 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965716 |
Product | DNA polymerase I |
Protein accession | YP_572609 |
Protein GI | 92112681 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.965006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCTG ATGTCATGGC CAATACGCCC CCCATCGTTC TCGTCGACGG CTCGTCGTAT CTGTATCGGG CTTTTCATGC CTTGCCGCCG CTGACCACGT CCAAGGGGAA CCCCACCGGG GCGGTGAAGG GCGTGCTCAA CATGCTCAAG AGCCTGATCA AGCAGTATCC ACAAAGTCCC ATGGCGGTGG TCTTCGATGC CAAGGGCAAG ACCTTCCGCG ACGATATCTA CGCCGAGTAC AAGGCGCACC GTCCGCCGAT GCCCGACGAT CTGCGCCCGC AGGTCGAGCC CCTGCACGAC TGCATTCGCG CGCTGGGCCT GCCGCTGTTG TGCATCGAGG GCGTCGAGGC CGACGACGTG ATCGGCACCC TGGCACGCCA GGCCACCGAG GCGGGGCGCG ACGCGGTGAT TTCCACCGGT GACAAGGACA TGGCGCAGCT CGTCAATGCC CACATCACGC TGGTCAACAC CATGAAGGGC GAGACGCTCG ACGTGGCCGG CGTCGAGGAA AAATTCGGCA TTCCGCCGTC GCTGGTCATC GACTTCCTGG CCCTGATGGG CGACAAGGTC GACAATATCC CCGGCGTGCC CGGCGTCGGC GAGAAGACGG CGCTCGGCCT GCTGCAAGGC ATGCAGGGCG GGCTGGACAC CATCTACGCC GACCTCGAGC GTGTCACCAC GCTGTCGTTT CGCGGCGCCA AGACGATGCC CAAGAAGCTC GAGGCCAACC GTGAGCAGGC CTTCCTGTCG TATCAACTGG CCACCATCAA GACCGACTGC GAGCTGCCGG TGGGGCTCGA TGACCTGGAT ATCGCGCACC CCGACCGCGA GGCGCTCAAG ACGCTGTACA CCGAGCTGGA ATTCAAGAAC TGGCTGAACG AGCTGCTCGA GGGGCGCGAC GAGGGCGTCG ACGATGTCGG TTCGGGCGAT GCGGTGGATG CCGCGCAGGG GCCCGTGGCG AGTACTGCCG AGGCGACGTC GCGCACGGAT CACGTCATCG TCACCCGCGA GGCCTTCGAT GCCTGGCTTG CGCGCCTGGG CGAGGCGGAC ATCTTCTGCT TCGACCTGGA AACCACCAGC CTCAATTACA TGGAGGCCGA TATCGTGGGA ATCGGCCTGT CGCTGGACGC CGGCGAAGCG GCCTACATCC CGGTGGCGCA CCGCTATCTC GACGCTCCCG AGCAGCTCGA CCGCGCGTCG GTGCTCGCCG CGCTCAAGCC GCTCTGGGAA GACCCCGCCA AGGCCAAGAT CGGCCAGAAC CTCAAGTACG ACATTTCCGT CCTGGCACGC TACGACATCG AGGTCGCGGG ACGGCTCGAG GACACCATGC TGGCATCCTA CGTGCTCAAT GCCACGGCGA CGCGGCACGA CATGGACTCG CTGGCGCTCA AGTACCTCGG CGAGAAGACC ATTTCCTTCG AGGAGATCGC CGGCAAGGGG GCCAAGCAGT TGACCTTCGA CCAGATTGCA CTGGAGCAGG CTGCCCCCTA CGCCTGCGAG GACGTCGACA TCACCTTGCG GCTGCACCGG GAACTGCGCC CGCGCGTGGA TGGCGAGGGC CGGCTGGCGG CGGTGCTGGA CGACATCGAA CTGCCCCTGG TGCCGGTGCT CTCGCGCATG GAGCGCAACG GGGTGGCACT GGATGCCGAG CGCCTGCACG CGCAGAGTCG CGAGCTGGAG AAGCGCCTGC GGGAACTGGA AACCCGCGCC TATGAGCTGG CCGGACGCGA GTTCAATCTC GGCTCGCCCA AGCAGCTCGG CGAGATTCTC TTCGATGAGC TCAAGATCCC GGTGATCAAG AAGACGCCCA AGGGCGCGCC CAGCACCGCC GAGGCGGTGC TCGAGGAACT GGCGCTGGAT TACCCCTTGC CCAAGGTGAT CATCGAGCAT CGCGGCTTCG CCAAGCTGAA GTCGACCTAC ACCGACAAGC TGCCGCAACT GGTCAATGCC ACCACGCGGC GGCTGCATAC CAGCTATCAC CAGGCCGTGA CGGCCACCGG GCGCCTGTCG TCGTCCGACC CCAACCTGCA GAACATTCCC ATCCGTACCG AAGAAGGCCG CAAGATCCGC CAGGCCTTCG TCGCGCGCCC CGGCTACCGC ATCGTCGCTG CCGACTATTC GCAGATCGAG CTGCGCATCA TGGCGCATCT TTCCGGCGAC AAGGGACTGC TCGATGCCTT CGCCGAAGGA CGCGACATCC ATACCGCCAC CGCGGCGGAA GTGTTCGGCG TGGCGCTCGA CGCCGTCAGC GGCGAGCAAC GGCGCAGCGC CAAGGCCATC AACTTCGGTC TGATCTACGG CATGAGCGCC TGGGGACTGG GGCGCCAGCT GCACATCGAG CGCAATCAGG CGCAGACCTA CATCGACCGC TACTTCGATC GTTACCCCGG CGTGGCGCGC TTCATGGAAC GCATTCGTGC CCAGGCCGCC GACGACGGTT ACGTCGAGAC GGTCTTCGGA CGTCGCCTCT ATCTGCCCGA GATCAATGCC CAGAACCGGA CCCGGCGTCA GGCTGCCGAG CGCACCGCCA TCAATGCGCC GATGCAGGGG ACCGCCGCCG ATATCATCAA GCTGGCGATG ATCGATGTCG ACCGCTGGTT ACGCGAAGGC GACTTCGACG CCTGGATGGT GATGCAGGTT CACGACGAAC TGGTCTTCGA GGTCAAGGAG GCGCAGGTCG ATGCCTTCAC CGACGCCGTT CGCCAGCGCA TGGAAGGCGC CGCCAAGCTT GACGTGCCGC TGACCGTCGA AGCCAACGCC GGCGACAACT GGGACGAAGC GCATTGA
|
Protein sequence | MDADVMANTP PIVLVDGSSY LYRAFHALPP LTTSKGNPTG AVKGVLNMLK SLIKQYPQSP MAVVFDAKGK TFRDDIYAEY KAHRPPMPDD LRPQVEPLHD CIRALGLPLL CIEGVEADDV IGTLARQATE AGRDAVISTG DKDMAQLVNA HITLVNTMKG ETLDVAGVEE KFGIPPSLVI DFLALMGDKV DNIPGVPGVG EKTALGLLQG MQGGLDTIYA DLERVTTLSF RGAKTMPKKL EANREQAFLS YQLATIKTDC ELPVGLDDLD IAHPDREALK TLYTELEFKN WLNELLEGRD EGVDDVGSGD AVDAAQGPVA STAEATSRTD HVIVTREAFD AWLARLGEAD IFCFDLETTS LNYMEADIVG IGLSLDAGEA AYIPVAHRYL DAPEQLDRAS VLAALKPLWE DPAKAKIGQN LKYDISVLAR YDIEVAGRLE DTMLASYVLN ATATRHDMDS LALKYLGEKT ISFEEIAGKG AKQLTFDQIA LEQAAPYACE DVDITLRLHR ELRPRVDGEG RLAAVLDDIE LPLVPVLSRM ERNGVALDAE RLHAQSRELE KRLRELETRA YELAGREFNL GSPKQLGEIL FDELKIPVIK KTPKGAPSTA EAVLEELALD YPLPKVIIEH RGFAKLKSTY TDKLPQLVNA TTRRLHTSYH QAVTATGRLS SSDPNLQNIP IRTEEGRKIR QAFVARPGYR IVAADYSQIE LRIMAHLSGD KGLLDAFAEG RDIHTATAAE VFGVALDAVS GEQRRSAKAI NFGLIYGMSA WGLGRQLHIE RNQAQTYIDR YFDRYPGVAR FMERIRAQAA DDGYVETVFG RRLYLPEINA QNRTRRQAAE RTAINAPMQG TAADIIKLAM IDVDRWLREG DFDAWMVMQV HDELVFEVKE AQVDAFTDAV RQRMEGAAKL DVPLTVEANA GDNWDEAH
|
| |