Gene Csal_0548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0548 
Symbol 
ID4027687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp606583 
End bp609369 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content65% 
IMG OID637965716 
ProductDNA polymerase I 
Protein accessionYP_572609 
Protein GI92112681 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.965006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCTG ATGTCATGGC CAATACGCCC CCCATCGTTC TCGTCGACGG CTCGTCGTAT 
CTGTATCGGG CTTTTCATGC CTTGCCGCCG CTGACCACGT CCAAGGGGAA CCCCACCGGG
GCGGTGAAGG GCGTGCTCAA CATGCTCAAG AGCCTGATCA AGCAGTATCC ACAAAGTCCC
ATGGCGGTGG TCTTCGATGC CAAGGGCAAG ACCTTCCGCG ACGATATCTA CGCCGAGTAC
AAGGCGCACC GTCCGCCGAT GCCCGACGAT CTGCGCCCGC AGGTCGAGCC CCTGCACGAC
TGCATTCGCG CGCTGGGCCT GCCGCTGTTG TGCATCGAGG GCGTCGAGGC CGACGACGTG
ATCGGCACCC TGGCACGCCA GGCCACCGAG GCGGGGCGCG ACGCGGTGAT TTCCACCGGT
GACAAGGACA TGGCGCAGCT CGTCAATGCC CACATCACGC TGGTCAACAC CATGAAGGGC
GAGACGCTCG ACGTGGCCGG CGTCGAGGAA AAATTCGGCA TTCCGCCGTC GCTGGTCATC
GACTTCCTGG CCCTGATGGG CGACAAGGTC GACAATATCC CCGGCGTGCC CGGCGTCGGC
GAGAAGACGG CGCTCGGCCT GCTGCAAGGC ATGCAGGGCG GGCTGGACAC CATCTACGCC
GACCTCGAGC GTGTCACCAC GCTGTCGTTT CGCGGCGCCA AGACGATGCC CAAGAAGCTC
GAGGCCAACC GTGAGCAGGC CTTCCTGTCG TATCAACTGG CCACCATCAA GACCGACTGC
GAGCTGCCGG TGGGGCTCGA TGACCTGGAT ATCGCGCACC CCGACCGCGA GGCGCTCAAG
ACGCTGTACA CCGAGCTGGA ATTCAAGAAC TGGCTGAACG AGCTGCTCGA GGGGCGCGAC
GAGGGCGTCG ACGATGTCGG TTCGGGCGAT GCGGTGGATG CCGCGCAGGG GCCCGTGGCG
AGTACTGCCG AGGCGACGTC GCGCACGGAT CACGTCATCG TCACCCGCGA GGCCTTCGAT
GCCTGGCTTG CGCGCCTGGG CGAGGCGGAC ATCTTCTGCT TCGACCTGGA AACCACCAGC
CTCAATTACA TGGAGGCCGA TATCGTGGGA ATCGGCCTGT CGCTGGACGC CGGCGAAGCG
GCCTACATCC CGGTGGCGCA CCGCTATCTC GACGCTCCCG AGCAGCTCGA CCGCGCGTCG
GTGCTCGCCG CGCTCAAGCC GCTCTGGGAA GACCCCGCCA AGGCCAAGAT CGGCCAGAAC
CTCAAGTACG ACATTTCCGT CCTGGCACGC TACGACATCG AGGTCGCGGG ACGGCTCGAG
GACACCATGC TGGCATCCTA CGTGCTCAAT GCCACGGCGA CGCGGCACGA CATGGACTCG
CTGGCGCTCA AGTACCTCGG CGAGAAGACC ATTTCCTTCG AGGAGATCGC CGGCAAGGGG
GCCAAGCAGT TGACCTTCGA CCAGATTGCA CTGGAGCAGG CTGCCCCCTA CGCCTGCGAG
GACGTCGACA TCACCTTGCG GCTGCACCGG GAACTGCGCC CGCGCGTGGA TGGCGAGGGC
CGGCTGGCGG CGGTGCTGGA CGACATCGAA CTGCCCCTGG TGCCGGTGCT CTCGCGCATG
GAGCGCAACG GGGTGGCACT GGATGCCGAG CGCCTGCACG CGCAGAGTCG CGAGCTGGAG
AAGCGCCTGC GGGAACTGGA AACCCGCGCC TATGAGCTGG CCGGACGCGA GTTCAATCTC
GGCTCGCCCA AGCAGCTCGG CGAGATTCTC TTCGATGAGC TCAAGATCCC GGTGATCAAG
AAGACGCCCA AGGGCGCGCC CAGCACCGCC GAGGCGGTGC TCGAGGAACT GGCGCTGGAT
TACCCCTTGC CCAAGGTGAT CATCGAGCAT CGCGGCTTCG CCAAGCTGAA GTCGACCTAC
ACCGACAAGC TGCCGCAACT GGTCAATGCC ACCACGCGGC GGCTGCATAC CAGCTATCAC
CAGGCCGTGA CGGCCACCGG GCGCCTGTCG TCGTCCGACC CCAACCTGCA GAACATTCCC
ATCCGTACCG AAGAAGGCCG CAAGATCCGC CAGGCCTTCG TCGCGCGCCC CGGCTACCGC
ATCGTCGCTG CCGACTATTC GCAGATCGAG CTGCGCATCA TGGCGCATCT TTCCGGCGAC
AAGGGACTGC TCGATGCCTT CGCCGAAGGA CGCGACATCC ATACCGCCAC CGCGGCGGAA
GTGTTCGGCG TGGCGCTCGA CGCCGTCAGC GGCGAGCAAC GGCGCAGCGC CAAGGCCATC
AACTTCGGTC TGATCTACGG CATGAGCGCC TGGGGACTGG GGCGCCAGCT GCACATCGAG
CGCAATCAGG CGCAGACCTA CATCGACCGC TACTTCGATC GTTACCCCGG CGTGGCGCGC
TTCATGGAAC GCATTCGTGC CCAGGCCGCC GACGACGGTT ACGTCGAGAC GGTCTTCGGA
CGTCGCCTCT ATCTGCCCGA GATCAATGCC CAGAACCGGA CCCGGCGTCA GGCTGCCGAG
CGCACCGCCA TCAATGCGCC GATGCAGGGG ACCGCCGCCG ATATCATCAA GCTGGCGATG
ATCGATGTCG ACCGCTGGTT ACGCGAAGGC GACTTCGACG CCTGGATGGT GATGCAGGTT
CACGACGAAC TGGTCTTCGA GGTCAAGGAG GCGCAGGTCG ATGCCTTCAC CGACGCCGTT
CGCCAGCGCA TGGAAGGCGC CGCCAAGCTT GACGTGCCGC TGACCGTCGA AGCCAACGCC
GGCGACAACT GGGACGAAGC GCATTGA
 
Protein sequence
MDADVMANTP PIVLVDGSSY LYRAFHALPP LTTSKGNPTG AVKGVLNMLK SLIKQYPQSP 
MAVVFDAKGK TFRDDIYAEY KAHRPPMPDD LRPQVEPLHD CIRALGLPLL CIEGVEADDV
IGTLARQATE AGRDAVISTG DKDMAQLVNA HITLVNTMKG ETLDVAGVEE KFGIPPSLVI
DFLALMGDKV DNIPGVPGVG EKTALGLLQG MQGGLDTIYA DLERVTTLSF RGAKTMPKKL
EANREQAFLS YQLATIKTDC ELPVGLDDLD IAHPDREALK TLYTELEFKN WLNELLEGRD
EGVDDVGSGD AVDAAQGPVA STAEATSRTD HVIVTREAFD AWLARLGEAD IFCFDLETTS
LNYMEADIVG IGLSLDAGEA AYIPVAHRYL DAPEQLDRAS VLAALKPLWE DPAKAKIGQN
LKYDISVLAR YDIEVAGRLE DTMLASYVLN ATATRHDMDS LALKYLGEKT ISFEEIAGKG
AKQLTFDQIA LEQAAPYACE DVDITLRLHR ELRPRVDGEG RLAAVLDDIE LPLVPVLSRM
ERNGVALDAE RLHAQSRELE KRLRELETRA YELAGREFNL GSPKQLGEIL FDELKIPVIK
KTPKGAPSTA EAVLEELALD YPLPKVIIEH RGFAKLKSTY TDKLPQLVNA TTRRLHTSYH
QAVTATGRLS SSDPNLQNIP IRTEEGRKIR QAFVARPGYR IVAADYSQIE LRIMAHLSGD
KGLLDAFAEG RDIHTATAAE VFGVALDAVS GEQRRSAKAI NFGLIYGMSA WGLGRQLHIE
RNQAQTYIDR YFDRYPGVAR FMERIRAQAA DDGYVETVFG RRLYLPEINA QNRTRRQAAE
RTAINAPMQG TAADIIKLAM IDVDRWLREG DFDAWMVMQV HDELVFEVKE AQVDAFTDAV
RQRMEGAAKL DVPLTVEANA GDNWDEAH