Gene Clim_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0473 
Symbol 
ID6354468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp535392 
End bp538223 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content52% 
IMG OID642668104 
ProductDNA polymerase I 
Protein accessionYP_001942545 
Protein GI189346016 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACG AACGGCAATT CGACTTTTTC AGCCCGCAGC AGACTCCGGC AGAGCCGGAC 
GACTCCGCTG CCCCACAACA AAAGCCCGGG CTCTTTCTTA TCGACGGCAT GGCCCTGGTT
TACCGTTCCT ACTATGCCCT GCAACAGGCC GGAATGAAAA CCCGCGACGG AATACCTACA
GGAGCCATTC ACGGATTTGC CTCTGCGCTC CTGAAAATTT TCGAAGTGTG GCATCCTGAA
TACCTTGCCG TCACATTCGA CAGCAGGGAG AAAACCTTTC GGCACAACCT CTACGAACCC
TACAAGGCCA ACCGTCCGGC CCCTCCTGAA GATCTTGTCC GGCAGATCGA GGCTATACTT
GAACTGATCG ACGCGTTCTC CATTCCGCTC ATCAAACTAC CCGGTTACGA AGCTGACGAT
CTTATCGGAT CTGCCGTCAA ACGATTCGAG AAACAGTGCT CTATCTACAT CGTTACTCCG
GACAAGGATC TTGCGCAGCT GGTTCACGAG GGAGTAATCA TGCTGAAGCC CTCAAAAAAA
CAGAACGAAC TCGAACCTTT CGGACCAGCG GAAATTATGG CACAGTTCGG CGTTGAACCG
GAACGCTTTA TCGATCTTCT GACCCTTACC GGCGATACTT CCGACAATAT CCCCGGAGCA
AAAGGCATTG GCCCGAAAAC CGCAGCCGCC CTGCTTCTTA AATACGGCAC ACTCGACAAC
ATATACCGCA ACATCGGCGA ACTGACCCCG AAAACACGAG CAAGCCTTGA AGCCTTTCAA
CCTCAGCTCG ATCTGATACG GCAGCTCGTT ACCATCCACT CCGATATCGA TCTCGGGCTG
AATCTCGAAA CGCTTGCATG CAAAGCTCCT GTGCAAGGCA GAATCATGCC CTTCCTTGGA
AAGTACGAGC TGAAAGCTGT CGCCGTTCGC CTGCCGTCCG TTTTTCCCGG TATGCTTTCA
AACGGAACCC CTGCAGAAGA AGAGCGTGAC GAACAAGAAA ACAACAATGC ACCGTCTCTC
GACTCGCCAT TAACCGGTGC GGACTATGCC ATCATCGATA CCCCAGAAGG GGTTGAAAAG
CTTGTCCGGC AACTCGAGCA GGCCGACTCG TTTGCCATTG ACACCGAAAC GACAAGCCTC
GACACCTTTC AGGCGGAACT GGCAGGTATC TCTTTTTCGC TCAAACCCAA AGAAGCCCGA
TTTATCTATT TCGGAAAAGG CGGCCTCGAC ATAGGCAACA CTCTCGAACG ACTCAAGCCT
GTTCTTGAAA ATCCCGATAT CCGCAAAACC GGTCAGAACC TCAAGTATGA TCTGCTCGTT
CTGAAAAATT ACGGCATATC GCTTACACCG GTTGCATTCG ACACCATGCT GGCAAGCTAC
GTGCTCGATC CGGAGGAAAA ACACAACCTC GACGATCTCG CAGCCCGCCA TCTCTCCATC
AGAACGACAA CCTTTGACGA ACTGACCGCC GGTGAAGGGA AAAAACGAAC CCATATTCTC
GACGTTCCGT CAGCTCAGCT TTCCGACTAT GGCTGTCAGG ATGCCGATAT CGCTCTCCGG
CTTCAGGAGG TATTCGAAGC CAAGCTTCTC GAAGACCAGA GACTGCTGCA CCTCTGCAGA
ACCATAGAAT TTCCCCTGGT GGAAGTGCTG GCCGGCATGG AATATCAGGG CATTTCCATC
GATACGGCGC ATCTTGAAAA AACAGCGGAT CTCGTCAATG CGCAGATAGG CGATCTCTCC
CGAAAAATCC ATGCGGCAGC AGGCACGCCC TTCAATCTCG ACTCGCCCAA ACAACTTTCC
GGCATTCTCT TCAACACGCT GGGACTGCCA ACGAAAAAAA CCACGAAAAC CGGTTTTTCG
ACCAATGTCG AGGTACTTGA AGAGCTTGCC CCGCTGCACC CGATAATCGG CGACCTGCTC
GAATACCGGA GTCTCCAGAA ACTCAAGACC ACCTATATCG ACGCCCTGCC GAAAATGATC
AATCCCGGAA CGGGAAGACT GCACACCTCG TTCAACCAGC ATGTCACCGC AACCGGCAGG
CTCTCGTCAT CCAATCCGAA CCTGCAGAAC ATTCCGGTAC GGACTACTGC CGGCAAAGAG
ATCAGAAGGG CATTCATACC CTCGAATCCC GATAATCTGA TACTCTCCGC CGATTATTCC
CAGATCGAGC TCCGGATTGC CGCCGAAATT TCAGGCGACG AAAAACTCAT TGAAGCATTC
CGCAACCGTG AAGATATCCA TCGAGCCACG GCAAAAGTCA TCTTCAGTAC CGACGAAATC
ACTTCCGATA TGCGCCGCAA GGCAAAAGAG GTCAACTTCG GCGTACTGTA CGGCATTCAG
CCATACGGTT TATCGAAACG GCTGAACATT CCCCAATCGG AAGCAAAAGC AATTATCGAA
ACCTATACCA CGAAATATCC CGGTCTGTTC CACGCATTGC TGGAGATCAT CGAAGAAGGA
AAACGAAAAG GATACGTAAC AACCCTCCTC GGAAGAAGAC GCTATATTCC CGATCTGAAC
AGCCGTAACG GAAACATTCA GAAAGCTGCC GGCCGGGCGG CCATGAACAC TCCTATTCAG
GGCACGGCCG CCGACATCAT CAAATGCGCG ATGAATCTCT GCGACCGGCA GATAAAAAAG
AACAACATGA ACTCGGTTAT GCTGCTTCAG GTTCACGACG AACTGCTCTT TGAAACAACT
GCCGAAGAAA AAAACCGGCT TTCGAAACTG GTTGAAACCG CTATGATCGA AGCCGCAGTC
TATTGCGGCC TTAAAAAAGT ACCTGTCGAA GTAGATACCG GCACAGGAAA AAACTGGCTG
GAGGCTCATT AG
 
Protein sequence
MTNERQFDFF SPQQTPAEPD DSAAPQQKPG LFLIDGMALV YRSYYALQQA GMKTRDGIPT 
GAIHGFASAL LKIFEVWHPE YLAVTFDSRE KTFRHNLYEP YKANRPAPPE DLVRQIEAIL
ELIDAFSIPL IKLPGYEADD LIGSAVKRFE KQCSIYIVTP DKDLAQLVHE GVIMLKPSKK
QNELEPFGPA EIMAQFGVEP ERFIDLLTLT GDTSDNIPGA KGIGPKTAAA LLLKYGTLDN
IYRNIGELTP KTRASLEAFQ PQLDLIRQLV TIHSDIDLGL NLETLACKAP VQGRIMPFLG
KYELKAVAVR LPSVFPGMLS NGTPAEEERD EQENNNAPSL DSPLTGADYA IIDTPEGVEK
LVRQLEQADS FAIDTETTSL DTFQAELAGI SFSLKPKEAR FIYFGKGGLD IGNTLERLKP
VLENPDIRKT GQNLKYDLLV LKNYGISLTP VAFDTMLASY VLDPEEKHNL DDLAARHLSI
RTTTFDELTA GEGKKRTHIL DVPSAQLSDY GCQDADIALR LQEVFEAKLL EDQRLLHLCR
TIEFPLVEVL AGMEYQGISI DTAHLEKTAD LVNAQIGDLS RKIHAAAGTP FNLDSPKQLS
GILFNTLGLP TKKTTKTGFS TNVEVLEELA PLHPIIGDLL EYRSLQKLKT TYIDALPKMI
NPGTGRLHTS FNQHVTATGR LSSSNPNLQN IPVRTTAGKE IRRAFIPSNP DNLILSADYS
QIELRIAAEI SGDEKLIEAF RNREDIHRAT AKVIFSTDEI TSDMRRKAKE VNFGVLYGIQ
PYGLSKRLNI PQSEAKAIIE TYTTKYPGLF HALLEIIEEG KRKGYVTTLL GRRRYIPDLN
SRNGNIQKAA GRAAMNTPIQ GTAADIIKCA MNLCDRQIKK NNMNSVMLLQ VHDELLFETT
AEEKNRLSKL VETAMIEAAV YCGLKKVPVE VDTGTGKNWL EAH