Gene TM1040_2764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2764 
Symbol 
ID4077636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2910652 
End bp2913459 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content61% 
IMG OID638008089 
ProductDNA polymerase I 
Protein accessionYP_614758 
Protein GI99082604 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTA GCAATTTTGG CAAAGGCTGT CACCTCCACC TGATCGATGG CTCCGCCTTC 
ATCTTTCGCG CCTACCACGC GCTGCCGCCG CTGACGCGCA AATCCGACGG GCTGCCCATC
GGCGCGGTCT CGGGGTTCTG CAACATGCTG TTCAAACAGG TCGAGGACAA CAAGGGCCCC
GATGCGCCGA CCCATGTGGC GGTGATCTTT GACCACTCCG GCAAATCGTT TCGCAACGAT
ATGTACGATC AGTACAAGGC CAATCGCCCC CCCGCGCCGG AGGATCTGGT GCCGCAGTTC
CCGCTGACGC GCGAGGCCAC CCGCGCCTTC AACATCGCCT GCAAGGAGAT CGAAGGCTTT
GAGGCCGATG ACATCATCGC TACATTGGCC TGTCAGGCGC GCGAGGCCGG GGGCCGCGTG
ACGATCATCT CGTCAGATAA GGACCTGATG CAGCTGGTCG GGGACGGGGT CGAGATGCTC
GACGCGATGA AGAACAAGCG CATCGACAGC GATGGCGTGC GCGAGAAATT CGGCGTCGGC
CCCGACCGGG TTGTGGATGT GCAGGCGCTT GCGGGCGACT CTGTTGACAA CGTGCCCGGC
GCGCCCGGCA TCGGCATCAA GACCGCCGCG CTCCTGATCA ACGAGTTCGG CTCGCTGGAG
GATCTGCTGG ATCGCGCCGA AGAGATCAAG CAGCCCAAGC GTCGTCAAAC CCTGATCGAG
AAGCGCGACC AGATCGAGAT GTCGAAACGG CTGGTGCAGC TGGATTGCGA CATGGAGCTG
GACTTCACGC TCGACGATCT GGAGGTGCGC GATCCGGATG CAGATACGCT CTTGGGCTTC
CTCGCCGAGA TGGAATTCCG CACCCTGTCC AAGCGAATGG CCGACCAGCT GGGCCGCGAG
GCCCCGACCA TCCCCGAAGC GCCCTCCGCC GCAGCCGCCG CGCTGGAGCT GCCCGAAGCG
CCGGGCTTTG ACAGTGCCGA ATATACCACC GTGCGCGACG CCGAGACCCT TCAGCAGTGG
ATCGATCTCA TTCGTGAGCA TGGCTATGTC GCGGTCGATA CCGAGACCAC CGGCCTTAAC
GAGATGATCG CGGATCTCGT TGGCATCAGC CTATGTGTGG TTCCGGGGCA GGCCTGCTAT
GTGCCGCTCA CACATAAGAC GGGGAACTCC GACGATCTCT TTGGCTCTGA CGATCTGGCC
GAGGGGCAGA TGCCGCTCAA GGACGCGCTG GAGATGCTGA AACCGGTGCT GGAGGATGAC
GCCATCCTCA AGATCGGCCA GAACATGAAA TACGATGCCA AGATCTTTGC CCGTAACGGC
ATCGACGTGA CCCCGATCGA CGACACCATG TTGTTGTCCT ATGCGCTTCA TGGCGGCATG
CACGGACATG GGATGGATAC GCTGTCGGAG CGCTATCTCG ACCATCAGCC GATCCCGATC
AAATCGCTCT TGGGGAGCGG AAAATCCGCG ATCACATTTG ACCGGGTGTC GATAGAGGAC
GCCACGCCCT ATGCGGCCGA GGACGCCGAC ATCACCCTGC GCCTCTGGCA GCAGTTCAAA
CCGCAGTTGC ACCAGAAACA GGTGACCACC GTCTACGAGA CCTTGGAACG CCCGCTGGTG
CCGGTGCTGG CGCAGATGGA ACAACATGGC ATCAAGGTGG ACCGCGACAC GCTCAGCCGT
ATGTCGAACG CGTTCTCACA AAAAATGGCC GCGCTCGAGG CAGAGATCCA CGAGCTCGCG
GGCGAGACCT TCAATGTCGG CTCCCCCAAG CAGCTGGGCG AGATCCTTTT TGACAAGATG
TCTCTGCCGG GGGGCAAGAA GGGCAAGACC GGTGCCTATG CCACCGGGGC GGACATTCTT
GAGGATCTGG CCACGGAGCA TACCTTGCCC GCGCGCGTGT TGGACTGGCG GCAACTCTCC
AAGCTGAAAT CCACGTACAC AGACGCGCTT CAGGAGCACA TCCATCCCGA AACCGGGCGG
GTACATACGT CTTATTTGCA GACCGGCGCC AACACTGGGC GTCTGGCCTC GAGCGATCCC
AACCTGCAAA ACATCCCCGT GCGCAGCGAA GAGGGACGCC GCATCCGCGA GGCCTTTGTC
GCAGACGAGG GCAATGTGCT TCTGTCGCTC GACTACAGCC AGATCGAGCT GCGCATCCTG
GCCCATGTTG CGGGAATCGA TGCGCTAAAA CAGGCGTTTG CCGACGGCCA CGACATCCAC
GCGATGACCG CCTCCGAGGT CTTTGATGTG CCACTTGAGG AGATGACTCC GGACATTCGC
CGCAAGGCCA AGGCGATCAA CTTTGGCGTG ATCTATGGGA TTTCGGGCTT TGGGCTTGCA
CGCAACCTGC GTATTCCGCG CGGCGAGGCC CAGGGATTTA TCGACCGCTA TTTCGAGCGT
TTCCCCGGCA TTCGTCAGTA TATGGATGAC ACGGTGAACT TCGCCAAGGA GCACGGTTAT
GTGCAAACGC TCTTTGGCCG TAAGATCCAC ACGCCGGAGA TCGCAGCCAA GGGACCGCGT
GCGAGCTTTG CAAAACGCGC GGCTATCAAC GCGCCCATTC AGGGCACGGC CGCAGATGTC
ATCCGGCGCG CCATGGTGCG TATGCCAGAG GCCATCGCCC ACCTGCCCGC GCGCATGCTG
CTGCAGGTCC ACGATGAATT GCTGTTCGAA GTGCCCGAGG ATCACGTCGA AGAGACGATT
TCCGTCGCCC GCGAGATCAT GGAAGGCGCG GCTGATCCGG CAGTGCATAT GGATGTAAAA
CTGGTGGTCG ACGCGGGACG CGGCCAAAAC TGGGCCGAGG CGCATTAA
 
Protein sequence
MSTSNFGKGC HLHLIDGSAF IFRAYHALPP LTRKSDGLPI GAVSGFCNML FKQVEDNKGP 
DAPTHVAVIF DHSGKSFRND MYDQYKANRP PAPEDLVPQF PLTREATRAF NIACKEIEGF
EADDIIATLA CQAREAGGRV TIISSDKDLM QLVGDGVEML DAMKNKRIDS DGVREKFGVG
PDRVVDVQAL AGDSVDNVPG APGIGIKTAA LLINEFGSLE DLLDRAEEIK QPKRRQTLIE
KRDQIEMSKR LVQLDCDMEL DFTLDDLEVR DPDADTLLGF LAEMEFRTLS KRMADQLGRE
APTIPEAPSA AAAALELPEA PGFDSAEYTT VRDAETLQQW IDLIREHGYV AVDTETTGLN
EMIADLVGIS LCVVPGQACY VPLTHKTGNS DDLFGSDDLA EGQMPLKDAL EMLKPVLEDD
AILKIGQNMK YDAKIFARNG IDVTPIDDTM LLSYALHGGM HGHGMDTLSE RYLDHQPIPI
KSLLGSGKSA ITFDRVSIED ATPYAAEDAD ITLRLWQQFK PQLHQKQVTT VYETLERPLV
PVLAQMEQHG IKVDRDTLSR MSNAFSQKMA ALEAEIHELA GETFNVGSPK QLGEILFDKM
SLPGGKKGKT GAYATGADIL EDLATEHTLP ARVLDWRQLS KLKSTYTDAL QEHIHPETGR
VHTSYLQTGA NTGRLASSDP NLQNIPVRSE EGRRIREAFV ADEGNVLLSL DYSQIELRIL
AHVAGIDALK QAFADGHDIH AMTASEVFDV PLEEMTPDIR RKAKAINFGV IYGISGFGLA
RNLRIPRGEA QGFIDRYFER FPGIRQYMDD TVNFAKEHGY VQTLFGRKIH TPEIAAKGPR
ASFAKRAAIN APIQGTAADV IRRAMVRMPE AIAHLPARML LQVHDELLFE VPEDHVEETI
SVAREIMEGA ADPAVHMDVK LVVDAGRGQN WAEAH