Gene TM1040_0175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0175 
Symbol 
ID4078783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp192594 
End bp195974 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content63% 
IMG OID638005469 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_612170 
Protein GI99080016 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCCCA ACCGAACCCA CCGCCCGGTC GAGACCTTCA GCACCGCGCG CCTGAGCCCC 
CCGCACGCGG GGCCCGGTTA TGCCGAGCTC TGCGTGACCA GCAACTTCAC CTTTCTGACC
GGGGCTTCTC ATCCCGAGGA ACTGGTGACA CGCGCCGCGG AACTCGGGCT TGCCGCTATC
GCCATAACGG ATCGCAACTC ATTGGCCGGG GTGGTGCGCG CCTGGAGCGC CCTGAAAGAG
CTGCGGCGTG AGGCAGAGGC GGGTATCCAG ATCCGCTCGC ACCAGCGCGT TGATCCCTCT
TCGCGCCAGA AGATCGACAC CTCCGCGCCG CTTGAGCCGC CTGCGGCCCC GACCCTGCCC
AAGCTGATTG TCGGCTGCCG TCTGGTGCTG CGCGACAGCC AGACCGACTG GATTGCCCTG
CCTCGCGATC GGGCTGCTTA CGCGCGCCTG ACCCGCCTTT TGACGCTTGG CAAACGACGC
GCGCCCAAGG GCGAATGTCA TCTGGACGCC AGAGATCTGA TGGCGGCCTG CCGGGGCATG
AGTCTGATTG CGCTGCCACA ATCGGATCTG AAAACCGCCA TTCCCGAAAT CCGCCGGATG
CAGCAATGTT TTCCGCATTA TGTGTTTCTA GGCGCAGCGC CCCGGTACGA TGGCAGCGAT
CAGGCCTATC TGGCGGCCTG CGCTCATGCC GCGCAGATCA CAAGCGCACC AATGGTGGCG
GTCGGGGATG TGCTGATGCA CCGCGCCAGC CGCAGACCGC TGGCGGATGT GCTGACCTGC
ATGCGCGAAC ATATCACCAT TGATGAAATC GGCAGCCGCG CCCTGCCGAA CTCTGAACGC
CGCCTCAAGG CGGGTGCAGA CATGGCCCGG CTCTTTCAGG CCCACCCCGC CGCCCTACGC
CGCACGCTGG AGATCGCGGA CAAATGCAGC TTCGATCTCG GTGAGCTGTC GTATGACTAC
CCGCATGAGA TCGTCGATGG TGAGACACCG CAGGCGCGGC TTGAGCGGCT GGCGGCAGAG
GGGCTCAGGA GGCGCTATCC CGACGGCCCC ACGCAAAAAG CCGTCGTCCT CATGGACAAG
GAACTGAAAC TCGTCGCGGA ACTGGGCTTT GCCGCCTATT TCCTGACCGT GCATGACATC
GTGCAATACG CCAAATCGCA GAACATCCTC TGCCAGGGGC GTGGCTCGGC AGCGAATTCG
ATCCTGTGTT ACCTGCTTGG CATCACCGAC GTCAGCCCCG ACATGATCAC CATGGTGTTT
GAACGCTTTG TCTCGCGATA CCGGGGCGAG CCGCCGGACA TCGACGTGGA TTTCGAGCAT
GAACGCCGCG AGGAGGTGAT CCAATGGATC TACCAGAAGT ACGGCCGCGA CCGGGCGGGG
CTTTGTGCCA CCGTGATCCA TTTCCGCTCG CGCGCCGCCA TCCGCGAGGT GGGCAAGGTC
ATGGGTCTCT CTCAGGACGT CACCGCAGGC CTGTCGGGCC AGATCTGGGG CATGAGCAAC
GGCGGCGTTG ATCTCAAACG CATCGAAGAG CTTGGCCTCG ACATTCAGGA CCGCCGCCTG
ATGCAGACCA TCCGGTTGAT CGGAGAGATC ATCGGCTTTC CCCGGCATCT GTCCCAGCAC
GTCGGCGGTT TTGTCATCAC CAAGGGCCGC CTGGATGAGC TGGCCCCGAT TGAAAACGCC
GCGATGGAAG ACCGCACCGT GATCGAGTGG GACAAGGACG ACATCGACGC GCTTGGCATC
CTGAAAGTCG ACGTGCTGTC GCTGGGCATG CTCACCTGCC TGCAAAAATC CTTTGCCCTG
CTCAAGGCGC ATGAGGGAGA GACCCTTTCT ATCGGCACCG TTCCTCAGGA GGATGCCAAG
ACCTATGCCA TGCTATGCCG CGCGGATGCG GTCGGGGTCT TTCAGGTGGA GAGCCGGGCA
CAGATGAACT TTCTGCCCCG CATGCAGCCG CGCGAGTTTT ATGATCTGGT GATCGAGGTG
GCGATCGTGC GCCCCGGCCC CATTCAGGGC GATATGGTCC AGCCCTATAT CCGCCGCCGC
AATGGCCTCG AGGAGCCCGA ACCCTTTGGC CCGGAGCTTG AACAGGTCAC CAAACGCACC
TTGGGCGTGC CTCTGTTTCA GGAACAGGCC ATGCAGATTG CCGTGGTCGG TGCGGGCTTC
ACCCCCGAAG AAGCCGACCA CCTGCGTCGT TCGCTGGCGT CCTTTCGGCG CATGGGCACC
ATTGGCACCT TTCGCGACAA GTTCATCAAC GGCATGCTCG ACAAAGGCTA CAGCCAGGAG
GTCGCCGCGC GCTGCTTTTC TCAGATCGAG GGCTTTGCCG ATTATGGCTT TCCCGAAAGT
CACGCAGCCG CCTTTGCCAT GCTGGCCTAT GTTTCGGCCT GGCTCAAATG CCACCATCCG
GCGGTCTTTG CCTGCGCGCT TTTGAACTCA CAGCCGATGG GGTTCTATGC CCCGGCCCAG
ATCGTACGCG ATGCGCGCGA ACACGGCGTC GAGATCCGGC CGATCTGCGT CAATGCGTCG
GATTGGGATA ACCAGCTGGA GCGTCGCCCG GATGGGGCGC TGGCCCTGCG ACTGGGGTTT
CGCCAAATCA AAGGGTTTCG CGAGGAGGAC GCAGGCTGGA TCACCGCCGC GCGCGGCAAT
GGCTATCCCG ACCCAGAATC GCTGTGGCTG CGCGCTGGCC TGCGCCCGGA TGTCCTGACC
CGGCTCGCCG AGGCGGATGC GTTTTCCGAC ATGGGGCTTA CACGCCGCGA CGCCCTGTGG
CAGGTCAAGG CAATCCGCAG CGCCAAGCCG TTGCCTCTGT TTGACGACCC GATTGATGGC
GAGAGCATTT CAGAGCCTGC CGTAGACCTG CCCGTCATGC ATCTGGGCGA AGAGGTCGTG
GAGGATTACA TCTCCACCCG GCTGACACTG CGCGCACATC CGATGGAATT GCTGCGCCCC
GCGCTGCCGG GCCTCACCCC GCATGCCGCG CTTCTGGACG CGCCATTGGG GCGGCACACG
GTCTGCGGGC TGGTGATCAC ACGCCAGCGC CCCGGCACGG CATCGGGTGT CATCTTTCTG
ACGCTGGAGG ATGAAACCGG GGTCAGCAAT GTGGTCGTCT GGCCCAAGAT TTATGAACAA
TATCGCCGCA TCGTCATGGG AGGGCGGCTG CTGCAGGTGC GCGGATATCT GCAACGCGAA
GGGATCGTCG TGCATCTGAT CGCCCAGGAG ATCACCGATA TGTCGCACCG GCTCTCCGAC
CTTGGGCACC CGCTGGACGA GGCCGTGGGG ATCACCCAAC CGCAGGCCGA TGACGCCCCC
CGCCCCAGAC AGGCCACCAC AAGCGCGCGC CATCCACGCG AACAGGCCAA GCGCCTGTTT
CCAAGCCGGG ATTTTCACTA G
 
Protein sequence
MEPNRTHRPV ETFSTARLSP PHAGPGYAEL CVTSNFTFLT GASHPEELVT RAAELGLAAI 
AITDRNSLAG VVRAWSALKE LRREAEAGIQ IRSHQRVDPS SRQKIDTSAP LEPPAAPTLP
KLIVGCRLVL RDSQTDWIAL PRDRAAYARL TRLLTLGKRR APKGECHLDA RDLMAACRGM
SLIALPQSDL KTAIPEIRRM QQCFPHYVFL GAAPRYDGSD QAYLAACAHA AQITSAPMVA
VGDVLMHRAS RRPLADVLTC MREHITIDEI GSRALPNSER RLKAGADMAR LFQAHPAALR
RTLEIADKCS FDLGELSYDY PHEIVDGETP QARLERLAAE GLRRRYPDGP TQKAVVLMDK
ELKLVAELGF AAYFLTVHDI VQYAKSQNIL CQGRGSAANS ILCYLLGITD VSPDMITMVF
ERFVSRYRGE PPDIDVDFEH ERREEVIQWI YQKYGRDRAG LCATVIHFRS RAAIREVGKV
MGLSQDVTAG LSGQIWGMSN GGVDLKRIEE LGLDIQDRRL MQTIRLIGEI IGFPRHLSQH
VGGFVITKGR LDELAPIENA AMEDRTVIEW DKDDIDALGI LKVDVLSLGM LTCLQKSFAL
LKAHEGETLS IGTVPQEDAK TYAMLCRADA VGVFQVESRA QMNFLPRMQP REFYDLVIEV
AIVRPGPIQG DMVQPYIRRR NGLEEPEPFG PELEQVTKRT LGVPLFQEQA MQIAVVGAGF
TPEEADHLRR SLASFRRMGT IGTFRDKFIN GMLDKGYSQE VAARCFSQIE GFADYGFPES
HAAAFAMLAY VSAWLKCHHP AVFACALLNS QPMGFYAPAQ IVRDAREHGV EIRPICVNAS
DWDNQLERRP DGALALRLGF RQIKGFREED AGWITAARGN GYPDPESLWL RAGLRPDVLT
RLAEADAFSD MGLTRRDALW QVKAIRSAKP LPLFDDPIDG ESISEPAVDL PVMHLGEEVV
EDYISTRLTL RAHPMELLRP ALPGLTPHAA LLDAPLGRHT VCGLVITRQR PGTASGVIFL
TLEDETGVSN VVVWPKIYEQ YRRIVMGGRL LQVRGYLQRE GIVVHLIAQE ITDMSHRLSD
LGHPLDEAVG ITQPQADDAP RPRQATTSAR HPREQAKRLF PSRDFH