Gene Caci_5240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5240 
Symbol 
ID8336594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6032607 
End bp6035990 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content72% 
IMG OID644958338 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003115940 
Protein GI256394376 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0104039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT TCAACCCGGT CATCCCCTGG CGCGAGCTGG AGAAGCGCCT CACCTGGGGA 
CGTTACGGCG AGGCCGTGCC TGCTGCCCCC GGACCGGAGG CGGCAGCTGC GGCGCCGCCG
GACCCGGAGT CCCGCCGCAG CGCCAAACTC GATCCGCCCT ACGCCGAACT CCACGCGCAC
TCCCACTTCT CCTTCCTCGA CGGCGCCTCC AGCCCCGCCG AGCTGGTCGC CGAGGCCGCC
CGGCTGGGTC TGCACGGCCT GGCCGTCACC GACCACGACG GCTTCGCCGG CGCCATGCAG
TACAAGCAGG CCGCCGAGGA CGCCGAGCTG GCCACGGTCT TCGGCGCCGA GCTGTCGCTG
GAGCTGACCG CGGCTCGCAC CGGCGTCACC GATCCGGCAG GCAGCCATCT GCTGGTGCTG
GCCCGCCGCG CCGAGGGCTA CCGGCGGCTG TCGGCGGCGA TCGGCAAGGC GCAGCTGGCC
GGGCAGGAGA AGGGCAAGCC CGTCTATGTC CTGGACGATC TGGCCGAGGC CGCCGCCGAC
GGCGACTGGA CCATCCTGAC CGGCTGCCGC AAGGGCACCG TGCGCCAGGC GCTGGAGCGC
CACGGCCTGA CCGAGGCCGG TTTCGAGGCC GCCGACGCCG AATTGCGCAA GCTGATGGAC
CTGTTCGGCG ATCGCAATGT CCTGATCGAG CTCACCGACC ACGACCAGCC GCTGGACGAC
GCCCGCAACC GTCTCCTGGC CCGCCTGGCC GCCCGCCACG GCCTGCCGAC CGTGGCCACC
GGCAACGTCC ACTACGCCCG CCCGGCCGAC TACCGCCTCC ACACCGCCAT GGCCGCGCTG
CGGGCCCGCC GCACCCTGGC CGAGATGGAC GGCTGGCTGC CCGGCGCGCC CACCGCGTAT
CTGCGCAGCG GCGCCGAGAT GCTCGCGCGC TTCCCCGTGC ACGAGTTCGG CCGCGCCGTG
GTCCGCAGCG CCGAGCTGGC CGCCGACCAC GCCTTCGACT TCAAGACCAT CAACCCGCAG
CTGCCCGACT ACCCGGTGCC CGAAGGACAC ACCGAGGCCA GCTGGCTCAG ACATCTGACA
TACCTCGGCG CCGCGCAGCG CTACGGCTCC GAAAGCGAGA ATCCCAAGGC CTACCACCAG
ATCGCCTACG AACTCGACGT CATCGAGCAG CTGAAGTTCC CCGGCTACTT CCTCATCGTC
TACGACATCG CAGAGTTCTG CCGGGCCCAG GGCATCCTGG CCCAGGGCCG GGGCTCGGCC
GCCAACAGTG CGGTCTGCTT CGCCCTGGGC ATCACCAGCG TCGACTCGGT CCGGCACGGC
CTGGTCTTCG AGCGCTTCCT GTCCCCGGTG CGCGACGGCC CACCGGACAT CGACGTCGAC
ATCGAGGCCG GGCGCCGCGA GGACGTGATC CAGCACGTCT ACGCCAAGTA CGGACGCGAC
CGCGCGGCCA TGGTCTGCAA CGTGATCACC TACCGCGCCC GGCTGGCGCT GCGCGACTCG
GCGCGCGTTC TGGGCTACTC CCCGGGCACC GTCGACGCCT GGGCCGGGAC CATCGGGCCG
CACGAAGGCG TCCCGGACGG CTCGGAGGAG ATCCCCGAAC AGGTGCTGGA GCTGGCCGGA
CGGCTGCAGC GCCGGCCGCG GCACCTGGGC ATCCACAACG GCGGCATGGT GATCGTGGAC
CGGCCGCTGG CGCAGGTGGT GCCGATCGAG TGGGCGCGCC GCGAGGGCCG CACGGTCCTG
CAGTGGGACA AGGACGACTG CGCGGCCGCC GGGCTGGTGA AGTTCGACCT GCTGGGCCTG
GGCATGCTCG GCGCGATCCA CGACGCGCTG GACCTGATCG CCGAGCACCA CGGCACCCGC
CTGGGCCTGC ACGACCTGCC GCAGGACGGG GACCCCGAGG CGGACGCGGT CTACGCGATG
ATCCAGGACG CCGACACCGT CGGGGTGTTC CAGATCGAGT CGCGGGCCCA GATGGTCACG
CTGCCGCGGC TGAAGCCGAA GTGCTTCTAC GACCTCGTGG TCGAGGTGGC GCTGATCCGC
CCCGGGCCGA TCCAGGGCGG CGCGGTGCAT CCGTATCTGC GGCGGCGCAA CGGCGACGAG
GAGGTCACCT ACCCGCACCC CACGCTGGAG CCGGTGCTGG AGCGCACGCT GGGCGTCGTG
CTGTTCCAGG AGCAGGCGAT GCAGATGGCG ATCGCGGCGG CCGGGTTCAG CGCCGCCGAG
GCCGACCGGC TGCGCCAGGC CATGGGTTCC AAGCGGTCGC CGGAGCGGAT GGCGGAGCTC
AAGGAGCGGC TGATGGCCGG GATGGCCGAG CGCGGCATCA CCCCGGAGAC GGGTGAGGAC
ATATACGCCA AGCTGCACGC CTTCTCAGGG TACGGATTCC CGGAGTCGCA CTCGGTGTCG
ATGGCGTACA TCGTGTGGTG CAGCTGCTAT TTGAAGCGTT ACTACCCGGC GGCGTTCACC
GCCGCGCTGC TGAACAACCA GCCGATGGGC TTCTACAGCC CCGGATCGCT GGTCACCGAC
GTGCGGCGGC ACGGGGTGCA GGTGAAGCGG GTGGACGTGA ACGCCTCGGG GGCGAAGGCG
ACGTTGCAGA GCCCTGACAA GCCCTACGGC TCTGGCTCCG GCTCCGGCTC TGGTTCTGGC
TCCGACCGGA CGCGGCACCG GCACGCCTCG CCGATCGCGC AGCCGGCCGT CCGGCTGGGC
CTGAGCAGCG TGCGCGACCT CGGCGACGAC GCGGCCGAGC AAGTCGTCGC CGAGCGGACG
GCGGGCGGTC CGTTCACCGG CGTCGAGGAC TTCATCGTGC GCACCGGGTT GTCGCGCTCG
ACGCTGGAGG CGCTGGCCAC CGCCGGGGCG TTCGGGTGCT TCGGACTGGA CCGCCGCGAG
GCGCTGTGGA CGGCCGGCGC GCTGGCCGGC ACCACCGCCG GACACCTGCC GGGCACCGCG
CCGGGCACGA CCGTCCCGCA GCTGGACCCG CTCACGCCGG TGGAGGTCAC GCTGGCCGAC
CTGTGGGCCA CCGGCACCAG CCCGGAGGAC CACCCGATCG GCCACCTGAG GTGGCGGCTC
GCGATGCGCG GCGTGACCCC CGCCGTCGAA CTGCGCACCG CGCGCAACCG CAGCCTGGTA
CGGGTCGCCG GACTGGTCAC ACACCGGCAG CGGCCGCCGA CGGCGCACGG AACGTGCTTT
CTGAGCATGG AGGACGAGAC CGGGCTGATC AACGTGATCT GCCCGGCGCC GGTGTGGGAG
GCGCAGCGGA AGGTGGCGCT GCGGTACGGG GCGCTGCTGA TCCACGGGAC GCTGGAGCGG
ACGGACGGGG CGGTGAACGT GGTGGCGGGG CGGATCGCCC GGCTCGACGT GGTGGTTCCC
GATCGCAGTC GGAACTTTCG GTGA
 
Protein sequence
MSTFNPVIPW RELEKRLTWG RYGEAVPAAP GPEAAAAAPP DPESRRSAKL DPPYAELHAH 
SHFSFLDGAS SPAELVAEAA RLGLHGLAVT DHDGFAGAMQ YKQAAEDAEL ATVFGAELSL
ELTAARTGVT DPAGSHLLVL ARRAEGYRRL SAAIGKAQLA GQEKGKPVYV LDDLAEAAAD
GDWTILTGCR KGTVRQALER HGLTEAGFEA ADAELRKLMD LFGDRNVLIE LTDHDQPLDD
ARNRLLARLA ARHGLPTVAT GNVHYARPAD YRLHTAMAAL RARRTLAEMD GWLPGAPTAY
LRSGAEMLAR FPVHEFGRAV VRSAELAADH AFDFKTINPQ LPDYPVPEGH TEASWLRHLT
YLGAAQRYGS ESENPKAYHQ IAYELDVIEQ LKFPGYFLIV YDIAEFCRAQ GILAQGRGSA
ANSAVCFALG ITSVDSVRHG LVFERFLSPV RDGPPDIDVD IEAGRREDVI QHVYAKYGRD
RAAMVCNVIT YRARLALRDS ARVLGYSPGT VDAWAGTIGP HEGVPDGSEE IPEQVLELAG
RLQRRPRHLG IHNGGMVIVD RPLAQVVPIE WARREGRTVL QWDKDDCAAA GLVKFDLLGL
GMLGAIHDAL DLIAEHHGTR LGLHDLPQDG DPEADAVYAM IQDADTVGVF QIESRAQMVT
LPRLKPKCFY DLVVEVALIR PGPIQGGAVH PYLRRRNGDE EVTYPHPTLE PVLERTLGVV
LFQEQAMQMA IAAAGFSAAE ADRLRQAMGS KRSPERMAEL KERLMAGMAE RGITPETGED
IYAKLHAFSG YGFPESHSVS MAYIVWCSCY LKRYYPAAFT AALLNNQPMG FYSPGSLVTD
VRRHGVQVKR VDVNASGAKA TLQSPDKPYG SGSGSGSGSG SDRTRHRHAS PIAQPAVRLG
LSSVRDLGDD AAEQVVAERT AGGPFTGVED FIVRTGLSRS TLEALATAGA FGCFGLDRRE
ALWTAGALAG TTAGHLPGTA PGTTVPQLDP LTPVEVTLAD LWATGTSPED HPIGHLRWRL
AMRGVTPAVE LRTARNRSLV RVAGLVTHRQ RPPTAHGTCF LSMEDETGLI NVICPAPVWE
AQRKVALRYG ALLIHGTLER TDGAVNVVAG RIARLDVVVP DRSRNFR