Gene Hoch_4061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4061 
Symbol 
ID8546462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5576248 
End bp5579772 
Gene Length3525 bp 
Protein Length1174 aa 
Translation table11 
GC content67% 
IMG OID646388738 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003268453 
Protein GI262197244 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.496247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0397833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGT TCACACATCT CCACCTGCAC ACTCAATATA GCCTCTTGGA TGGGGCCATC 
CGGGTCAAGG ACCTCTTTCC GAAGGTGCTC GAGCTCGGCA TGGACTCGGT GGCGGTCACC
GACCACGGCA ACATGTTCGG CGCCGTGGAT CTGTACACCG CGGCCCGCGA GCACGGGGTC
AAGATGATCT TCGGCTGCGA GACCTACATC GCGGCCAGCG ATCGCTTCGA CCGCACCAAC
CGCCGCAGCT ATCACATGGT GCTGCTGGCC AAGGACCAGG TCGGCTACAA GAACCTGTCG
TTCCTCAACT CCATGGGCTA CCTGGAGGGC TTCTACTACA ACCCGCGCAT CGACAAGAAG
ATCCTGCGCG AGCACTCCGA GGGGCTCATC GGCATGAGCG CGTGCCTCGG CGGCGAGGTC
GCGCAGACCC TGCTCAAGCA GGGCACGGCC GAGGCCGAGG CGGTGGCGCT CGAGTACCAG
GACATCTTCG GCAAGGGCAA CTTCTACCTG GAGATGATGG GCAACCAGCT CGAGGAGCAG
CAGCAGCTCA ACGCCGACCT CAAGCGCATG AGCAGAAAGC TCGGCATCCC GTTGGTGGCG
ACCAACGACT GCCACTACGT GAGCCAGCAG GACGCCCACG CCCACGAGAT CCTGCTCAGC
ATCCAGACCG GCAAGACCAC CAGCGACGAC AAGCGCCTGC GCCACACGGT CGACTCCTAC
TTCCTCAAGA ACGCGGCCGA GATGGAGGTG GACTTCAGCG ACGTGCCCGA GGCCATGGAG
AACACGGTCG ACATCGCCAG GCGCTGCAAC GTCGAGCTCG ACCTCGGCAA CACCTTCCTG
CCCAACTACA AAGTCCCCGA GGGCATGAGC GTCGAGAGCT ACCTCGAGAA GGTGGTCTTC
GACGGCCTCG AGCGCCGCTT CCGCGAGCTG AGCGAGATCG GACGCTCGTT CGATCCCGAC
CAGTACCGCG AGCGCCTGCG CCACGAGCTC GGCGTCATCA ACAAGATGGG GTTCCCGGGC
TACTTCCTCA TCGTGTGGGA CTTCATCAAC TGGGCCAAGG AGCACAACAT CCCGGTCGGC
CCGGGCCGCG GCTCGGGCGC CGGTAGCTGC GTGGCCTGGG CCATGCGCAT CACCGACATC
GACCCGCTGG AGTTCAAGCT GCTGTTCGAG CGCTTCCTCA ACCCCGAGCG CGTGAGCATG
CCCGACTTCG ACGTCGACTT CTGCATGAAC CGGCGCGACG AGGTCATCGA CTACGTCAAG
CGCAAGTACG GCGAGAAGAA CGTCGGCCAG ATCGCCACCT TCCACCAGCT CAAGGCGCGC
GGCGTCATCC GCGACATCGC CCGGGTCATG GAGCTGCCCT ACGCCGAGGC CGACAAGCTG
GCCAAGCTGG TGCCCGACCC GGTGGGCGGC AAGTCGCCGC CGGTGCGCGA CGCCATCGAG
CAGGAGCCCG AGCTCAAGCG GCTGTACAAC GAGGACCCCA AGATGCGCGA CCTGCTCGAC
GTCGCCGCCT CGCTCGAGGG CCTCAACCGC CACGTCGGCA TGCACGCGGC CGGCGTGGTC
ATCGCCGAGG ATCCGCTGTG GGAGTACGTG CCCTGCTTCC GCGGGCAGAA CGGCGAGATC
GTCACCCAGT TCGCGATGAA GGAGGCTGAG AAGGCCGGCC TGGTCAAGTT CGACTTCCTC
GGCCTCAAGA CCCTCACCGT CATCCACACC GCGGTGCGCC TGATCAACGA GCAGCGCGAG
GCCGCGGGCG AGCCGCTGTT CGACATCGAC AAGATCGACA AGGGCGACCC CGCGGTCTAC
AAGATGATCT CGCGCGGCGA CACCACCGGC GTGTTCCAGC TCGAGTCCTC GGGCTTCCGC
GAGATCCTCA CCAAGCTCAA GCCCGACCAG CTCGAGGACA TCGTCGCCGC CGTGGCCCTG
TACCGACCCG GCCCGCTCGA GGGCGGCATG GTCGACGACT TCATCGAGCG CAAACACGGG
CGCAAGCGGG TCGAGTATCC GCACCCCGAC CTCGAGCCGG TGCTGCGCGA CACCTACGGC
GTCATCGTCT ACCAGGAGCA GGTGATGCAG ATCGCCCAGG TCCTGGCCGG CTACAGCCTG
GGACGCGCGG ATCTTCTCCG CCGCGCCATG GGCAAGAAGA ACAAAGACGT CATGGCCAAG
GAGAAGGACG GCTTCGTGAC CGGCGCCACC GAGCGCGGCG TCGACTCCAA GCTGGCCGAG
CAGGTCTTCG ACCTCATGGC CTTCTTCGCC GGCTACGGCT TCAACCGCTC GCACTCGGCC
GCCTACGGCT GGATCTCGTA CCAGACCGCC TACCTCAAGC ACCACTACCC GCACGAGTTC
ATGGCCGGTC TGATGTCCTG CGACCAGGAC AACACCGACA ACATCGTCAA GTTCATCGCC
GAGGCCCGGG CCATGGGCCT GGTCGTGGCC CGTCCCGACG TCAACGAGTC GGCCGCCGAC
TTCACCGTCG TGGTCGGCGA GGACGACACC AAGCGCATCC GCTTCGGCCT GGGCGCGGTC
AAGGGCGTGG GTCAGGGCGC GGTCGAGGCC ATCATCGAGT GCCGCAATCA GGACGAGAAG
TTCATCTCGA TCTACGAGTT CTGCCGGCGC GTGGACTCGC AGAAGTGCAA CCGGCGCGTC
ATCGAAGCCC TGGTCAAGAG CGGCGCCTTC GACGGCCTGT CCGAAGACGC CGGTCTGCAC
CGCGCGCGCG TGTTCGCCAC CATCGAGGCG GCCATGGAGA GCGGCGCCCA GGCCCAGCGC
GACCGACGCA GCGGCCAGAC CTCGCTGTTT GGCCTCATCG CCGCGCCCGA GGGCGACAGC
GGCGCCACCG CCGACGGCAT GCCCGAGACC TACCCCGAGG TCGAAGAGTG GCCGGCCAAG
GAGCTGTTGG CCTTCGAGAA GGAGTCGCTC GGCTTCTACA TCAGCGGCCA CCCGCTCGAC
CGCTACCGCG CCGATCTTAC GCGCTACGCC AACGCCACCA CCACCGACTT CCTCGAGGGC
AAGCGCCCGG CCGGCCCGGC CGCGGTCGGC GGCGTGGTCT CGGCCTACCG CGAGCGGCCG
ACGCGCAAAG GCGACGGCAA GATCGCCTTC TTCCAGCTCG AGGACGCCAC CGGCCAGCTC
GAGGTCATCG TCTTCCCCAA GACCTTCGAG CGCGTGCGCG AGACCCTGGT GCTCGACGAG
CCCATCCTGT GCAGCGGCAA GGTGGTCGAT GAAGGCGAGG GCGCCCAGCA CGCCTGGCGC
ATGCTGCTCG AGGAGGCCAC GCCCCTGGCC CATCTGCGCC AGAGCCAGAC CTCGCGGGTC
GATATCCACA TCGCGGCCGA CCAGGTCACG CCCGATCAGA TCGAGGCGCT CGAGCAGATC
CTCACCGCCA GCCGGGGCTC GTGTCAGGCC GTGCTGCACC TGTCCATCCC GCGCCGCTCG
GCGACCTCGG TGTGGCTCGA CCCGCGCTGG AACGTGGCCC CGAGCGAGGA GCTTCTGGCG
CGCATCGAGC GGCTCTTCGG CGCCCCGGTC GCGACCCTGC ACTGA
 
Protein sequence
MSAFTHLHLH TQYSLLDGAI RVKDLFPKVL ELGMDSVAVT DHGNMFGAVD LYTAAREHGV 
KMIFGCETYI AASDRFDRTN RRSYHMVLLA KDQVGYKNLS FLNSMGYLEG FYYNPRIDKK
ILREHSEGLI GMSACLGGEV AQTLLKQGTA EAEAVALEYQ DIFGKGNFYL EMMGNQLEEQ
QQLNADLKRM SRKLGIPLVA TNDCHYVSQQ DAHAHEILLS IQTGKTTSDD KRLRHTVDSY
FLKNAAEMEV DFSDVPEAME NTVDIARRCN VELDLGNTFL PNYKVPEGMS VESYLEKVVF
DGLERRFREL SEIGRSFDPD QYRERLRHEL GVINKMGFPG YFLIVWDFIN WAKEHNIPVG
PGRGSGAGSC VAWAMRITDI DPLEFKLLFE RFLNPERVSM PDFDVDFCMN RRDEVIDYVK
RKYGEKNVGQ IATFHQLKAR GVIRDIARVM ELPYAEADKL AKLVPDPVGG KSPPVRDAIE
QEPELKRLYN EDPKMRDLLD VAASLEGLNR HVGMHAAGVV IAEDPLWEYV PCFRGQNGEI
VTQFAMKEAE KAGLVKFDFL GLKTLTVIHT AVRLINEQRE AAGEPLFDID KIDKGDPAVY
KMISRGDTTG VFQLESSGFR EILTKLKPDQ LEDIVAAVAL YRPGPLEGGM VDDFIERKHG
RKRVEYPHPD LEPVLRDTYG VIVYQEQVMQ IAQVLAGYSL GRADLLRRAM GKKNKDVMAK
EKDGFVTGAT ERGVDSKLAE QVFDLMAFFA GYGFNRSHSA AYGWISYQTA YLKHHYPHEF
MAGLMSCDQD NTDNIVKFIA EARAMGLVVA RPDVNESAAD FTVVVGEDDT KRIRFGLGAV
KGVGQGAVEA IIECRNQDEK FISIYEFCRR VDSQKCNRRV IEALVKSGAF DGLSEDAGLH
RARVFATIEA AMESGAQAQR DRRSGQTSLF GLIAAPEGDS GATADGMPET YPEVEEWPAK
ELLAFEKESL GFYISGHPLD RYRADLTRYA NATTTDFLEG KRPAGPAAVG GVVSAYRERP
TRKGDGKIAF FQLEDATGQL EVIVFPKTFE RVRETLVLDE PILCSGKVVD EGEGAQHAWR
MLLEEATPLA HLRQSQTSRV DIHIAADQVT PDQIEALEQI LTASRGSCQA VLHLSIPRRS
ATSVWLDPRW NVAPSEELLA RIERLFGAPV ATLH