Gene Hoch_3204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3204 
Symbol 
ID8545592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4417633 
End bp4420857 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content72% 
IMG OID646387871 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_003267599 
Protein GI262196390 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0855873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT ACGCGCCGCT GTGGTGCAAG AGCAACTTCT CGTTCCTCGA GGGCGCCAGC 
CACCCCGACG AGCTCATCGA GGAGGCCCAC GCCCTGGGCC TGCGCGCGCT CGCGCTCAGC
GATCGCGACG GCCTCTACGG CGTGGTCCGC GCCCACGTGT GCGCCGAGAA GATCGGCTTC
AAGCTCATCC ACGGCGCCCA GGTGAGCGTC GATGACGGCA GCCAGATCGT GCTCCTGTGC
CGCGACCGCG CCGGCTACGC CAACCTGTGC CATCTGCTCA CCAAGGGCCG GCGGCGCTCG
GACAAGGGCA GCTCGCAGGT GTCGTGGCGC GAGGTGTGCG CGCACGCCGG CGGCCTCATC
GCGCTGTGGG GCGGCGCCGG CAGCCTGCTC ACGCGCCCGG GCGAGAACCG GCCCGGCGGC
GTCCCGGCCG ACATCCCGCG GCCCCTGGGC GGTCGCCCTC AGAGCTATCT CGGACGCGTG
GCCGACGACT TGCGCGAGGC CTTTGGCGAC GCGCTCTACG CCCTGTGCGC GCGCCACCGC
GAGGCCGAAG AGGTGGTCAC CGAGGCCCGC CTGCGCGCGC GCGCCGAGCG CTTCGGCCTG
CCCGTGGCGG CCGCGGTCGA GGTGCTCTAC CACAGCCGCG CGCGCCGGCC GCTGCAGGAC
GTCCTCACCT GCCTGCGCCA CCACGTCACC CTGAGCACCG CCGGCCGCTA CATCCGCGCC
AACGACGAGC ACGACCTGCA CTCGCCCCAG GCCTTTGGCA TCCTCTTTGA CGACGACCCC
GCGGCCGTGG CCCGCACCCT CGACATCGCC GCGCGCTGCC AGTTCGGGCT CGGCGAGATC
CGCTACCGCT ACCCCTCGGA GCGGCTGCCG AGCGGCAAGA CCACCTCCGA GTGGCTGCGC
GAGCTCAGCT TCGAGGGCGC GCGCTGGCGC TACCGCGGCG AGGTCCCCGC CGATGTCCGC
GCGCAGCTCA CGCGCGAGCT CGCGCTCATC GACGAGCTCG ACTACGGCGG CTACTTCCTC
ACGATGTACG AGATCGTCCG CTTCTGTCGC GCCCAGGGCA TCCTGTGCCA GGGCCGCGGC
TCGGCCGCCA ACTCGGCCGT GTGCTACTGC CTCGACATCA CCGCCGTGGA CCCGGTGCGC
ATGGGCCTCT TGTTCGAGCG CTTCCTGTCG CGCGAGCGCG CCGAGCCGCC CGACATCGAC
CTCGACATCG AGCACGATCG CCGCGAAGAG GTCATCCAGC ACGTCTACGA CAAGTACGGG
CGCGATCACG CCGCCATGGT CGCCGTGGTC ATCCGCTACC GGCCGCGCTC GGCCGTGCGC
GACGTCGGCA AGGTGCTCGG CATCCCGGCG ACCTCGCTCG ACCGCTGCGC CAAGCTGCTC
TCGCACTACG AGGGCATCAC GGCCGAGGCC CTCGAGCAGG CCGGCATGGA CCCGCACCTG
CCCGCGCACC AGCACCTGGG CCGCCTGGCC AGCGAGATCC TCGACTTCCC GCGCCACCTC
TCGATCCACC CCGGCGGCTT CCTGCTCGGC CACGAGCCCG TGCACAGCCT GGTGCCCATC
GAGAACGGCG CCATGGCCGG GCGCACGGTC ATTCAGTGGG ACAAGAACGA CCTCGAAGAC
CTCGGCCTGT TCAAGGTCGA CCTGCTCGGC CTGGGCGCGC TCAACCAGCT CCACCGCTGC
TTCGACCTGG TGTCCGAACA CCGCGGCATC GACCTGAGCA TGGCCACCAT CCCGGCCGAC
GACACCGCCA CCTACGACAT GATCTGCCGC GCCGATACCG TCGGCGTCTT CCAGATCGAG
AGCCGCGCGC AGATGTCCAT GCTGCCGCGG CTGCGGCCGC GCTACTTCTA CGACCTGGTC
ATCGAGGTGA GCATCGTGCG CCCGGGGCCG ATCACGGGCG GCATGGTCCA CCCCTACCTG
CGCCGGCGCC ACGGCCTCGA GAAGATCGAA TATCCCCACG AGAGCCTCGA GCCGGTGCTC
GAGCGCACCC TGGGCGTGCC GCTGTTTCAA GAGCAGGTGA TGCGCCTGGC CATGGTCGCG
GCCGACTACA CCCCGGGCGA AGCCGACCAG CTCCGCCGCG ACATGGCCGC CTGGCGCCGC
AGCGGCCGCA TCGACCAGCA CCGCGAGCGC CTGGTCTCGG CCATGACCCG CAAGGGCATC
GCGGCTGAGT TCGCCGAGCG CGTGTTCGAA CAGATCCGCG GCTTCGGCGA GTACGGCTTC
CCCGAGAGCC ACGCCGCCAG CTTCGCGCTC ATCGCCTACG CCACCGCCTA CATGCGCTGT
CACTTCCCGG CCGAATACGC GTGCGCGCTG CTCAACGCCC AGCCCATGGG CTTCTACTCG
CCGGCCACCA TCATCAACGA CGCCCGCCGT CACGGCGTGA GCGTGCGCCC TATCGACGTC
GGCGCCAGCG CCTGGGACTG CACCCTCGAG CCCCTGCCGG CGAGCCAGCG CCGCACCACC
GAAGAGAACG GCGACAGCGG CGACAGCGAC AGCGACGCGC CCGCGCGCAT CTGTTACGCC
ATCCGCATGG GCCTGCGCTA CGTCAAGGGC CTGCGCCGCG ACGCCGGGAC GCGCATCGAA
GATGCCCGCG CCCGCGCGCC CTTCGCCGAC CTCGGCGATT TCGTACGCCG CACCCGACTC
GACGAGCGCT CGCACACCCG CCTGGCCGAA TCCGGCGCCC TGGCCGCCTT TGGGCGCAAT
CGCCGCGACG TGCTGTGGCA GGTGCGCGGC CATCAGCGCG CGAAATCGGA CACCCTGTCC
CTGCCCCAGA CCGGCCCGGC GCCCAGCCTG GCCCAGCTCG ACCAGCTCGA CGAGATCCTC
TGGGACTACC AGGCCAGCCT GCACAGCACC CGCGGCCATC CGCTCGAGCC GCTGCGCGCC
TCGCTGCGCG CCCAGAACAT CGCCGACGCG CGCTCTGTGC AGCGCATGCG CCACGGCCAG
CGCCTGCGCT ACGCCGGCCT GGTCATCTGC CGACAACGCC CGCCCACGGC CGCCGGCGTG
ACCTTCATGA CCCTCGAGGA CGAGAGCGGC TTCGTCAACC TGGTCATCTG GCAGCAGGTG
TGGGCCAGCT ACGGCGTGCT CGCCAAATCC ACCGCGTTCC TGGGTGTGAG CGGCCGCGTA
CAGGCCGAAG AGGGCCTGGT GCACCTGGTC GTCGAGTCGC TGTGGACGCC GCAGGTCGTG
CGCGGCGACG GCGTGCCCCC GCCCAAGCGC CGCGACTTCC GCTGA
 
Protein sequence
MSTYAPLWCK SNFSFLEGAS HPDELIEEAH ALGLRALALS DRDGLYGVVR AHVCAEKIGF 
KLIHGAQVSV DDGSQIVLLC RDRAGYANLC HLLTKGRRRS DKGSSQVSWR EVCAHAGGLI
ALWGGAGSLL TRPGENRPGG VPADIPRPLG GRPQSYLGRV ADDLREAFGD ALYALCARHR
EAEEVVTEAR LRARAERFGL PVAAAVEVLY HSRARRPLQD VLTCLRHHVT LSTAGRYIRA
NDEHDLHSPQ AFGILFDDDP AAVARTLDIA ARCQFGLGEI RYRYPSERLP SGKTTSEWLR
ELSFEGARWR YRGEVPADVR AQLTRELALI DELDYGGYFL TMYEIVRFCR AQGILCQGRG
SAANSAVCYC LDITAVDPVR MGLLFERFLS RERAEPPDID LDIEHDRREE VIQHVYDKYG
RDHAAMVAVV IRYRPRSAVR DVGKVLGIPA TSLDRCAKLL SHYEGITAEA LEQAGMDPHL
PAHQHLGRLA SEILDFPRHL SIHPGGFLLG HEPVHSLVPI ENGAMAGRTV IQWDKNDLED
LGLFKVDLLG LGALNQLHRC FDLVSEHRGI DLSMATIPAD DTATYDMICR ADTVGVFQIE
SRAQMSMLPR LRPRYFYDLV IEVSIVRPGP ITGGMVHPYL RRRHGLEKIE YPHESLEPVL
ERTLGVPLFQ EQVMRLAMVA ADYTPGEADQ LRRDMAAWRR SGRIDQHRER LVSAMTRKGI
AAEFAERVFE QIRGFGEYGF PESHAASFAL IAYATAYMRC HFPAEYACAL LNAQPMGFYS
PATIINDARR HGVSVRPIDV GASAWDCTLE PLPASQRRTT EENGDSGDSD SDAPARICYA
IRMGLRYVKG LRRDAGTRIE DARARAPFAD LGDFVRRTRL DERSHTRLAE SGALAAFGRN
RRDVLWQVRG HQRAKSDTLS LPQTGPAPSL AQLDQLDEIL WDYQASLHST RGHPLEPLRA
SLRAQNIADA RSVQRMRHGQ RLRYAGLVIC RQRPPTAAGV TFMTLEDESG FVNLVIWQQV
WASYGVLAKS TAFLGVSGRV QAEEGLVHLV VESLWTPQVV RGDGVPPPKR RDFR