Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3204 |
Symbol | |
ID | 8545592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 4417633 |
End bp | 4420857 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646387871 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_003267599 |
Protein GI | 262196390 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0855873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCT ACGCGCCGCT GTGGTGCAAG AGCAACTTCT CGTTCCTCGA GGGCGCCAGC CACCCCGACG AGCTCATCGA GGAGGCCCAC GCCCTGGGCC TGCGCGCGCT CGCGCTCAGC GATCGCGACG GCCTCTACGG CGTGGTCCGC GCCCACGTGT GCGCCGAGAA GATCGGCTTC AAGCTCATCC ACGGCGCCCA GGTGAGCGTC GATGACGGCA GCCAGATCGT GCTCCTGTGC CGCGACCGCG CCGGCTACGC CAACCTGTGC CATCTGCTCA CCAAGGGCCG GCGGCGCTCG GACAAGGGCA GCTCGCAGGT GTCGTGGCGC GAGGTGTGCG CGCACGCCGG CGGCCTCATC GCGCTGTGGG GCGGCGCCGG CAGCCTGCTC ACGCGCCCGG GCGAGAACCG GCCCGGCGGC GTCCCGGCCG ACATCCCGCG GCCCCTGGGC GGTCGCCCTC AGAGCTATCT CGGACGCGTG GCCGACGACT TGCGCGAGGC CTTTGGCGAC GCGCTCTACG CCCTGTGCGC GCGCCACCGC GAGGCCGAAG AGGTGGTCAC CGAGGCCCGC CTGCGCGCGC GCGCCGAGCG CTTCGGCCTG CCCGTGGCGG CCGCGGTCGA GGTGCTCTAC CACAGCCGCG CGCGCCGGCC GCTGCAGGAC GTCCTCACCT GCCTGCGCCA CCACGTCACC CTGAGCACCG CCGGCCGCTA CATCCGCGCC AACGACGAGC ACGACCTGCA CTCGCCCCAG GCCTTTGGCA TCCTCTTTGA CGACGACCCC GCGGCCGTGG CCCGCACCCT CGACATCGCC GCGCGCTGCC AGTTCGGGCT CGGCGAGATC CGCTACCGCT ACCCCTCGGA GCGGCTGCCG AGCGGCAAGA CCACCTCCGA GTGGCTGCGC GAGCTCAGCT TCGAGGGCGC GCGCTGGCGC TACCGCGGCG AGGTCCCCGC CGATGTCCGC GCGCAGCTCA CGCGCGAGCT CGCGCTCATC GACGAGCTCG ACTACGGCGG CTACTTCCTC ACGATGTACG AGATCGTCCG CTTCTGTCGC GCCCAGGGCA TCCTGTGCCA GGGCCGCGGC TCGGCCGCCA ACTCGGCCGT GTGCTACTGC CTCGACATCA CCGCCGTGGA CCCGGTGCGC ATGGGCCTCT TGTTCGAGCG CTTCCTGTCG CGCGAGCGCG CCGAGCCGCC CGACATCGAC CTCGACATCG AGCACGATCG CCGCGAAGAG GTCATCCAGC ACGTCTACGA CAAGTACGGG CGCGATCACG CCGCCATGGT CGCCGTGGTC ATCCGCTACC GGCCGCGCTC GGCCGTGCGC GACGTCGGCA AGGTGCTCGG CATCCCGGCG ACCTCGCTCG ACCGCTGCGC CAAGCTGCTC TCGCACTACG AGGGCATCAC GGCCGAGGCC CTCGAGCAGG CCGGCATGGA CCCGCACCTG CCCGCGCACC AGCACCTGGG CCGCCTGGCC AGCGAGATCC TCGACTTCCC GCGCCACCTC TCGATCCACC CCGGCGGCTT CCTGCTCGGC CACGAGCCCG TGCACAGCCT GGTGCCCATC GAGAACGGCG CCATGGCCGG GCGCACGGTC ATTCAGTGGG ACAAGAACGA CCTCGAAGAC CTCGGCCTGT TCAAGGTCGA CCTGCTCGGC CTGGGCGCGC TCAACCAGCT CCACCGCTGC TTCGACCTGG TGTCCGAACA CCGCGGCATC GACCTGAGCA TGGCCACCAT CCCGGCCGAC GACACCGCCA CCTACGACAT GATCTGCCGC GCCGATACCG TCGGCGTCTT CCAGATCGAG AGCCGCGCGC AGATGTCCAT GCTGCCGCGG CTGCGGCCGC GCTACTTCTA CGACCTGGTC ATCGAGGTGA GCATCGTGCG CCCGGGGCCG ATCACGGGCG GCATGGTCCA CCCCTACCTG CGCCGGCGCC ACGGCCTCGA GAAGATCGAA TATCCCCACG AGAGCCTCGA GCCGGTGCTC GAGCGCACCC TGGGCGTGCC GCTGTTTCAA GAGCAGGTGA TGCGCCTGGC CATGGTCGCG GCCGACTACA CCCCGGGCGA AGCCGACCAG CTCCGCCGCG ACATGGCCGC CTGGCGCCGC AGCGGCCGCA TCGACCAGCA CCGCGAGCGC CTGGTCTCGG CCATGACCCG CAAGGGCATC GCGGCTGAGT TCGCCGAGCG CGTGTTCGAA CAGATCCGCG GCTTCGGCGA GTACGGCTTC CCCGAGAGCC ACGCCGCCAG CTTCGCGCTC ATCGCCTACG CCACCGCCTA CATGCGCTGT CACTTCCCGG CCGAATACGC GTGCGCGCTG CTCAACGCCC AGCCCATGGG CTTCTACTCG CCGGCCACCA TCATCAACGA CGCCCGCCGT CACGGCGTGA GCGTGCGCCC TATCGACGTC GGCGCCAGCG CCTGGGACTG CACCCTCGAG CCCCTGCCGG CGAGCCAGCG CCGCACCACC GAAGAGAACG GCGACAGCGG CGACAGCGAC AGCGACGCGC CCGCGCGCAT CTGTTACGCC ATCCGCATGG GCCTGCGCTA CGTCAAGGGC CTGCGCCGCG ACGCCGGGAC GCGCATCGAA GATGCCCGCG CCCGCGCGCC CTTCGCCGAC CTCGGCGATT TCGTACGCCG CACCCGACTC GACGAGCGCT CGCACACCCG CCTGGCCGAA TCCGGCGCCC TGGCCGCCTT TGGGCGCAAT CGCCGCGACG TGCTGTGGCA GGTGCGCGGC CATCAGCGCG CGAAATCGGA CACCCTGTCC CTGCCCCAGA CCGGCCCGGC GCCCAGCCTG GCCCAGCTCG ACCAGCTCGA CGAGATCCTC TGGGACTACC AGGCCAGCCT GCACAGCACC CGCGGCCATC CGCTCGAGCC GCTGCGCGCC TCGCTGCGCG CCCAGAACAT CGCCGACGCG CGCTCTGTGC AGCGCATGCG CCACGGCCAG CGCCTGCGCT ACGCCGGCCT GGTCATCTGC CGACAACGCC CGCCCACGGC CGCCGGCGTG ACCTTCATGA CCCTCGAGGA CGAGAGCGGC TTCGTCAACC TGGTCATCTG GCAGCAGGTG TGGGCCAGCT ACGGCGTGCT CGCCAAATCC ACCGCGTTCC TGGGTGTGAG CGGCCGCGTA CAGGCCGAAG AGGGCCTGGT GCACCTGGTC GTCGAGTCGC TGTGGACGCC GCAGGTCGTG CGCGGCGACG GCGTGCCCCC GCCCAAGCGC CGCGACTTCC GCTGA
|
Protein sequence | MSTYAPLWCK SNFSFLEGAS HPDELIEEAH ALGLRALALS DRDGLYGVVR AHVCAEKIGF KLIHGAQVSV DDGSQIVLLC RDRAGYANLC HLLTKGRRRS DKGSSQVSWR EVCAHAGGLI ALWGGAGSLL TRPGENRPGG VPADIPRPLG GRPQSYLGRV ADDLREAFGD ALYALCARHR EAEEVVTEAR LRARAERFGL PVAAAVEVLY HSRARRPLQD VLTCLRHHVT LSTAGRYIRA NDEHDLHSPQ AFGILFDDDP AAVARTLDIA ARCQFGLGEI RYRYPSERLP SGKTTSEWLR ELSFEGARWR YRGEVPADVR AQLTRELALI DELDYGGYFL TMYEIVRFCR AQGILCQGRG SAANSAVCYC LDITAVDPVR MGLLFERFLS RERAEPPDID LDIEHDRREE VIQHVYDKYG RDHAAMVAVV IRYRPRSAVR DVGKVLGIPA TSLDRCAKLL SHYEGITAEA LEQAGMDPHL PAHQHLGRLA SEILDFPRHL SIHPGGFLLG HEPVHSLVPI ENGAMAGRTV IQWDKNDLED LGLFKVDLLG LGALNQLHRC FDLVSEHRGI DLSMATIPAD DTATYDMICR ADTVGVFQIE SRAQMSMLPR LRPRYFYDLV IEVSIVRPGP ITGGMVHPYL RRRHGLEKIE YPHESLEPVL ERTLGVPLFQ EQVMRLAMVA ADYTPGEADQ LRRDMAAWRR SGRIDQHRER LVSAMTRKGI AAEFAERVFE QIRGFGEYGF PESHAASFAL IAYATAYMRC HFPAEYACAL LNAQPMGFYS PATIINDARR HGVSVRPIDV GASAWDCTLE PLPASQRRTT EENGDSGDSD SDAPARICYA IRMGLRYVKG LRRDAGTRIE DARARAPFAD LGDFVRRTRL DERSHTRLAE SGALAAFGRN RRDVLWQVRG HQRAKSDTLS LPQTGPAPSL AQLDQLDEIL WDYQASLHST RGHPLEPLRA SLRAQNIADA RSVQRMRHGQ RLRYAGLVIC RQRPPTAAGV TFMTLEDESG FVNLVIWQQV WASYGVLAKS TAFLGVSGRV QAEEGLVHLV VESLWTPQVV RGDGVPPPKR RDFR
|
| |