Gene Information |
Locus tag | Haur_3224 |
Symbol | |
ID | 5735092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4077728 |
End bp | 4080511 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280370 |
Product | DNA polymerase III, epsilon subunit |
Protein accession | YP_001545989 |
Protein GI | 159899742 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases; [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family; [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative |
|
|
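The coordinate and length fields in the Gene Information table are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based, inclusive coordinates (the listed values are consistent with this) and that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4077728, 4080511)
print(length, protein_length_aa(length))  # prints "2784 927"
```

The result agrees with the Gene Length (2784 bp) and Protein Length (927 aa) fields.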
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00101103 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAACCGA TATATATCGC CTTGGATTTG GAAACAACTG GCTTAGAACC AGGGCGCGAT GAGATTATTG AAGTTGGGGC AGTTAAATTT CGAGGCAATG AGGTTCTCGA AACCTACCAA ACCTTGGTCA AACCCAAACA GGTGCTGCCG ATTAAAATCG CCCGTTTAAC AGGCATCGAT GCCCATGAAT TGACCACGGC TCCTACATTT AATAGTATTG GTGGCCAATT AGCCAAATTT CTCAAAAGCT ACCCCATTAT TGGCCATTCG GTCGATAACG ATTTGCGCTT TTTGCAACAA CAAGGGCTAA AAGTTACCCA GCCGCATTAC GATACCTTTG ATCTGGCGAC GCTGTTGATT CCCCAATTGC CCAATTACTC GCTTTCAACG ATTGCCGAAC ATTTGCAAAT TCAACACCCT GATGCCCACC GCGCCTTGGC CGATGCCGAG GCCAGTCGCT TGGTGTTTAG CGCATTGCTC GATAAATTAG CCGAATTATC GGCTGCCGAA TTGCATAGCA TCGCTCAAAC CACCCAAAAA TTGCAGTGGC CGCTGGCCAA ACTATTTGGC GAAATTGCCA AACGCCGCGT GCAAACGCTC TGGCAAGCGC CGATCGAATT TCAACCGAAA CCACTGGTGC GCCCGGTGGC CCTCGAACCA ACCGGCAATC AGCAAGAACT CGATGCCCAA GCAATTGGCG CGATGTTTGG CGCAGATGGT GGGTTTAGCC GCATGTTCCC AGGCTATGAG CCACGTCAGC CCCAAATTGA GATGACCGAG GCAATCGCCG AAGCACTCAA TCAAGGCGAT ACCTTGATGA TCGAAGCGCC AACTGGCACT GGCAAAAGTT TGGCTTACCT CGTGCCCGCT GCTCAATGGG CACGCCAGCG CGGCGAACGA GTGGTCATCT CAACCAACAC GATCAATCTC CAAGATCAGC TTTGCTCCAA AGATATTCCT ACCGTGCAAG CGTTGTTGGC CGAGCAACCC GACCAATGGC CCGCTTTGCG AGCGGTGCAA CTCAAAGGCC GCAGCAATTA CCTATGTTTG AAGCGCTATG AATCCTTTCG TGCCCACCCC GACCACAACG AAGATCAGAC CCGTGGGTTG TTGAAGCTCC AACTTTGGCT GCCCTCGACC AACAGCGGCG ACCGCGCTGA ATTGATGTTG ATTCAAGGCG AGCAGCAAGT TTGGAACAAT GTTAATGTTG ATCCCGACCA ATGTTTGCGC CAACGCTGCT CGCTCTACAA CGAATGTTTC TTCTTCAAAG CCCGCGCTGA AGCTGAAAAT GCCCATATCG TGGTGGCGAA CCATGCCTTG TTGATGTCGG ATGTCAAATC GCCTGGGATT TTGCCACGCT ACGATCATTT GATCATCGAC GAAGCGCATA ATCTCGAAGA TGTGGCGACT GATCAGTTGG GCTTTACGAT TTCACAACAT AGCCTAACTG GTTTGCTCAA TGATATGCAT AGCGCTGGCG GTGTGCGTTT GGCGGGTGGC GTGCTCAACG AATGGAGCCA AATCTTCCGT TTGAGCACCG TTGATCATAA AGAGCAGCGC AAACTCGAAG ATCTCAGCGC CGATTTGCGA CCAAATGTTG ATAAAGCCCG CGAAGCGGCC CAGCAATTAT TCAGCATTTT CAACGATATT ATGGCCAAAG ATCGCAGTGT GACCCAATAC GATCCCCAAT TGCGGATCAC CAGCAAAGTG CGTCGCCACA CCGAATGGAC TCAAGTTGAG CAAACATGGG AAAACTTGAG CATCAATTTG CGCAAGCTGG GCGATGGCTT TGGCAAGCTC 
CAAGCAATTT TGGATAATCT CGAAGGCCGC GATATCAATG GCTACGATGA TTTGGTGATG CGGGTCAAGG GTATGGTCAA TGCTTGCACT GAATTACAAC GCCAATTTGA TGTGGTAATT TATGGCAATG AAGAAACCGT CGCATGGCTG ACTGCCGATC AACGTCGCCG CGAATTGTTG GTGCAGGCTG CGCCAATTCA TGTTGGGCCA TTGCTCACTG AAGATTTATG GCTGAAAAAA CGCGCCAGCA TCTTGGTTTC GGCGACGCTT TCGGTCAGCA ACAGCTTCGA TTACCCCAAA CAGCGCTTGG GCTTGGACGA AGCCACGACG ATGCAACTCG ATTCGCCCTT CGATTACAGC AAATCAACCT TAATCTATTT GCCAACCGAT ATGCCCGAAC CCAACGAGCG CAATTATCAA CGGGCCATGG AAGATGCCCT GATCAATTTA TGCAAAGCGA CTGGCGGGCG CACTTTGGCA CTGTTTACCG CCAATGCCTC GCTGAAACAA ACCTATCATG GCATTAGCGA AAGCCTTGAG CAAGCCGATA TTTCGACCTT GGCCCAAGGC ATGGATGGCT CACGCCGCTC GTTGATCCAG CGCTTCAAAT CTGACCCACG CACGGTTTTG TTGGGCACAG CCTCGTTTTG GGAAGGTGTT GATGTGGTTG GCGATGCTTT GAGCGTACTG GTGATTACCA AATTGCCCTT TAGCGTGCCA AATGATCCGG TGTTTTCGGC GCGATCTGAG GGCTTTGATG ATGCTTTTGC TGAATATTCA GTGCCGCAGG CAATTTTGCG TTTCAAGCAA GGCTTTGGCC GCTTGATTCG CTCCAAAGAT GATCGCGGGA TTGTGGTGGT GCTTGATCGG CGCTTGCTTA GCAAAAATTA TGGGCGGCAA TTCCTCGAAT CATTGCCCGA TTGCACGATT CAACGCAAGC CGCTCGCCGA ATTGGCAACA ACGGCTGCTC GTTGGTTGGT TTAA
|
Protein sequence | MEPIYIALDL ETTGLEPGRD EIIEVGAVKF RGNEVLETYQ TLVKPKQVLP IKIARLTGID AHELTTAPTF NSIGGQLAKF LKSYPIIGHS VDNDLRFLQQ QGLKVTQPHY DTFDLATLLI PQLPNYSLST IAEHLQIQHP DAHRALADAE ASRLVFSALL DKLAELSAAE LHSIAQTTQK LQWPLAKLFG EIAKRRVQTL WQAPIEFQPK PLVRPVALEP TGNQQELDAQ AIGAMFGADG GFSRMFPGYE PRQPQIEMTE AIAEALNQGD TLMIEAPTGT GKSLAYLVPA AQWARQRGER VVISTNTINL QDQLCSKDIP TVQALLAEQP DQWPALRAVQ LKGRSNYLCL KRYESFRAHP DHNEDQTRGL LKLQLWLPST NSGDRAELML IQGEQQVWNN VNVDPDQCLR QRCSLYNECF FFKARAEAEN AHIVVANHAL LMSDVKSPGI LPRYDHLIID EAHNLEDVAT DQLGFTISQH SLTGLLNDMH SAGGVRLAGG VLNEWSQIFR LSTVDHKEQR KLEDLSADLR PNVDKAREAA QQLFSIFNDI MAKDRSVTQY DPQLRITSKV RRHTEWTQVE QTWENLSINL RKLGDGFGKL QAILDNLEGR DINGYDDLVM RVKGMVNACT ELQRQFDVVI YGNEETVAWL TADQRRRELL VQAAPIHVGP LLTEDLWLKK RASILVSATL SVSNSFDYPK QRLGLDEATT MQLDSPFDYS KSTLIYLPTD MPEPNERNYQ RAMEDALINL CKATGGRTLA LFTANASLKQ TYHGISESLE QADISTLAQG MDGSRRSLIQ RFKSDPRTVL LGTASFWEGV DVVGDALSVL VITKLPFSVP NDPVFSARSE GFDDAFAEYS VPQAILRFKQ GFGRLIRSKD DRGIVVVLDR RLLSKNYGRQ FLESLPDCTI QRKPLAELAT TAARWLV
|
| |
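The sequences above are shown in space-separated blocks of ten characters. The following is a minimal Python sketch, not the database's own method, for normalizing such a blocked sequence and running basic sanity checks against translation table 11. The start and stop codon sets are those of the NCBI bacterial, archaeal and plant plastid code. The helper names are hypothetical.

```python
# Initiation and termination codons of NCBI translation table 11.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def normalize(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def check_cds(blocked: str) -> dict:
    """Report length, reading-frame, start-codon and stop-codon checks."""
    seq = normalize(blocked)
    return {
        "length_bp": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "start_codon_ok": seq[:3] in START_CODONS_TABLE_11,
        "stop_codon_ok": seq[-3:] in STOP_CODONS_TABLE_11,
    }


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = normalize(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Toy input for illustration only; paste the full Gene sequence field instead.
toy = "GTGGAACCG TAA"
print(check_cds(toy))
print(gc_percent(toy))
```

The gene sequence in this record begins with GTG and ends with TAA. GTG is an alternative initiation codon in table 11 and is read as methionine at the start position, which matches the leading M of the protein sequence.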