Gene Information |
Locus tag | Haur_3922 |
Symbol | |
ID | 5735783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4915366 |
End bp | 4917093 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281073 |
Product | DNA polymerase III, subunits gamma and tau |
Protein accession | YP_001546684 |
Protein GI | 159900437 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2812] DNA polymerase III, gamma/tau subunits |
TIGRFAM ID | [TIGR00678] DNA polymerase III, delta' subunit; [TIGR02397] DNA polymerase III, subunit gamma and tau |
|
|
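The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_3922
length = gene_length_bp(4915366, 4917093)
print(length)                     # 1728
print(protein_length_aa(length))  # 575
```

Both results agree with the Gene Length (1728 bp) and Protein Length (575 aa) fields listed in the record.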
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0150926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGC AAGCGTTATA TCGAAAATGG CGGTCGCAGA CCTTCGATGA TTTGGTGGGC CAAAGCCATA TTGTGCAAGC CTTGCGCAAT GCTATTGCCG CCAATCGCAT TGGCCATGCC TATTTATTTA CTGGGCCGCG TGGGGTTGGC AAAACCAGTG CTGCACGGAT TCTCTCCAAG GCGGTCAATT GCGAGCAAAG CGATCCGCGT TTGCGGCCTT GTGGCGAGTG CAGCACCTGC CGCGCAATTG CCGAAGGTCG GGCCGTCGAT GTGATTGAGA TGGACGCTGC CTCGCATACC AGCGTTGATG ATGCCCGCGA AATTATCGAA AAAGTGCAAT TTCGCCCAAC CCAATTTCGC AAAAAAGTCT ATGTGATCGA CGAAGTGCAT ATGCTGAGCA CCGCTGCCTT CAATGCGTTG CTCAAAACCC TCGAAGAACC ACCTGACCAT GCCATGTTTA TTTTGGCGAC GACTGAATTT CACAAAGTGC CAGCGACGAT TCTTTCGCGC TGTCAGCGCT TTGTGTTCAA TCGCCATACG ATTGCTAACA CAATTGCCCA CCTCGAATGG GTCGCTGGCG AAGAAGGCGT TTTCCTTGAG CCTGGCGTGG CTGAGGCGGT GGCACGCGCA GCAACTGGCT CGATGCGCGA TGCCATGAGC ATCCTTGACC AGTTAATGGG CTATGGCGAA CCGCAAATTC CATTGACTCG GGTGCAAAGT TTGCTGGGGG CAACCGCCTC GCGCGAGGTC GAAACCTTAG TCGCCGCCTT TGCCGCCGAA GATGTGGCCG CGGCATTAAG CGTGATCAAC ACGATTGCCG ACCAAGGCGC TGATTTACGT CAATTCACCC GCGATGTGGT GAGCTACTTA CGCGGCCTGA TGTTGCTTAA ATCGGGCGGA GCCGCCGATT TGCTTGATGT TGGGCACGAC GTGTTGGCTA CCATGCAAAG CCATAGCCAA CAACTGGCCT TGGCAGCGAT TTTGGCGTGG CTCAAAATTT TCAGCGGACT CGATCATCAA CTACGCACTA CGCCCTATGG TCAATTGCCC TTAGAAATGG CGGTTGTTGA AGCCTTGGTT GTGCCAGTGC CAGCTGCGGT GGCTGCACCA AGCCCAGTTC GCGGCACGGT GGCTCCGATA ACGCGCCCGA ATCCGGCGGT TGCACGCCCA GCTGAGCCAG CACCTGTCCA ACGCCAAACC CCAACGCCAA GTGTGGTTAC GCCACGCCCA GTCGAGCAAC CAGTCGTTGA GCAACCACCA CAGGCCGAAC CAACGCCAGT GCCAGTTGCT GCTGTAGCAG CTCAACCAAT GCCCGAAGCC GAGCAACATA TCGTTTTGGC TGAATCGGAA ATTTTGTTGG CCGAGGTCGA GGCGGTTTGG CTGCAAGTGA TCGAAGATCT CAAGCCCTAC AATCCGCGCT TGCAAGCGGT GCTCAAAAGC TGCGAACCAT TGGCGCTTGA AGACAACACC TTGGTGATCG GCACACCCTC GCCATTCCAT ACCAAACAAC TCGACGACCA AACCCAACGC CGGTTGATCG AAGATTTACT GGCCAAAGCG GTGAATCGGC AGATGTTTGT GCGCGGCGAA GAAGCCAACC GCGATCAGCA AAATCGTGCC CGTGATGCTC GTCGCCAGCG CGAAGAGATC ATGAAAGATC ATGTGGTCAA GGCTGCCCGT AATATTTTCG ATGCCCGCAT TGTCGGCGTG CAAGAGGATG GCAGCTAA
|
Protein sequence | MSSQALYRKW RSQTFDDLVG QSHIVQALRN AIAANRIGHA YLFTGPRGVG KTSAARILSK AVNCEQSDPR LRPCGECSTC RAIAEGRAVD VIEMDAASHT SVDDAREIIE KVQFRPTQFR KKVYVIDEVH MLSTAAFNAL LKTLEEPPDH AMFILATTEF HKVPATILSR CQRFVFNRHT IANTIAHLEW VAGEEGVFLE PGVAEAVARA ATGSMRDAMS ILDQLMGYGE PQIPLTRVQS LLGATASREV ETLVAAFAAE DVAAALSVIN TIADQGADLR QFTRDVVSYL RGLMLLKSGG AADLLDVGHD VLATMQSHSQ QLALAAILAW LKIFSGLDHQ LRTTPYGQLP LEMAVVEALV VPVPAAVAAP SPVRGTVAPI TRPNPAVARP AEPAPVQRQT PTPSVVTPRP VEQPVVEQPP QAEPTPVPVA AVAAQPMPEA EQHIVLAESE ILLAEVEAVW LQVIEDLKPY NPRLQAVLKS CEPLALEDNT LVIGTPSPFH TKQLDDQTQR RLIEDLLAKA VNRQMFVRGE EANRDQQNRA RDARRQREEI MKDHVVKAAR NIFDARIVGV QEDGS
|
| |
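The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own tooling: it assumes the sequence is pasted with the 10-base spacing used above, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing) is removed first.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from the record
print(translate("ATGTCGTCGC AAGCGTTATA TCGAAAATGG"))  # MSSQALYRKW
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence in the same way should reproduce the whole protein, ending at the terminal TAA stop codon.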