Gene Information |
Locus tag | Haur_0014 |
Symbol | |
ID | 5736848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 16213 |
End bp | 18150 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277135 |
Product | DNA primase |
Protein accession | YP_001542794 |
Protein GI | 159896547 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0358] DNA primase (bacterial type) |
TIGRFAM ID | [TIGR01391] DNA primase, catalytic core |
|
|
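The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either point. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 16213
END_BP = 18150
GENE_LENGTH_BP = 1938
PROTEIN_LENGTH_AA = 645


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under those assumptions, 18150 - 16213 + 1 gives 1938 bp, and 1938 / 3 - 1 gives 645 aa, matching the table.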
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0081672 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCAG TTATCGACGA TATTAAAGAA CGCATTGACA TTGTGTCGTT CATTAGCGAT TATGTGCCGC TGCGTAAGGC GGGGCGCAAT TGGGTGGGCT TTTGCCCATT CCATTCGAAT ACCCGCACGC CAGCTTTTAC GGTGTTTGCC GATAGTAATA GCTACCACTG TTTTGGCTGC AAAGCCAGCG GCACAATTTT TGATTTTCTG ATGCAGCGCG AGGGCATCGA CTTCCCTGAG GCACTGAATC AACTTGCGGC CCGCGCAGGA GTGCAATTAC GCCAGCGTAC CGCCCAAGAT GACCAAGAAG ATGTGCTGCG TTCGCGCATG CTGGAGTTGA ACGCCGCCGC TGCCCGTTTT TGGAGTCATC AATTATTACA ATCGCCCAAA GCGGCTCATG TGCGTAGCTA CATCGAGCAA CGTGGGCTTA CCCCGCAAAC TGTTGAGCAA TTTCAGTTGG GCTATGCTAG CGAAGATTGG TCAAGTTTGC TGGGTTACTT GCACGACCGC CACGGAGCCA ATCCGAGTGA AGTGGCCGAG GTTGGCTTAG CCTTGGAACG TGAACAGGGC GGCTATTATG ATCGCTTTCG TGGGCGTTTG ATGTTTCCAA TTCATAATGC CAAGGGCCAG ATTGTAGGCT TTGGTGGGCG GATTTTGGGC GATGGCCACC CCAAATATAT CAACTCGCCG CAAACCTTGC TGTTTGATAA AAGTAGCCTG TTGTATGGGC TATTTCAAGC CCGCGAGACG ATTCGCAGCA CTGATAGCGT GGTGGTGGTT GAGGGCTATG TTGATGTGAT TATGGCCCAT CAAGCGGGCT TTCGCAATGT GCTAGCGCCG ATGGGCACAG CCTTGACCGA TGTGCATGCC GGCCAACTCA ACAAAATGAC CAAGCGCATC ACATTGGCAA TGGATGGCGA TGCTGCTGGC CAGTCGGCGG CCTTGCGTGG CTTGGAAACC TTGCGTGAAA CGCTCGATAC TCATGTGCGG CCTGTGCCAA CTGCTTCGGG CATGTTGCGC TGGGAGCGCG AGCTTGATGC CACCATCAAG ATTGCCCTGC TGCCTGATGG GCGTGACCCT GATGATATTG TGCGCAACAA CCCTGAGGAA TGGCGCTCGC TGATTGCCAA TGCTCAGCCG TTGATGGATT TTTATCTGGC AACTTTGACG AGAGGCTTGG ATTTGAATAG TGCCAAAGGC AAATCGACAG CAGTCGAGCG CCTTACGCCT TTGCTCAATC AAATTGCCGA TCCAGTTGAA AAAGCCCATT ATATTCAGCG GCTTGCCAAC CAGATCAAGA TCGATCAGCG ATTAATTGAA GGATCAATTG GTGGCGACCA ACCAGAGCGT CAGCGCAAAC CCAACCATCG TCAAAGCCGC CCCGCTGCAC CACCACCAGT TTTTTCAATC TTAGAGCAGG AGGCACGGCG CGAAGATTTT CTTTTATCGT TATTAATTCG CTTTCCCCAA GTGCAAGCAG TGGTTAGCGA ACAACTACGT GATGATTTAG CCGCAGCGCC AACCCTGCGC GAGTTGTGGA GTGGCGATGT GAGCGAACTT TTTGAACGAG TCGAAAATCG TGTATTATGG GATCTGTGGC GCTCAGCCCA GGCGATGGCT ACTCCCAATC CAATTGAATG GCTTGAAACG GTACCTGCCG AATTGCAAGC CCATGGCGAA CGCCTACTTA ATTGGGATCA TGCTCCACCA ACCCGTAGTT TCCGTGTGAC TCGTGAGGCC GAGGAATGTT TGCGCCCGTT GCGCCACCGT TTAGGCAAGC GCTGGAGCGA CCGCCTTGGT 
CAGATAATTG CCGCCGCCGA ACCTGATCAA CAGGAACGAT TGCTGGAGCA AGCCATGGCC ATACAACATT ATTTGCGTGC GACTTCGGAA CCGCGTCGCA GCAGCTATTT TCTTGATACC CGTGATTCGC TCAAATAA
|
Protein sequence | MNSVIDDIKE RIDIVSFISD YVPLRKAGRN WVGFCPFHSN TRTPAFTVFA DSNSYHCFGC KASGTIFDFL MQREGIDFPE ALNQLAARAG VQLRQRTAQD DQEDVLRSRM LELNAAAARF WSHQLLQSPK AAHVRSYIEQ RGLTPQTVEQ FQLGYASEDW SSLLGYLHDR HGANPSEVAE VGLALEREQG GYYDRFRGRL MFPIHNAKGQ IVGFGGRILG DGHPKYINSP QTLLFDKSSL LYGLFQARET IRSTDSVVVV EGYVDVIMAH QAGFRNVLAP MGTALTDVHA GQLNKMTKRI TLAMDGDAAG QSAALRGLET LRETLDTHVR PVPTASGMLR WERELDATIK IALLPDGRDP DDIVRNNPEE WRSLIANAQP LMDFYLATLT RGLDLNSAKG KSTAVERLTP LLNQIADPVE KAHYIQRLAN QIKIDQRLIE GSIGGDQPER QRKPNHRQSR PAAPPPVFSI LEQEARREDF LLSLLIRFPQ VQAVVSEQLR DDLAAAPTLR ELWSGDVSEL FERVENRVLW DLWRSAQAMA TPNPIEWLET VPAELQAHGE RLLNWDHAPP TRSFRVTREA EECLRPLRHR LGKRWSDRLG QIIAAAEPDQ QERLLEQAMA IQHYLRATSE PRRSSYFLDT RDSLK
|
| |
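The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any downstream use. A minimal sketch, assuming the only non-sequence characters are whitespace: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the fragment shown is just the first three blocks of the gene sequence, so its GC fraction will not equal the 53% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence, as displayed in the record.
fragment = "ATGAACTCAG TTATCGACGA TATTAAAGAA"

cleaned = clean_sequence(fragment)
print("length:", len(cleaned))
print("GC percent:", round(100 * gc_fraction(cleaned)))
```

Passing the full Gene sequence field through the same two functions would give a length and GC percentage to compare against the Gene Length and GC content rows.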