Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1688 |
Symbol | |
ID | 8534846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1815315 |
End bp | 1818137 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646384072 |
Product | DNA polymerase I |
Protein accession | YP_003263560 |
Protein GI | 261856277 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGATC TCTTTTCCTC GCACGTTGCC CTTCCAGTCG AGCCCGCCGA ACCGACAGTG AAATCCAAAT CGCTGATTCT GGTGGATGGT AGTTCGTTCC TGTTCCGTGC CTTCCATGCG CTGCCCCCGC TGACGGCACC GGATGGCACA CCGACGGGCG CGATTCACGG CGTGATCAAT ATGTTGCAGA AACTTCGGCG AGAGGAAAAT CCCGACCGTA TGGCCGTGGT GTTCGATGCG CCCGGCCCGA CATTTCGCGA TGAACTCTAT CCCGAATACA AAGCACAACG CCCACCGCTG CCGGATGACC TGCGCGTGCA AATCGAGCCG GTGCATGAGT TGGTGCGAGC ATTGGGTTTT CCGTTGTTGT GCGTTTCGGG TGTGGAAGCC GATGATGTGA TCGGGACACT AATGCATCAG GCGCGGCAGA ACGGTGAATC GGTGCTGGTC GCGACCGCCG ACAAGGACTT TGCGCAGTTG GTGACCGAGG GAATACGTCT GGTGAACACC ATGACCAATA CGGTGCTGGA TGAAGCGGCT ATCGAAGCGA AATATGGCAT CACCGCAGCT CAGTTCATCG ATTACCTGAC GCTGGTTGGG GATACAGTAG ATAACGTGCC CGGTGTGCCG GGATGTGGCC CGAAAACAGC CGCCAAATGG CTCAATGAAT GGCAATCGCT GGATAATTTG ATGGCCCATG CCGATCAAAT CAAAGGCAAG GTGGGCGAGT CGCTGCGGGC TGCCAAGGAA TTTCTGCCCA TCGGGCGTGA GCTGGTGACG ATTCGCACCG ATTGCGACCT GCCCATTGCC CTGGCGGATC TGGCTGTAGA GGAACCGGAT GTCGATGCAG TCCGTGCGCT GGCGGAGAAG TTTGGACTCA ATACGCTACG CAGGCAGTTT TCAGAGGCGT CTTCGGTTCC TGCTCCCGTT TCTACACCAA GCCAGGAAAC TCGGCGTACT TCTTCGGATG ATGGGCAACT GCCCCTGAGC GATCCGCCGA TGTACGAAAC CATCCTGACG GATGCCGATT GGCAGCGTTG GCTGGAGCAA ATCAAAAACG CGGACAAGAA AACCGACAAG AAGATGGATT GGGTGGCTTT TGACACCGAA ACCGATTCGC TGGATTTATT CGCTGGCCGG ATCGTCGGCG TTTCCTTTTC GATTGAAGAC AATCGTGCGG CCTACGTGCC GCTGGCACAC AACTATCCCG GCGCGCCGGC GCAACTGGAT CGGGACACGG TGCTGGCCGA TCTCAAACCT TGGCTTGAAG ATGCATCCCG AACCAAGGTG ATGCAGAACG CCAAGTTCGA CAGCCACATG TTGGCTAATC ATGGCATCAC GCTGCGCGGC GTGTTATTCG ACACCATGCT CGAATCCTAT GTGCTGGATT CAACCGCCAC ACGGCATGAC ATGGATTCGC TGGCGGCAAA GTATTTGGGG CGCTCGACCA TCACGTTTGA AGATATTGCC GGCAAGGGCG CCAAAGCACT GAGCTTCCCC GAGATACATC TTGAACAAGC GGGCCCCTAT GCGGCCGAAG ATGCCGATGT AACGGGGCAA TTGCAGCAGT GTTTGTGGCC TAAATTGTCG GTCGAACCCG ATTTACGCTC AGTGTACGAA ACAATCGAGC AGCCGCTGAT TGAAGTGCTG GTGGCCATGG AGCGCGCGGG GGTGCGGGTG GATCGGGGTG AGCTCGCAAT TCAGGGCAAG GCCATCGGTG AGCGGATTGC CGCGGTGGAG CAGGCCGCGT TCAAAGAAGC TGGGCGCGAA TTCAATCTGG GGTCGACCAA ACAGCTCAAG GAGTTGTTGT TCGATGAACT CAAACTGCCC GTGGGCAAAA AGACGCCAAA GGGCGAGCCG TCTACCGACG AAGAAGCGCT TGGCGAGCTG GTAGGTAGCC ATCCCTTGCC CGCGTTGATT CTCGATTACC GCGGGCTAAG CAAGCTCAAA TCAACCTATA TCGACCGATT GCCCGAAGAC ATCCACACCC ACACGGGCCG TGTCCATAGC GCTTTTCATC AAGCCGTGAC CGCGACCGGG CGGCTTTCTT CATCCAATCC GAACCTGCAG AACATTCCGA TTCGCAGTGA AGAAGGCCGG CGGATACGGC AGGCATTTGT CGCCGATCCG GGGTGCAAAC TCATTTCAGC AGACTACTCG CAGATCGAAT TACGCATCAT GGCGCATTTG TCCGAAGATG AGCGTCTATG CGCGGCCTTT GCCGCCGGAG AAGATATTCA CCGTGCCACG GCGGCGGAGG TGTTCGGGGT CAAGGAAGTC GAAGTTTCCG ATAATCAGCG TCGAGCGGCG AAAGCAATCA ATTTCGGTTT GATTTATGGC ATGTCCGCCT TCGGGTTGGC CAAACAGCTT GATGTGCCGC GTGGTGAGGC GCAGGCCTAT ATCGATCTCT ATTTCGCCCG CTATCCCGGC GTGGCGAAAT ACATGGAGCG AATGCGGCAG CAGGCGCGCC AGATGGGCTA CGTGGAAACC GTATTCGGTC GTCGCTTATA TTTGCCGGAG ATTCACAGCC GTAATGGCCA GCGACGCCAA TATGCCGAGC GAACCGCCAT CAACGCGCCG ATGCAGGGTA CGGCGGCGGA TATCATCAAG ATAGCGATGA TCGCTTTGCA TAAGCTGCTG GTGGTGCCCG GGCGAGCCCG GATGATTTTA CAGGTGCACG ATGAATTGAT CTTCGAGGTG CCCGAGTCCG ACGTCGCCGA GATCGAGCCG ATCATCCGCG CACAGATGAC AGGGGCTGCA AAATTGAATG TGCCGCTTGA AGTGGGTATC GGCATCGGGA GAAGTTGGGC CGAAGCGCAC TAG
|
Protein sequence | MNDLFSSHVA LPVEPAEPTV KSKSLILVDG SSFLFRAFHA LPPLTAPDGT PTGAIHGVIN MLQKLRREEN PDRMAVVFDA PGPTFRDELY PEYKAQRPPL PDDLRVQIEP VHELVRALGF PLLCVSGVEA DDVIGTLMHQ ARQNGESVLV ATADKDFAQL VTEGIRLVNT MTNTVLDEAA IEAKYGITAA QFIDYLTLVG DTVDNVPGVP GCGPKTAAKW LNEWQSLDNL MAHADQIKGK VGESLRAAKE FLPIGRELVT IRTDCDLPIA LADLAVEEPD VDAVRALAEK FGLNTLRRQF SEASSVPAPV STPSQETRRT SSDDGQLPLS DPPMYETILT DADWQRWLEQ IKNADKKTDK KMDWVAFDTE TDSLDLFAGR IVGVSFSIED NRAAYVPLAH NYPGAPAQLD RDTVLADLKP WLEDASRTKV MQNAKFDSHM LANHGITLRG VLFDTMLESY VLDSTATRHD MDSLAAKYLG RSTITFEDIA GKGAKALSFP EIHLEQAGPY AAEDADVTGQ LQQCLWPKLS VEPDLRSVYE TIEQPLIEVL VAMERAGVRV DRGELAIQGK AIGERIAAVE QAAFKEAGRE FNLGSTKQLK ELLFDELKLP VGKKTPKGEP STDEEALGEL VGSHPLPALI LDYRGLSKLK STYIDRLPED IHTHTGRVHS AFHQAVTATG RLSSSNPNLQ NIPIRSEEGR RIRQAFVADP GCKLISADYS QIELRIMAHL SEDERLCAAF AAGEDIHRAT AAEVFGVKEV EVSDNQRRAA KAINFGLIYG MSAFGLAKQL DVPRGEAQAY IDLYFARYPG VAKYMERMRQ QARQMGYVET VFGRRLYLPE IHSRNGQRRQ YAERTAINAP MQGTAADIIK IAMIALHKLL VVPGRARMIL QVHDELIFEV PESDVAEIEP IIRAQMTGAA KLNVPLEVGI GIGRSWAEAH
|
| |