Gene Hneap_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1688 
Symbol 
ID8534846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1815315 
End bp1818137 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content57% 
IMG OID646384072 
ProductDNA polymerase I 
Protein accessionYP_003263560 
Protein GI261856277 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATC TCTTTTCCTC GCACGTTGCC CTTCCAGTCG AGCCCGCCGA ACCGACAGTG 
AAATCCAAAT CGCTGATTCT GGTGGATGGT AGTTCGTTCC TGTTCCGTGC CTTCCATGCG
CTGCCCCCGC TGACGGCACC GGATGGCACA CCGACGGGCG CGATTCACGG CGTGATCAAT
ATGTTGCAGA AACTTCGGCG AGAGGAAAAT CCCGACCGTA TGGCCGTGGT GTTCGATGCG
CCCGGCCCGA CATTTCGCGA TGAACTCTAT CCCGAATACA AAGCACAACG CCCACCGCTG
CCGGATGACC TGCGCGTGCA AATCGAGCCG GTGCATGAGT TGGTGCGAGC ATTGGGTTTT
CCGTTGTTGT GCGTTTCGGG TGTGGAAGCC GATGATGTGA TCGGGACACT AATGCATCAG
GCGCGGCAGA ACGGTGAATC GGTGCTGGTC GCGACCGCCG ACAAGGACTT TGCGCAGTTG
GTGACCGAGG GAATACGTCT GGTGAACACC ATGACCAATA CGGTGCTGGA TGAAGCGGCT
ATCGAAGCGA AATATGGCAT CACCGCAGCT CAGTTCATCG ATTACCTGAC GCTGGTTGGG
GATACAGTAG ATAACGTGCC CGGTGTGCCG GGATGTGGCC CGAAAACAGC CGCCAAATGG
CTCAATGAAT GGCAATCGCT GGATAATTTG ATGGCCCATG CCGATCAAAT CAAAGGCAAG
GTGGGCGAGT CGCTGCGGGC TGCCAAGGAA TTTCTGCCCA TCGGGCGTGA GCTGGTGACG
ATTCGCACCG ATTGCGACCT GCCCATTGCC CTGGCGGATC TGGCTGTAGA GGAACCGGAT
GTCGATGCAG TCCGTGCGCT GGCGGAGAAG TTTGGACTCA ATACGCTACG CAGGCAGTTT
TCAGAGGCGT CTTCGGTTCC TGCTCCCGTT TCTACACCAA GCCAGGAAAC TCGGCGTACT
TCTTCGGATG ATGGGCAACT GCCCCTGAGC GATCCGCCGA TGTACGAAAC CATCCTGACG
GATGCCGATT GGCAGCGTTG GCTGGAGCAA ATCAAAAACG CGGACAAGAA AACCGACAAG
AAGATGGATT GGGTGGCTTT TGACACCGAA ACCGATTCGC TGGATTTATT CGCTGGCCGG
ATCGTCGGCG TTTCCTTTTC GATTGAAGAC AATCGTGCGG CCTACGTGCC GCTGGCACAC
AACTATCCCG GCGCGCCGGC GCAACTGGAT CGGGACACGG TGCTGGCCGA TCTCAAACCT
TGGCTTGAAG ATGCATCCCG AACCAAGGTG ATGCAGAACG CCAAGTTCGA CAGCCACATG
TTGGCTAATC ATGGCATCAC GCTGCGCGGC GTGTTATTCG ACACCATGCT CGAATCCTAT
GTGCTGGATT CAACCGCCAC ACGGCATGAC ATGGATTCGC TGGCGGCAAA GTATTTGGGG
CGCTCGACCA TCACGTTTGA AGATATTGCC GGCAAGGGCG CCAAAGCACT GAGCTTCCCC
GAGATACATC TTGAACAAGC GGGCCCCTAT GCGGCCGAAG ATGCCGATGT AACGGGGCAA
TTGCAGCAGT GTTTGTGGCC TAAATTGTCG GTCGAACCCG ATTTACGCTC AGTGTACGAA
ACAATCGAGC AGCCGCTGAT TGAAGTGCTG GTGGCCATGG AGCGCGCGGG GGTGCGGGTG
GATCGGGGTG AGCTCGCAAT TCAGGGCAAG GCCATCGGTG AGCGGATTGC CGCGGTGGAG
CAGGCCGCGT TCAAAGAAGC TGGGCGCGAA TTCAATCTGG GGTCGACCAA ACAGCTCAAG
GAGTTGTTGT TCGATGAACT CAAACTGCCC GTGGGCAAAA AGACGCCAAA GGGCGAGCCG
TCTACCGACG AAGAAGCGCT TGGCGAGCTG GTAGGTAGCC ATCCCTTGCC CGCGTTGATT
CTCGATTACC GCGGGCTAAG CAAGCTCAAA TCAACCTATA TCGACCGATT GCCCGAAGAC
ATCCACACCC ACACGGGCCG TGTCCATAGC GCTTTTCATC AAGCCGTGAC CGCGACCGGG
CGGCTTTCTT CATCCAATCC GAACCTGCAG AACATTCCGA TTCGCAGTGA AGAAGGCCGG
CGGATACGGC AGGCATTTGT CGCCGATCCG GGGTGCAAAC TCATTTCAGC AGACTACTCG
CAGATCGAAT TACGCATCAT GGCGCATTTG TCCGAAGATG AGCGTCTATG CGCGGCCTTT
GCCGCCGGAG AAGATATTCA CCGTGCCACG GCGGCGGAGG TGTTCGGGGT CAAGGAAGTC
GAAGTTTCCG ATAATCAGCG TCGAGCGGCG AAAGCAATCA ATTTCGGTTT GATTTATGGC
ATGTCCGCCT TCGGGTTGGC CAAACAGCTT GATGTGCCGC GTGGTGAGGC GCAGGCCTAT
ATCGATCTCT ATTTCGCCCG CTATCCCGGC GTGGCGAAAT ACATGGAGCG AATGCGGCAG
CAGGCGCGCC AGATGGGCTA CGTGGAAACC GTATTCGGTC GTCGCTTATA TTTGCCGGAG
ATTCACAGCC GTAATGGCCA GCGACGCCAA TATGCCGAGC GAACCGCCAT CAACGCGCCG
ATGCAGGGTA CGGCGGCGGA TATCATCAAG ATAGCGATGA TCGCTTTGCA TAAGCTGCTG
GTGGTGCCCG GGCGAGCCCG GATGATTTTA CAGGTGCACG ATGAATTGAT CTTCGAGGTG
CCCGAGTCCG ACGTCGCCGA GATCGAGCCG ATCATCCGCG CACAGATGAC AGGGGCTGCA
AAATTGAATG TGCCGCTTGA AGTGGGTATC GGCATCGGGA GAAGTTGGGC CGAAGCGCAC
TAG
 
Protein sequence
MNDLFSSHVA LPVEPAEPTV KSKSLILVDG SSFLFRAFHA LPPLTAPDGT PTGAIHGVIN 
MLQKLRREEN PDRMAVVFDA PGPTFRDELY PEYKAQRPPL PDDLRVQIEP VHELVRALGF
PLLCVSGVEA DDVIGTLMHQ ARQNGESVLV ATADKDFAQL VTEGIRLVNT MTNTVLDEAA
IEAKYGITAA QFIDYLTLVG DTVDNVPGVP GCGPKTAAKW LNEWQSLDNL MAHADQIKGK
VGESLRAAKE FLPIGRELVT IRTDCDLPIA LADLAVEEPD VDAVRALAEK FGLNTLRRQF
SEASSVPAPV STPSQETRRT SSDDGQLPLS DPPMYETILT DADWQRWLEQ IKNADKKTDK
KMDWVAFDTE TDSLDLFAGR IVGVSFSIED NRAAYVPLAH NYPGAPAQLD RDTVLADLKP
WLEDASRTKV MQNAKFDSHM LANHGITLRG VLFDTMLESY VLDSTATRHD MDSLAAKYLG
RSTITFEDIA GKGAKALSFP EIHLEQAGPY AAEDADVTGQ LQQCLWPKLS VEPDLRSVYE
TIEQPLIEVL VAMERAGVRV DRGELAIQGK AIGERIAAVE QAAFKEAGRE FNLGSTKQLK
ELLFDELKLP VGKKTPKGEP STDEEALGEL VGSHPLPALI LDYRGLSKLK STYIDRLPED
IHTHTGRVHS AFHQAVTATG RLSSSNPNLQ NIPIRSEEGR RIRQAFVADP GCKLISADYS
QIELRIMAHL SEDERLCAAF AAGEDIHRAT AAEVFGVKEV EVSDNQRRAA KAINFGLIYG
MSAFGLAKQL DVPRGEAQAY IDLYFARYPG VAKYMERMRQ QARQMGYVET VFGRRLYLPE
IHSRNGQRRQ YAERTAINAP MQGTAADIIK IAMIALHKLL VVPGRARMIL QVHDELIFEV
PESDVAEIEP IIRAQMTGAA KLNVPLEVGI GIGRSWAEAH