Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2421 |
Symbol | |
ID | 4710229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2658502 |
End bp | 2661216 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639856897 |
Product | DNA polymerase I |
Protein accession | YP_001003986 |
Protein GI | 121999199 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0819967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAAC CGGACGACCG CCTGGTCCTG GTGGACGGCT CCTCCTACCT CTACCGCGCC TTCCACGCCC TGCCGGCACT GACCAACGCC AACGGCGAAC CGACCGGGGC GCTCTACGGC GTGGTTAACA TGCTCCACAA GCTCCTTGCC GAGGAGCCCG AGGCGCGCTT CGCCGTGGTC TTCGACGCCC CGGGCAAGAC CTTTCGCGAT GAGCTCTTCG AGCAGTACAA GGCGCACCGG CCACCCATGC CCGATGAACT GCGTGCCCAG CGGGAGCCGC TCAAGGCGAT CATCGCTGCG CTCGGGGTGC CGGTGCTCGA GGTGCCCGGT GTGGAGGCCG ACGATGTCAT CGGGACCCTG GCCGCGCGCG CCTCGGGGCC GGTCCTGATT TCCACCACGG ACAAGGACAT GGCGCAGCTG GTGGACGAAC AGGTGACCTT GCTCAACACC ATGAGCGGCA CCCGTCTGGA CCCGGAGGGG GTGCGCGAGA AGTTTGGTGT CCCCCCCGAG TTGATCCGCG ACTACCTGGC CCTAGTGGGC GACACCTCGG ACAACATCCC CGGTGTTCCC AAGGTCGGAC CGAAGACCGC TGCCAAGTGG CTCAATGCCT ACGGCAGCCT CGACGCACTG CGCGAGCAGG CCGACGAGAT CCGCGGTAAG GTCGGCGAGA GCCTCCGCGC CCATCTCGAC GAGCTGCCGC TGTCGGTGGA TCTGGTCACC ATCCGCTGCG ACCTGGCCCT CGAGGTCGCC CCGGAGGACC TGGTTCGCCA GAGCCCGGAC CGCGAGACCC TCGGTGGGCT CTATCAGCGC TACGGTATGC GTCGCTTCCT CGCCGAGCTG CAGGCCGGCG ACGCCGCCGC CGCAGCCGAC GGCACCGGCG CCAGTCTCCC CCCCAACGCG CCCGAGGTGG CCTACGAGGT GGTCCTCGAC GACCACGGTC TCGCCGCCTG GATGGAGCGG CTACGCAACG CCGATGCCTT CTCCATCGAC CTGGAGACGA ACAGCCTCAA CTACATGGAT GCCGAGATCG TCGGCGTGTC GTTGGCTGTC GAGCCGGGGC AGGCCGCCTA TCTGCCCGTG GCCCACTGCG GGCCCGGTGC CCCGGACCAG CTCGACCGGG ACCGGGTGCT CGACGCGCTG CGCCCCCTGC TCGAGGCCGA GCAGCCGGAG AAGATGGGTC AGAACCTCAA GTACGACATG AGTGTCCTGG CCCGCTACGG GATCGAGCTG CGCGGGGTGG CCTACGACAG TATGCTCGAG TCCTACGTCC TCGACTCCAC GGCGACCCGC CACGACATGG ACTCGCTGGC CAGCAAGTAC CTGGGGGTCG AGGTCACCAG CTACGAGCAG CTCTGCGGCA AAGGGGTGCG GCAGGTCCCG TTCGCCGAGA TCGACGTCGA GCGTGCCGGC CACTACGCCG CCGAGGACGC CGACATCGCG CTGCGCCTTC ATCAACTTCT TTACCCCCGG CTGCAGGCCG AATCGGGGCT GCTGCGAGTC TTCAGCCAAC TCGAGATGCC CCTGTTGCCG GTTCTTTCGC GCATGGAGCG CCACGGGGTG CGGGTCGATT GCGACCTGCT GGAGCGCCAG AGCGAGGAGC TGGCCGGGCG CATGGCCGAG GTGGAGCAGC GCGCCCACGA GGAGGCCGGC GAGGCGTTCA ACCTCGGCTC ACCCAAGCAG ATCCAGGAGA TCTTCTTCGA GCGTATGGGA TTGCCCGTGA TCCAGCGCAC CCCCAAGGGC CAGCCGTCTA CCGCCGAGTC GGTGCTCGAA GAGCTCAGCG CGCGGGGCCA CGAACTGCCG CGGTTGATCC TTGAGCATCG GGGGCTGTCC AAGCTCAAGT CCACCTATAC CGACAAGCTG CCGCAGCTGA TCCACCGGGA CACCGGTCGT GTGCACACCT CCTACCATCA GGCGGTGGCG GCGACCGGAC GGCTCTCCTC ATCCGATCCC AACCTGCAGA ACATCCCGGT GCGCACCCCG GAAGGGCGGC GCATCCGCAA GGCCTTCGTG GCCAGCCCCG GGCACCGGCT GATCACCGCC GATTACTCCC AGGTCGAGCT TCGCATCATG GCCCACCTCT CCGGCGATGA GGGCCTGCGC CGGGCCTTCG AGCAGGGCGA GGACATCCAC CGCGCCACCG CCGCCGAGGT CTTCGCCGCC GATGAGGTCA ACGACGAGCA GCGCCGCGCC GCCAAGGCGA TCAACTTCGG GCTGATCTAC GGCATGTCCG CCTGGGGCCT GGGGCGGCAG CTGGGCATCC CGCGCGACGA GGCGCAGACC TACATCGACC GTTACTTCGA GCGCTACCCC GGTGTGCGTG CCTTCATGGA TCGGGCGCGC GAGCAGGCCC GGGAGCAGGG TTATGTGGAG ACCGTGTTTG GCCGCCGACT GCACGTCCCG GAGATCCACA GCCGCAACCG TCAGCGCCGC GAGTACGCCG AGCGCACCGC CATCAATGCC CCTATGCAGG GGACCGCGGC GGATGTCATC AAGCGGGCCA TGATCGACGT CGACGCCCTG CTCAATGAGC GCTTCCCGGA GAGCCGACTG GTGATGCAGG TGCACGATGA GTTGGTGCTC GAGGTCCCTG AGGCGCAGGC AACGGCGGTG GGCGATGAGG TGCGCCGGCT GATGGAGGGA TCGGATCGCG GCATGGTGTC GGTTCCCTTG GAAGTCGAGC TCGGTGTTGG CGATGATTGG GAACAGGCCC ACTGA
|
Protein sequence | MTQPDDRLVL VDGSSYLYRA FHALPALTNA NGEPTGALYG VVNMLHKLLA EEPEARFAVV FDAPGKTFRD ELFEQYKAHR PPMPDELRAQ REPLKAIIAA LGVPVLEVPG VEADDVIGTL AARASGPVLI STTDKDMAQL VDEQVTLLNT MSGTRLDPEG VREKFGVPPE LIRDYLALVG DTSDNIPGVP KVGPKTAAKW LNAYGSLDAL REQADEIRGK VGESLRAHLD ELPLSVDLVT IRCDLALEVA PEDLVRQSPD RETLGGLYQR YGMRRFLAEL QAGDAAAAAD GTGASLPPNA PEVAYEVVLD DHGLAAWMER LRNADAFSID LETNSLNYMD AEIVGVSLAV EPGQAAYLPV AHCGPGAPDQ LDRDRVLDAL RPLLEAEQPE KMGQNLKYDM SVLARYGIEL RGVAYDSMLE SYVLDSTATR HDMDSLASKY LGVEVTSYEQ LCGKGVRQVP FAEIDVERAG HYAAEDADIA LRLHQLLYPR LQAESGLLRV FSQLEMPLLP VLSRMERHGV RVDCDLLERQ SEELAGRMAE VEQRAHEEAG EAFNLGSPKQ IQEIFFERMG LPVIQRTPKG QPSTAESVLE ELSARGHELP RLILEHRGLS KLKSTYTDKL PQLIHRDTGR VHTSYHQAVA ATGRLSSSDP NLQNIPVRTP EGRRIRKAFV ASPGHRLITA DYSQVELRIM AHLSGDEGLR RAFEQGEDIH RATAAEVFAA DEVNDEQRRA AKAINFGLIY GMSAWGLGRQ LGIPRDEAQT YIDRYFERYP GVRAFMDRAR EQAREQGYVE TVFGRRLHVP EIHSRNRQRR EYAERTAINA PMQGTAADVI KRAMIDVDAL LNERFPESRL VMQVHDELVL EVPEAQATAV GDEVRRLMEG SDRGMVSVPL EVELGVGDDW EQAH
|
| |