Gene Hhal_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2421 
Symbol 
ID4710229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2658502 
End bp2661216 
Gene Length2715 bp 
Protein Length904 aa 
Translation table11 
GC content68% 
IMG OID639856897 
ProductDNA polymerase I 
Protein accessionYP_001003986 
Protein GI121999199 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0819967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAAC CGGACGACCG CCTGGTCCTG GTGGACGGCT CCTCCTACCT CTACCGCGCC 
TTCCACGCCC TGCCGGCACT GACCAACGCC AACGGCGAAC CGACCGGGGC GCTCTACGGC
GTGGTTAACA TGCTCCACAA GCTCCTTGCC GAGGAGCCCG AGGCGCGCTT CGCCGTGGTC
TTCGACGCCC CGGGCAAGAC CTTTCGCGAT GAGCTCTTCG AGCAGTACAA GGCGCACCGG
CCACCCATGC CCGATGAACT GCGTGCCCAG CGGGAGCCGC TCAAGGCGAT CATCGCTGCG
CTCGGGGTGC CGGTGCTCGA GGTGCCCGGT GTGGAGGCCG ACGATGTCAT CGGGACCCTG
GCCGCGCGCG CCTCGGGGCC GGTCCTGATT TCCACCACGG ACAAGGACAT GGCGCAGCTG
GTGGACGAAC AGGTGACCTT GCTCAACACC ATGAGCGGCA CCCGTCTGGA CCCGGAGGGG
GTGCGCGAGA AGTTTGGTGT CCCCCCCGAG TTGATCCGCG ACTACCTGGC CCTAGTGGGC
GACACCTCGG ACAACATCCC CGGTGTTCCC AAGGTCGGAC CGAAGACCGC TGCCAAGTGG
CTCAATGCCT ACGGCAGCCT CGACGCACTG CGCGAGCAGG CCGACGAGAT CCGCGGTAAG
GTCGGCGAGA GCCTCCGCGC CCATCTCGAC GAGCTGCCGC TGTCGGTGGA TCTGGTCACC
ATCCGCTGCG ACCTGGCCCT CGAGGTCGCC CCGGAGGACC TGGTTCGCCA GAGCCCGGAC
CGCGAGACCC TCGGTGGGCT CTATCAGCGC TACGGTATGC GTCGCTTCCT CGCCGAGCTG
CAGGCCGGCG ACGCCGCCGC CGCAGCCGAC GGCACCGGCG CCAGTCTCCC CCCCAACGCG
CCCGAGGTGG CCTACGAGGT GGTCCTCGAC GACCACGGTC TCGCCGCCTG GATGGAGCGG
CTACGCAACG CCGATGCCTT CTCCATCGAC CTGGAGACGA ACAGCCTCAA CTACATGGAT
GCCGAGATCG TCGGCGTGTC GTTGGCTGTC GAGCCGGGGC AGGCCGCCTA TCTGCCCGTG
GCCCACTGCG GGCCCGGTGC CCCGGACCAG CTCGACCGGG ACCGGGTGCT CGACGCGCTG
CGCCCCCTGC TCGAGGCCGA GCAGCCGGAG AAGATGGGTC AGAACCTCAA GTACGACATG
AGTGTCCTGG CCCGCTACGG GATCGAGCTG CGCGGGGTGG CCTACGACAG TATGCTCGAG
TCCTACGTCC TCGACTCCAC GGCGACCCGC CACGACATGG ACTCGCTGGC CAGCAAGTAC
CTGGGGGTCG AGGTCACCAG CTACGAGCAG CTCTGCGGCA AAGGGGTGCG GCAGGTCCCG
TTCGCCGAGA TCGACGTCGA GCGTGCCGGC CACTACGCCG CCGAGGACGC CGACATCGCG
CTGCGCCTTC ATCAACTTCT TTACCCCCGG CTGCAGGCCG AATCGGGGCT GCTGCGAGTC
TTCAGCCAAC TCGAGATGCC CCTGTTGCCG GTTCTTTCGC GCATGGAGCG CCACGGGGTG
CGGGTCGATT GCGACCTGCT GGAGCGCCAG AGCGAGGAGC TGGCCGGGCG CATGGCCGAG
GTGGAGCAGC GCGCCCACGA GGAGGCCGGC GAGGCGTTCA ACCTCGGCTC ACCCAAGCAG
ATCCAGGAGA TCTTCTTCGA GCGTATGGGA TTGCCCGTGA TCCAGCGCAC CCCCAAGGGC
CAGCCGTCTA CCGCCGAGTC GGTGCTCGAA GAGCTCAGCG CGCGGGGCCA CGAACTGCCG
CGGTTGATCC TTGAGCATCG GGGGCTGTCC AAGCTCAAGT CCACCTATAC CGACAAGCTG
CCGCAGCTGA TCCACCGGGA CACCGGTCGT GTGCACACCT CCTACCATCA GGCGGTGGCG
GCGACCGGAC GGCTCTCCTC ATCCGATCCC AACCTGCAGA ACATCCCGGT GCGCACCCCG
GAAGGGCGGC GCATCCGCAA GGCCTTCGTG GCCAGCCCCG GGCACCGGCT GATCACCGCC
GATTACTCCC AGGTCGAGCT TCGCATCATG GCCCACCTCT CCGGCGATGA GGGCCTGCGC
CGGGCCTTCG AGCAGGGCGA GGACATCCAC CGCGCCACCG CCGCCGAGGT CTTCGCCGCC
GATGAGGTCA ACGACGAGCA GCGCCGCGCC GCCAAGGCGA TCAACTTCGG GCTGATCTAC
GGCATGTCCG CCTGGGGCCT GGGGCGGCAG CTGGGCATCC CGCGCGACGA GGCGCAGACC
TACATCGACC GTTACTTCGA GCGCTACCCC GGTGTGCGTG CCTTCATGGA TCGGGCGCGC
GAGCAGGCCC GGGAGCAGGG TTATGTGGAG ACCGTGTTTG GCCGCCGACT GCACGTCCCG
GAGATCCACA GCCGCAACCG TCAGCGCCGC GAGTACGCCG AGCGCACCGC CATCAATGCC
CCTATGCAGG GGACCGCGGC GGATGTCATC AAGCGGGCCA TGATCGACGT CGACGCCCTG
CTCAATGAGC GCTTCCCGGA GAGCCGACTG GTGATGCAGG TGCACGATGA GTTGGTGCTC
GAGGTCCCTG AGGCGCAGGC AACGGCGGTG GGCGATGAGG TGCGCCGGCT GATGGAGGGA
TCGGATCGCG GCATGGTGTC GGTTCCCTTG GAAGTCGAGC TCGGTGTTGG CGATGATTGG
GAACAGGCCC ACTGA
 
Protein sequence
MTQPDDRLVL VDGSSYLYRA FHALPALTNA NGEPTGALYG VVNMLHKLLA EEPEARFAVV 
FDAPGKTFRD ELFEQYKAHR PPMPDELRAQ REPLKAIIAA LGVPVLEVPG VEADDVIGTL
AARASGPVLI STTDKDMAQL VDEQVTLLNT MSGTRLDPEG VREKFGVPPE LIRDYLALVG
DTSDNIPGVP KVGPKTAAKW LNAYGSLDAL REQADEIRGK VGESLRAHLD ELPLSVDLVT
IRCDLALEVA PEDLVRQSPD RETLGGLYQR YGMRRFLAEL QAGDAAAAAD GTGASLPPNA
PEVAYEVVLD DHGLAAWMER LRNADAFSID LETNSLNYMD AEIVGVSLAV EPGQAAYLPV
AHCGPGAPDQ LDRDRVLDAL RPLLEAEQPE KMGQNLKYDM SVLARYGIEL RGVAYDSMLE
SYVLDSTATR HDMDSLASKY LGVEVTSYEQ LCGKGVRQVP FAEIDVERAG HYAAEDADIA
LRLHQLLYPR LQAESGLLRV FSQLEMPLLP VLSRMERHGV RVDCDLLERQ SEELAGRMAE
VEQRAHEEAG EAFNLGSPKQ IQEIFFERMG LPVIQRTPKG QPSTAESVLE ELSARGHELP
RLILEHRGLS KLKSTYTDKL PQLIHRDTGR VHTSYHQAVA ATGRLSSSDP NLQNIPVRTP
EGRRIRKAFV ASPGHRLITA DYSQVELRIM AHLSGDEGLR RAFEQGEDIH RATAAEVFAA
DEVNDEQRRA AKAINFGLIY GMSAWGLGRQ LGIPRDEAQT YIDRYFERYP GVRAFMDRAR
EQAREQGYVE TVFGRRLHVP EIHSRNRQRR EYAERTAINA PMQGTAADVI KRAMIDVDAL
LNERFPESRL VMQVHDELVL EVPEAQATAV GDEVRRLMEG SDRGMVSVPL EVELGVGDDW
EQAH