Gene Haur_0069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0069 
Symbol 
ID5731942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp88237 
End bp91098 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content52% 
IMG OID641277191 
ProductDNA polymerase I 
Protein accessionYP_001542849 
Protein GI159896602 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.512123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGC GACCCACGCT CGTCTTGGTT GATGGCCATG CACTGGCATT TCGCGCTTTT 
TTTGCGCTTC GCGACACAGG CATGTCGGTT CGCGCCACGG GCGAGCCAAC CTATGCCGTG
CAAGGCTTTT TATCAATTTT GCTCAACTTG TTACGTGAGC GCCAACCAGA GTATGTGGCG
GTTTCGTTTG ATATTGGTCG AACCTTCCGC GATGATCTGT ACCCCGATTA TAAGGCTGGC
CGCGCTGAAA CACCCGCCGA TTTCCACCCC CAACTTGAAC GGATCAAGCA AATTATCAAT
GCTTTGAATA TTCCGATCTA CACTGCCGAG AACTATGAGG CCGATGATGT GATTGGGACG
TTGTGTCGTC AAGCTGAGGC GCAGGGTGTC GATACCTTGA TTATCACAGG CGATACCGAC
ACCCTGCAAT TGGTCAACGA CTATACCAAG GTGCTGTTGG CCAATCCTTA TGGCAAGGGC
AATGTTTCGC TCTACGATGA AGCTCAAGTG CGCGAACGCT ACAAAGGCTT GGCTCCCAAC
CAACTGGCCG ATTTGCGCGG CCTCAAGGGC GATACCTCCG ACAACATTCC TGGGGTCAAG
GGCATTGGTG AGGCTGGGGC AATCAGCATG CTCAATGAAT GGGGCAGCGT CGAAAATATC
TACGCCAACC TCGATAAAGT CGCCAATCGC TATCGTTCAA AGCTCGATGG CCAGCAAGAA
GCCGCGCGAT TTAGCACTCA CTTAGCAACT ATCGTTACTA ACGCCCCGGT AACCTTGGAT
CTTGAAGCCA CCAAAGTGCA CGATTATGAT CGTGATACGG TCTTGGCCTT GTTCAGCCAG
CTTGAATTTC GCAAGTTGGT CGATAAGCTG CCACTTTCCA GCCAAGTGAG CGCTGTCCAA
GTTGTTGCCA TTCCGCAAAC GACCAATCCT AACCAATTAA CCATGTTCGA TGATGCCACG
CCCCCAAGTG CTGAGCCGAT CGCTCAATCT GGCGATTATC AAGCGGTGAC TACCAGCGAA
CAATTGGCCG AATTGGTAAG AATTTTGACG GCAGCTGAAC GTTTAATCTT CGATGTGGAA
ACCAATAGTT TGAATTTGTT CTCGCCTGTG CCCGCCAAGG TTGTCGGCAT CGCCCTGACC
CATACTGCTG GCTGCGGTTG GTATATTCCA TTGGGCCATC GCAGCGGCCA GCAATTGCCG
ATAGCCGAAG TTGTCGCGGC ACTGCAACCA TTGTTTAGCG ACCCACAAAA AGCTGTGGTA
GCGCATAATG GCAAGTTTGA TATGAGCGCC TTGAGCTTGA TTGGGCTTGA TGTGCCCCAT
TTGAGTTTCG ATACGGCCAT CGCCGCCGCC TTGTTGGGCA AACGCCAGAG CCTCAAAGAT
TTGGCCTTTG CCGAATTACG CGATGCCGAT GATCGCCCAA TTGAGATGAC CCGGATCGAA
ACACTGATCG GTACTGGCAA AAAGCAAATC ACCATGGATC AGGTGGCGAT TGAACAAGTA
ACGCCTTATG CAAGCGCCGA TGTTGATATG ACGGCGCGTT TATTGGCGCT ATTCATGCCG
CAACTTGGGG CAATTCCGGC GGTACGCGAG GTGTTTGAGC AAATCGAAAT GCCGCTTAGC
CCTGTGTTGA TGCGCATGGA AGCCTGTGGC ATTGGGCTTG ATCGGGCGCA ATTGGTGCAA
CAAGGTCAAG TGTTAGGTCA AAGTTTACGC GAAATCGAGC AACATATTGC CGATTTTGTC
GGCGAACCAC TCAACATTAA TTCGCGCTTC GATTTAAATG ATCTGTTGTT TATTCGCTTG
AAGTTGCCAA CCGCCAATCT CAAGCGTTTG GCTGGTACAA CTCGCAGTGG CGGCGCGGTT
TACTCAGTTA ACGCTGAAAC CTTGGAAGAT TTACAAACTC ACGATCAAAG CGGGATTGTA
GCCATGATTT TGCGCTATCG CCGTTTGTCG AAGCTCAAAT CGACCTATGT TGATGCCTTG
ATTGAGTTGA TCAACCAGCA AACAGGCCGC GTGCATACCC AATATCGCCA AATTGGCGCG
GAAACTGGGC GGCTTAGCTC CGACTCACCC AACTTGCAAA ATATTCCGGT GCGCAGCGAG
GAAGGCCGCG AAATTCGACG GGCTTTTGTT GCGCGGCCAG GCCATGTACT GATGACCGCC
GACTATTCAC AGATTGAACT ACGAGTCTTG GCTCATATCA CCGCCGATCC AGCCTTAGTC
GAAGTGTTTA AAACTGGCCA AGATATTCAC GCAGCCACCG CTGCCCGTTT GTTTGATATT
CCCATGGATG AAGTCAGCAA AAATCAGCGG CGGATCGCCA AAATGACGGT CTTTGGTATT
ATTTACGGCA TTAGCAGCTT TGGCTTGGCC GCTCGCACGG CGCTTTCACG CACCGAAGCC
CAACAAATGA TCAACGGCTT GTTTGCTCAA TACCCAGGCC TGAAGAGCTA TATCGAACGA
ACATTGGAGC GAGTTAAGGC AGTTGGCTAT GTTGAAACCT TGTTTGGCCG CCGCCGCTAC
TTCCGCGAAT TGCAAGACGG TGGCGTAACT GGGCCTCGCC GTAGCGCCTT TGAGCGTGAA
GCGACCAACG CTGGGATTCA AGGCACAGCC GCCGATTTGA TCAAGTTGGC CATGATTCGG
CTGGAACAAG CATTAATTGC TGGCGGCTAT CAGGCCAAAA TGCTGCTGCA AGTGCATGAC
GAATTGGTTT TGGAAGTGCC TGAGGATGAG CGTGATGCCG TAGCCCAATT AGTTTGTGAT
ACGATGACCC AAGTCTATCC CGATTTGGCC GTGCCATTGG AAGTAAACGT TGAAACTGGG
CTGAATTGGG ATCAGCTTCA GCGCTGGCAT GCCCCAGCCT AG
 
Protein sequence
MQQRPTLVLV DGHALAFRAF FALRDTGMSV RATGEPTYAV QGFLSILLNL LRERQPEYVA 
VSFDIGRTFR DDLYPDYKAG RAETPADFHP QLERIKQIIN ALNIPIYTAE NYEADDVIGT
LCRQAEAQGV DTLIITGDTD TLQLVNDYTK VLLANPYGKG NVSLYDEAQV RERYKGLAPN
QLADLRGLKG DTSDNIPGVK GIGEAGAISM LNEWGSVENI YANLDKVANR YRSKLDGQQE
AARFSTHLAT IVTNAPVTLD LEATKVHDYD RDTVLALFSQ LEFRKLVDKL PLSSQVSAVQ
VVAIPQTTNP NQLTMFDDAT PPSAEPIAQS GDYQAVTTSE QLAELVRILT AAERLIFDVE
TNSLNLFSPV PAKVVGIALT HTAGCGWYIP LGHRSGQQLP IAEVVAALQP LFSDPQKAVV
AHNGKFDMSA LSLIGLDVPH LSFDTAIAAA LLGKRQSLKD LAFAELRDAD DRPIEMTRIE
TLIGTGKKQI TMDQVAIEQV TPYASADVDM TARLLALFMP QLGAIPAVRE VFEQIEMPLS
PVLMRMEACG IGLDRAQLVQ QGQVLGQSLR EIEQHIADFV GEPLNINSRF DLNDLLFIRL
KLPTANLKRL AGTTRSGGAV YSVNAETLED LQTHDQSGIV AMILRYRRLS KLKSTYVDAL
IELINQQTGR VHTQYRQIGA ETGRLSSDSP NLQNIPVRSE EGREIRRAFV ARPGHVLMTA
DYSQIELRVL AHITADPALV EVFKTGQDIH AATAARLFDI PMDEVSKNQR RIAKMTVFGI
IYGISSFGLA ARTALSRTEA QQMINGLFAQ YPGLKSYIER TLERVKAVGY VETLFGRRRY
FRELQDGGVT GPRRSAFERE ATNAGIQGTA ADLIKLAMIR LEQALIAGGY QAKMLLQVHD
ELVLEVPEDE RDAVAQLVCD TMTQVYPDLA VPLEVNVETG LNWDQLQRWH APA