Gene Haur_3224 details


Gene Information

Locus tag: Haur_3224
Symbol:
ID: 5735092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4077728
End bp: 4080511
Gene Length: 2784 bp
Protein Length: 927 aa
Translation table: 11
GC content: 51%
IMG OID: 641280370
Product: DNA polymerase III, epsilon subunit
Protein accession: YP_001545989
Protein GI: 159899742
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases; [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family; [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative
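
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4077728
end_bp = 4080511
gene_length_bp = 2784
protein_length_aa = 927

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein = codons - 1

print(span, remainder, expected_protein)  # prints: 2784 0 927
```

Under these assumptions the reported gene length and protein length agree with the coordinates.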


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00101103
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAACCGA TATATATCGC CTTGGATTTG GAAACAACTG GCTTAGAACC AGGGCGCGAT 
GAGATTATTG AAGTTGGGGC AGTTAAATTT CGAGGCAATG AGGTTCTCGA AACCTACCAA
ACCTTGGTCA AACCCAAACA GGTGCTGCCG ATTAAAATCG CCCGTTTAAC AGGCATCGAT
GCCCATGAAT TGACCACGGC TCCTACATTT AATAGTATTG GTGGCCAATT AGCCAAATTT
CTCAAAAGCT ACCCCATTAT TGGCCATTCG GTCGATAACG ATTTGCGCTT TTTGCAACAA
CAAGGGCTAA AAGTTACCCA GCCGCATTAC GATACCTTTG ATCTGGCGAC GCTGTTGATT
CCCCAATTGC CCAATTACTC GCTTTCAACG ATTGCCGAAC ATTTGCAAAT TCAACACCCT
GATGCCCACC GCGCCTTGGC CGATGCCGAG GCCAGTCGCT TGGTGTTTAG CGCATTGCTC
GATAAATTAG CCGAATTATC GGCTGCCGAA TTGCATAGCA TCGCTCAAAC CACCCAAAAA
TTGCAGTGGC CGCTGGCCAA ACTATTTGGC GAAATTGCCA AACGCCGCGT GCAAACGCTC
TGGCAAGCGC CGATCGAATT TCAACCGAAA CCACTGGTGC GCCCGGTGGC CCTCGAACCA
ACCGGCAATC AGCAAGAACT CGATGCCCAA GCAATTGGCG CGATGTTTGG CGCAGATGGT
GGGTTTAGCC GCATGTTCCC AGGCTATGAG CCACGTCAGC CCCAAATTGA GATGACCGAG
GCAATCGCCG AAGCACTCAA TCAAGGCGAT ACCTTGATGA TCGAAGCGCC AACTGGCACT
GGCAAAAGTT TGGCTTACCT CGTGCCCGCT GCTCAATGGG CACGCCAGCG CGGCGAACGA
GTGGTCATCT CAACCAACAC GATCAATCTC CAAGATCAGC TTTGCTCCAA AGATATTCCT
ACCGTGCAAG CGTTGTTGGC CGAGCAACCC GACCAATGGC CCGCTTTGCG AGCGGTGCAA
CTCAAAGGCC GCAGCAATTA CCTATGTTTG AAGCGCTATG AATCCTTTCG TGCCCACCCC
GACCACAACG AAGATCAGAC CCGTGGGTTG TTGAAGCTCC AACTTTGGCT GCCCTCGACC
AACAGCGGCG ACCGCGCTGA ATTGATGTTG ATTCAAGGCG AGCAGCAAGT TTGGAACAAT
GTTAATGTTG ATCCCGACCA ATGTTTGCGC CAACGCTGCT CGCTCTACAA CGAATGTTTC
TTCTTCAAAG CCCGCGCTGA AGCTGAAAAT GCCCATATCG TGGTGGCGAA CCATGCCTTG
TTGATGTCGG ATGTCAAATC GCCTGGGATT TTGCCACGCT ACGATCATTT GATCATCGAC
GAAGCGCATA ATCTCGAAGA TGTGGCGACT GATCAGTTGG GCTTTACGAT TTCACAACAT
AGCCTAACTG GTTTGCTCAA TGATATGCAT AGCGCTGGCG GTGTGCGTTT GGCGGGTGGC
GTGCTCAACG AATGGAGCCA AATCTTCCGT TTGAGCACCG TTGATCATAA AGAGCAGCGC
AAACTCGAAG ATCTCAGCGC CGATTTGCGA CCAAATGTTG ATAAAGCCCG CGAAGCGGCC
CAGCAATTAT TCAGCATTTT CAACGATATT ATGGCCAAAG ATCGCAGTGT GACCCAATAC
GATCCCCAAT TGCGGATCAC CAGCAAAGTG CGTCGCCACA CCGAATGGAC TCAAGTTGAG
CAAACATGGG AAAACTTGAG CATCAATTTG CGCAAGCTGG GCGATGGCTT TGGCAAGCTC
CAAGCAATTT TGGATAATCT CGAAGGCCGC GATATCAATG GCTACGATGA TTTGGTGATG
CGGGTCAAGG GTATGGTCAA TGCTTGCACT GAATTACAAC GCCAATTTGA TGTGGTAATT
TATGGCAATG AAGAAACCGT CGCATGGCTG ACTGCCGATC AACGTCGCCG CGAATTGTTG
GTGCAGGCTG CGCCAATTCA TGTTGGGCCA TTGCTCACTG AAGATTTATG GCTGAAAAAA
CGCGCCAGCA TCTTGGTTTC GGCGACGCTT TCGGTCAGCA ACAGCTTCGA TTACCCCAAA
CAGCGCTTGG GCTTGGACGA AGCCACGACG ATGCAACTCG ATTCGCCCTT CGATTACAGC
AAATCAACCT TAATCTATTT GCCAACCGAT ATGCCCGAAC CCAACGAGCG CAATTATCAA
CGGGCCATGG AAGATGCCCT GATCAATTTA TGCAAAGCGA CTGGCGGGCG CACTTTGGCA
CTGTTTACCG CCAATGCCTC GCTGAAACAA ACCTATCATG GCATTAGCGA AAGCCTTGAG
CAAGCCGATA TTTCGACCTT GGCCCAAGGC ATGGATGGCT CACGCCGCTC GTTGATCCAG
CGCTTCAAAT CTGACCCACG CACGGTTTTG TTGGGCACAG CCTCGTTTTG GGAAGGTGTT
GATGTGGTTG GCGATGCTTT GAGCGTACTG GTGATTACCA AATTGCCCTT TAGCGTGCCA
AATGATCCGG TGTTTTCGGC GCGATCTGAG GGCTTTGATG ATGCTTTTGC TGAATATTCA
GTGCCGCAGG CAATTTTGCG TTTCAAGCAA GGCTTTGGCC GCTTGATTCG CTCCAAAGAT
GATCGCGGGA TTGTGGTGGT GCTTGATCGG CGCTTGCTTA GCAAAAATTA TGGGCGGCAA
TTCCTCGAAT CATTGCCCGA TTGCACGATT CAACGCAAGC CGCTCGCCGA ATTGGCAACA
ACGGCTGCTC GTTGGTTGGT TTAA
 
Protein sequence
MEPIYIALDL ETTGLEPGRD EIIEVGAVKF RGNEVLETYQ TLVKPKQVLP IKIARLTGID 
AHELTTAPTF NSIGGQLAKF LKSYPIIGHS VDNDLRFLQQ QGLKVTQPHY DTFDLATLLI
PQLPNYSLST IAEHLQIQHP DAHRALADAE ASRLVFSALL DKLAELSAAE LHSIAQTTQK
LQWPLAKLFG EIAKRRVQTL WQAPIEFQPK PLVRPVALEP TGNQQELDAQ AIGAMFGADG
GFSRMFPGYE PRQPQIEMTE AIAEALNQGD TLMIEAPTGT GKSLAYLVPA AQWARQRGER
VVISTNTINL QDQLCSKDIP TVQALLAEQP DQWPALRAVQ LKGRSNYLCL KRYESFRAHP
DHNEDQTRGL LKLQLWLPST NSGDRAELML IQGEQQVWNN VNVDPDQCLR QRCSLYNECF
FFKARAEAEN AHIVVANHAL LMSDVKSPGI LPRYDHLIID EAHNLEDVAT DQLGFTISQH
SLTGLLNDMH SAGGVRLAGG VLNEWSQIFR LSTVDHKEQR KLEDLSADLR PNVDKAREAA
QQLFSIFNDI MAKDRSVTQY DPQLRITSKV RRHTEWTQVE QTWENLSINL RKLGDGFGKL
QAILDNLEGR DINGYDDLVM RVKGMVNACT ELQRQFDVVI YGNEETVAWL TADQRRRELL
VQAAPIHVGP LLTEDLWLKK RASILVSATL SVSNSFDYPK QRLGLDEATT MQLDSPFDYS
KSTLIYLPTD MPEPNERNYQ RAMEDALINL CKATGGRTLA LFTANASLKQ TYHGISESLE
QADISTLAQG MDGSRRSLIQ RFKSDPRTVL LGTASFWEGV DVVGDALSVL VITKLPFSVP
NDPVFSARSE GFDDAFAEYS VPQAILRFKQ GFGRLIRSKD DRGIVVVLDR RLLSKNYGRQ
FLESLPDCTI QRKPLAELAT TAARWLV
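
The sequences above are printed in space-separated blocks of ten letters, so they need light cleanup before any analysis. The sketch below is one hedged way to do that in plain Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as input. It also shows that the gene begins with GTG, one of the initiation codons permitted by translation table 11 (read as methionine at the start position, which matches the leading M of the protein sequence), and ends with the stop codon TAA.

```python
def clean_sequence(blocked: str) -> str:
    """Drop spaces and line breaks from a sequence printed in 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence in this record, copied verbatim.
first_line = "GTGGAACCGA TATATATCGC CTTGGATTTG GAAACAACTG GCTTAGAACC AGGGCGCGAT"
last_line = "ACGGCTGCTC GTTGGTTGGT TTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# The gene length is a multiple of 3, so the final three bases form the last codon.
start_codon = head[:3]
stop_codon = tail[-3:]

print(start_codon, stop_codon)  # prints: GTG TAA
```

Passing the complete gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content reported in the record.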