Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3529 |
Symbol | |
ID | 5735390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4442283 |
End bp | 4444064 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280676 |
Product | replicative DNA helicase |
Protein accession | YP_001546293 |
Protein GI | 159900046 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0305] Replicative DNA helicase |
TIGRFAM ID | [TIGR00665] replicative DNA helicase [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0449592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGT CGTTACCCGC CGATATTAAC GCCGAACGTG CCACCCTTGG TTCGATTTTG CTTGATCGTG ATGCAATTAT TCCAGTTGCA CCTTGGCTTG CTAGCGACTA TTTTTATCTT GAAAAGCATG GCTTGGTTTT TAATGCCCAA GTGGCTTGTT ACAACCGCCG TGTGCCACCC GATTTGGTCA ATGTTACCGA TGAGCTGCGC CGCAAAGATC AATTAGAGGT CGTCGGCGGG GTTCAGTATT TGCTTGAGCT TTCGAATAGT GTGCCAACCT CGGTGCACGT CGAATATTAT GCCCGCATCG TTGAACGCAC CGCACTATTG CGCCGTTTGA TCATCGCTGG CGGCAAGATT GCGGCCCTCG GTTATGACGA AACTCAAGAG CTTGAGGCTG CGCTCGATCA GGCCGAATCA GAATTATTTG CGGTTTCGCA GCGGCGTACC GCCGATGGCT TTATTCATAT CGGTGCAGTT GTCGATAGTT TCTTCGAGCA AATTAGCCAA ATGCAGGAGC GCGGCGGCGA AGTTGTGGGC CTCAAAACAG GCTTTACCGA TTTTGATAAA TTAACTGGTG GCTTGCAACG CTCCGACTTG TTGATTTTGG CGGCGCGGCC AGCCACTGGC AAAACCAGCT TGGCCTTGAA TATTGCCTAC AACGCCGCCA AGGAATCTGA GGCCTGCGTG GCGATTTTTA GCCTTGAAAT GAGTCGCGAT CAGCTGATGC AGCGGATTTT GGCCACCGAA ACTGGCGTGG ATATGCAAAA GCTGAGGACT GGCCAAATTC GCGATAGTGA TTTGCAGTTG CTGACCGAGG CGCTTGGTAA ACTCTCCACC ATGTCAATTT ATATCGATGA TTCACCTGGA GCCAGCATTA TGGATGTGCG CTCCAAATGT CGGCGCTTGC AAGCTGAGGC AGGCATCGAT CTGATTATCA TCGATTATTT GCAGTTGATG CAGGGTGGCG GCAAACGCGA TGGCAACCGT GTCCAGGAAA TTAGCGAAAT TAGCCGTGGA CTCAAGGCCT TGGCCCGCGA AATCAATGTG CCAGTGATTG CGCTTTCGCA GCTTTCGCGG GCCGTCGAAA GCCGCACCAC CCACGTTCCC ATGCTCTCCG ATTTGCGCGA ATCAGGGTGT CTCACAGGCG ATACCCAAAT CTATTTGCCT GATCAAGGTC ATTATGTCCG AATCGATCAA TTAGTTGGGC AACAAGGCTT TAATGTTCTA GCGCTGAATC AAGCAACTTG GAAACTTGAA TCATGGCCTG TTACCCATGC CTTTACAACT GGCACAAAAC CTGTTTTTAA ACTCCGTACT AAGCTAGGGC GCAGTATTCG CGCCACGGTT AACCATAAAT TCTTAACCAT TGATGGTTGG CAACGGCTTG ATCAGTTGCA TCAAACCAGC CAAATTGCCA TCTCACAGGA AATTGCCCAT GAATTGGGTA GCAACAGCCA ACCAACCGCG ATTGAATGGG AAACAATCAT TAGCATCGAG CCTGATGGCG TTGAGCCAGT TTATGATTTG ACGGTGGATA CGCTCCATAA TTTTGTGGCA AATAATATCA TTGTGCATAA CAGTATCGAG CAAGATGCGG ATATCGTGAT GTTTATTTAT CGTGAAGAGC TGTATGATCC CGAAACTGAT AAGAAGGGCA TCGCCGAAAT TCACCTAGCC AAGCACCGTA ACGGCCCAAC CGGGATCGTG CCTTTGCGCT TTTTTCGCTC GACAACGAAA TTTTCCAATT TAGAGACCTA TCGCCAACCT GAAGGCTATT AG
|
Protein sequence | MDKSLPADIN AERATLGSIL LDRDAIIPVA PWLASDYFYL EKHGLVFNAQ VACYNRRVPP DLVNVTDELR RKDQLEVVGG VQYLLELSNS VPTSVHVEYY ARIVERTALL RRLIIAGGKI AALGYDETQE LEAALDQAES ELFAVSQRRT ADGFIHIGAV VDSFFEQISQ MQERGGEVVG LKTGFTDFDK LTGGLQRSDL LILAARPATG KTSLALNIAY NAAKESEACV AIFSLEMSRD QLMQRILATE TGVDMQKLRT GQIRDSDLQL LTEALGKLST MSIYIDDSPG ASIMDVRSKC RRLQAEAGID LIIIDYLQLM QGGGKRDGNR VQEISEISRG LKALAREINV PVIALSQLSR AVESRTTHVP MLSDLRESGC LTGDTQIYLP DQGHYVRIDQ LVGQQGFNVL ALNQATWKLE SWPVTHAFTT GTKPVFKLRT KLGRSIRATV NHKFLTIDGW QRLDQLHQTS QIAISQEIAH ELGSNSQPTA IEWETIISIE PDGVEPVYDL TVDTLHNFVA NNIIVHNSIE QDADIVMFIY REELYDPETD KKGIAEIHLA KHRNGPTGIV PLRFFRSTTK FSNLETYRQP EGY
|
| |