Gene Haur_3529 details


Gene Information

Locus tag: Haur_3529
Symbol:
ID: 5735390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4442283
End bp: 4444064
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 49%
IMG OID: 641280676
Product: replicative DNA helicase
Protein accession: YP_001546293
Protein GI: 159900046
COG category: [L] Replication, recombination and repair
COG ID: [COG0305] Replicative DNA helicase
TIGRFAM ID: [TIGR00665] replicative DNA helicase
            [TIGR01443] intein C-terminal splicing region
            [TIGR01445] intein N-terminal splicing region
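
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record, but the record does not state them.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4442283, 4444064)
print(length_bp)                  # 1782
print(protein_length(length_bp))  # 593
```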


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0449592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAAGT CGTTACCCGC CGATATTAAC GCCGAACGTG CCACCCTTGG TTCGATTTTG 
CTTGATCGTG ATGCAATTAT TCCAGTTGCA CCTTGGCTTG CTAGCGACTA TTTTTATCTT
GAAAAGCATG GCTTGGTTTT TAATGCCCAA GTGGCTTGTT ACAACCGCCG TGTGCCACCC
GATTTGGTCA ATGTTACCGA TGAGCTGCGC CGCAAAGATC AATTAGAGGT CGTCGGCGGG
GTTCAGTATT TGCTTGAGCT TTCGAATAGT GTGCCAACCT CGGTGCACGT CGAATATTAT
GCCCGCATCG TTGAACGCAC CGCACTATTG CGCCGTTTGA TCATCGCTGG CGGCAAGATT
GCGGCCCTCG GTTATGACGA AACTCAAGAG CTTGAGGCTG CGCTCGATCA GGCCGAATCA
GAATTATTTG CGGTTTCGCA GCGGCGTACC GCCGATGGCT TTATTCATAT CGGTGCAGTT
GTCGATAGTT TCTTCGAGCA AATTAGCCAA ATGCAGGAGC GCGGCGGCGA AGTTGTGGGC
CTCAAAACAG GCTTTACCGA TTTTGATAAA TTAACTGGTG GCTTGCAACG CTCCGACTTG
TTGATTTTGG CGGCGCGGCC AGCCACTGGC AAAACCAGCT TGGCCTTGAA TATTGCCTAC
AACGCCGCCA AGGAATCTGA GGCCTGCGTG GCGATTTTTA GCCTTGAAAT GAGTCGCGAT
CAGCTGATGC AGCGGATTTT GGCCACCGAA ACTGGCGTGG ATATGCAAAA GCTGAGGACT
GGCCAAATTC GCGATAGTGA TTTGCAGTTG CTGACCGAGG CGCTTGGTAA ACTCTCCACC
ATGTCAATTT ATATCGATGA TTCACCTGGA GCCAGCATTA TGGATGTGCG CTCCAAATGT
CGGCGCTTGC AAGCTGAGGC AGGCATCGAT CTGATTATCA TCGATTATTT GCAGTTGATG
CAGGGTGGCG GCAAACGCGA TGGCAACCGT GTCCAGGAAA TTAGCGAAAT TAGCCGTGGA
CTCAAGGCCT TGGCCCGCGA AATCAATGTG CCAGTGATTG CGCTTTCGCA GCTTTCGCGG
GCCGTCGAAA GCCGCACCAC CCACGTTCCC ATGCTCTCCG ATTTGCGCGA ATCAGGGTGT
CTCACAGGCG ATACCCAAAT CTATTTGCCT GATCAAGGTC ATTATGTCCG AATCGATCAA
TTAGTTGGGC AACAAGGCTT TAATGTTCTA GCGCTGAATC AAGCAACTTG GAAACTTGAA
TCATGGCCTG TTACCCATGC CTTTACAACT GGCACAAAAC CTGTTTTTAA ACTCCGTACT
AAGCTAGGGC GCAGTATTCG CGCCACGGTT AACCATAAAT TCTTAACCAT TGATGGTTGG
CAACGGCTTG ATCAGTTGCA TCAAACCAGC CAAATTGCCA TCTCACAGGA AATTGCCCAT
GAATTGGGTA GCAACAGCCA ACCAACCGCG ATTGAATGGG AAACAATCAT TAGCATCGAG
CCTGATGGCG TTGAGCCAGT TTATGATTTG ACGGTGGATA CGCTCCATAA TTTTGTGGCA
AATAATATCA TTGTGCATAA CAGTATCGAG CAAGATGCGG ATATCGTGAT GTTTATTTAT
CGTGAAGAGC TGTATGATCC CGAAACTGAT AAGAAGGGCA TCGCCGAAAT TCACCTAGCC
AAGCACCGTA ACGGCCCAAC CGGGATCGTG CCTTTGCGCT TTTTTCGCTC GACAACGAAA
TTTTCCAATT TAGAGACCTA TCGCCAACCT GAAGGCTATT AG
 
Protein sequence
MDKSLPADIN AERATLGSIL LDRDAIIPVA PWLASDYFYL EKHGLVFNAQ VACYNRRVPP 
DLVNVTDELR RKDQLEVVGG VQYLLELSNS VPTSVHVEYY ARIVERTALL RRLIIAGGKI
AALGYDETQE LEAALDQAES ELFAVSQRRT ADGFIHIGAV VDSFFEQISQ MQERGGEVVG
LKTGFTDFDK LTGGLQRSDL LILAARPATG KTSLALNIAY NAAKESEACV AIFSLEMSRD
QLMQRILATE TGVDMQKLRT GQIRDSDLQL LTEALGKLST MSIYIDDSPG ASIMDVRSKC
RRLQAEAGID LIIIDYLQLM QGGGKRDGNR VQEISEISRG LKALAREINV PVIALSQLSR
AVESRTTHVP MLSDLRESGC LTGDTQIYLP DQGHYVRIDQ LVGQQGFNVL ALNQATWKLE
SWPVTHAFTT GTKPVFKLRT KLGRSIRATV NHKFLTIDGW QRLDQLHQTS QIAISQEIAH
ELGSNSQPTA IEWETIISIE PDGVEPVYDL TVDTLHNFVA NNIIVHNSIE QDADIVMFIY
REELYDPETD KKGIAEIHLA KHRNGPTGIV PLRFFRSTTK FSNLETYRQP EGY