Gene Haur_5079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5079 
Symbol 
ID5737037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp96433 
End bp98847 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content44% 
IMG OID641282244 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001547835 
Protein GI159901589 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGATG GGTTTGTTGA GCCAGATTTT CTCATGGCAA TTTTAGCCAA AAGCGCTGAC 
CGTAGTGAGC ATAACATTCC CGAAACTTTA GCGCGGCATA CGTGGCTGGT GGTTACAAAA
GTTGCCGAAT TAGCCAAAAT TCGCCCTGAT CTAACCGTGG TCGCCAATAC AGCAGATCTC
TGGCATCTGC TGTATTGGGG TTGTTTTTTG CACGATTTTG GCAAAGCCAC AGGTGGGTTT
CAGCAGCAAT TGCAGGGTGT TTCTTGGAAT GGTCATCGTC ATGAAGTGAT TTCGCTAGTG
TTTCTCGATT GGATTGCCGC TAATTTTCCT AAGCATCAAC AAGCATGGTT GTCCGCCGCA
ATTGCTTCGC ATCATCGTGA TCGCGGCATT ATTCAAGAAA AATATGTGGT TGATAGTGTG
CTGGCTGAAG CTTGGTCAAC CTTTGACTTA ACCCATGTTC CATTGATGTG GCGTTGGCTG
ACCGAATATG CCAACCAATG GATTGGTTTG CTTGGCCTTG ATGCCGTTGG CGTGCGACCC
TTACAATTTC CCAACGCAGA TCAGGCAATT CAGCACGTTC AAACCGATGC ATGTAGTCGA
ATTCGCTATT GGTTGTGCCA ATATTATCGG CTGAAACAGG TTTTTGACGA TCAACCAGCG
CATGCCCCAG TGCCGTTATT AATCTTGCTC CGTGGCCTAA CCACTACTGC TGATCATATG
GCCTCAGCTC ATTTAGCGGC GATTCCTCAG CCCATTCAAG AAAATTGGCA AGCCCTAGCG
AAGCGAATTC TGAACGCAGA TCAACAACCG TATTCCCATC AACAACAGAG TGCTGATGCC
CATAACACAT CCTGTTTGTT GATTGCTCCC ACTGGTAGCG GCAAAACCGA AGCTGCGTTG
TATTGGGCCT TGGGCGAAGG CGAGCAACCA GTTCCACGAA TTTTCTATGC CTTACCATTC
CAAGCAAGTA TGAATGCCAT GTTTGATCGC TTACGTCAGC CAGCGAAAGG TTTTGGCGAG
CAAGCAGTTG GTTTGCAGCA TGGTCGGGCC TTGCAGGTGT TATATCTTCG CTTGCTTGAG
AGCGAAAATG GCTTGGATTC GCGCACGGCA GCCGAAGGAG AACGCTGGGA ACGGAATATC
AACAGCTTGC ATGCCCGTCC TCTAAAAGTG TTTAGTCCCT ATCAAATGCT TAAAGCCTTA
TTTCAGATTA AAGGCTTTGA GGCAATTTTG AGCGATTATG CCCAGGCACG TTTTATTTTT
GATGAAATTC ATGCATATGA GCCACAGCGC CTTGCTTTGA TTATTTGTCT GATTAACTAT
TTGTCTGAGC ATTTTGCGGC AACCTTTTTT GTCATGTCGG CAACATTTCC TACGATTATT
CGTGAGCACT TGGCGAATGC CTTAGGCCGA CATCAGGTTA TTCACGCTTC AGCCAGTTTA
TTTCAAGCAT TTGCTCGCCA TCAATTGCAA TTGCTCGATG GAGAATTAAT CTCAGCCAGT
AGTATTGAAC ATATTGTTAA TGATTTCAAG GCAGGTAAGC AAGTTCTAGT TTGTGCCAAT
ACAGTACGAC GTTCGCAAAC GATTTTAGCG CTTTTGCGCG ATGCTGGGGT TGCTGAATCC
GATTTATTGC TGATTCATAG TCGTTTTACC ATGAAAGATC GCAGCGCCTT AGAACAACGA
GTTGTTCAAC GTTGTCAATT AGGTTTAGAT CAACCAGAAC CATTTATATT AATTACAACA
CAAGTGATTG AAGTTAGTTT AAATATCGAC CTTGATACAT TGTATAGCGA TCCTGCCCCA
CTTGAGGCAT TGCTACAACG TTTTGGGCGG GTAAATCGGT CGCGTAAAAA AGGTATTGTT
CCTGTCCATG TTTTTCGCGA ACCACGCGAT GGCCAAGGAG TATATGGCCG AAGTAAAGAT
CCAAAACAGC AAGGACGTAT TGTTCAAGTA ACGTTAAGCG AATTGGAAAA ACATAATGGT
GAAATCATCG ACGAATCAAT GATTGATCAA TGGCTTGATA CGATTTATGC TGATGCAATT
TTGGCTAAGC AATGGCAGGA TGAGTATCAA AAAATCTATG ATAATGCTCA GTGGATTGCT
CATAATTTGC GGCCATTCGA GAGCGATAAA ACCACTGAAG ATCAATTCGA TGAGCTTTTT
GATAATGTTG ATGTTATTCC ACAATCACTG GTACAAACCT ATCTTGATTT GCTCAATAAC
CATGAATATG TTGAATCTAG TCGTTATTTT GTTGGAATCA GTAAACAAAA ATACGCCCAA
TTCAAACAAA ACGGTTTAAT TCTCGCCTTA GAAGATGCAG CATTAAAACA TCCACGTTGG
ATTATCAATC TTCCCTACAG TAGCGAAAGT GGTTTGTCAT TTGAACAAAC TACAACGGAT
GACGATTGGA GCTGA
 
Protein sequence
MRDGFVEPDF LMAILAKSAD RSEHNIPETL ARHTWLVVTK VAELAKIRPD LTVVANTADL 
WHLLYWGCFL HDFGKATGGF QQQLQGVSWN GHRHEVISLV FLDWIAANFP KHQQAWLSAA
IASHHRDRGI IQEKYVVDSV LAEAWSTFDL THVPLMWRWL TEYANQWIGL LGLDAVGVRP
LQFPNADQAI QHVQTDACSR IRYWLCQYYR LKQVFDDQPA HAPVPLLILL RGLTTTADHM
ASAHLAAIPQ PIQENWQALA KRILNADQQP YSHQQQSADA HNTSCLLIAP TGSGKTEAAL
YWALGEGEQP VPRIFYALPF QASMNAMFDR LRQPAKGFGE QAVGLQHGRA LQVLYLRLLE
SENGLDSRTA AEGERWERNI NSLHARPLKV FSPYQMLKAL FQIKGFEAIL SDYAQARFIF
DEIHAYEPQR LALIICLINY LSEHFAATFF VMSATFPTII REHLANALGR HQVIHASASL
FQAFARHQLQ LLDGELISAS SIEHIVNDFK AGKQVLVCAN TVRRSQTILA LLRDAGVAES
DLLLIHSRFT MKDRSALEQR VVQRCQLGLD QPEPFILITT QVIEVSLNID LDTLYSDPAP
LEALLQRFGR VNRSRKKGIV PVHVFREPRD GQGVYGRSKD PKQQGRIVQV TLSELEKHNG
EIIDESMIDQ WLDTIYADAI LAKQWQDEYQ KIYDNAQWIA HNLRPFESDK TTEDQFDELF
DNVDVIPQSL VQTYLDLLNN HEYVESSRYF VGISKQKYAQ FKQNGLILAL EDAALKHPRW
IINLPYSSES GLSFEQTTTD DDWS