Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5079 |
Symbol | |
ID | 5737037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 96433 |
End bp | 98847 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641282244 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001547835 |
Protein GI | 159901589 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGATG GGTTTGTTGA GCCAGATTTT CTCATGGCAA TTTTAGCCAA AAGCGCTGAC CGTAGTGAGC ATAACATTCC CGAAACTTTA GCGCGGCATA CGTGGCTGGT GGTTACAAAA GTTGCCGAAT TAGCCAAAAT TCGCCCTGAT CTAACCGTGG TCGCCAATAC AGCAGATCTC TGGCATCTGC TGTATTGGGG TTGTTTTTTG CACGATTTTG GCAAAGCCAC AGGTGGGTTT CAGCAGCAAT TGCAGGGTGT TTCTTGGAAT GGTCATCGTC ATGAAGTGAT TTCGCTAGTG TTTCTCGATT GGATTGCCGC TAATTTTCCT AAGCATCAAC AAGCATGGTT GTCCGCCGCA ATTGCTTCGC ATCATCGTGA TCGCGGCATT ATTCAAGAAA AATATGTGGT TGATAGTGTG CTGGCTGAAG CTTGGTCAAC CTTTGACTTA ACCCATGTTC CATTGATGTG GCGTTGGCTG ACCGAATATG CCAACCAATG GATTGGTTTG CTTGGCCTTG ATGCCGTTGG CGTGCGACCC TTACAATTTC CCAACGCAGA TCAGGCAATT CAGCACGTTC AAACCGATGC ATGTAGTCGA ATTCGCTATT GGTTGTGCCA ATATTATCGG CTGAAACAGG TTTTTGACGA TCAACCAGCG CATGCCCCAG TGCCGTTATT AATCTTGCTC CGTGGCCTAA CCACTACTGC TGATCATATG GCCTCAGCTC ATTTAGCGGC GATTCCTCAG CCCATTCAAG AAAATTGGCA AGCCCTAGCG AAGCGAATTC TGAACGCAGA TCAACAACCG TATTCCCATC AACAACAGAG TGCTGATGCC CATAACACAT CCTGTTTGTT GATTGCTCCC ACTGGTAGCG GCAAAACCGA AGCTGCGTTG TATTGGGCCT TGGGCGAAGG CGAGCAACCA GTTCCACGAA TTTTCTATGC CTTACCATTC CAAGCAAGTA TGAATGCCAT GTTTGATCGC TTACGTCAGC CAGCGAAAGG TTTTGGCGAG CAAGCAGTTG GTTTGCAGCA TGGTCGGGCC TTGCAGGTGT TATATCTTCG CTTGCTTGAG AGCGAAAATG GCTTGGATTC GCGCACGGCA GCCGAAGGAG AACGCTGGGA ACGGAATATC AACAGCTTGC ATGCCCGTCC TCTAAAAGTG TTTAGTCCCT ATCAAATGCT TAAAGCCTTA TTTCAGATTA AAGGCTTTGA GGCAATTTTG AGCGATTATG CCCAGGCACG TTTTATTTTT GATGAAATTC ATGCATATGA GCCACAGCGC CTTGCTTTGA TTATTTGTCT GATTAACTAT TTGTCTGAGC ATTTTGCGGC AACCTTTTTT GTCATGTCGG CAACATTTCC TACGATTATT CGTGAGCACT TGGCGAATGC CTTAGGCCGA CATCAGGTTA TTCACGCTTC AGCCAGTTTA TTTCAAGCAT TTGCTCGCCA TCAATTGCAA TTGCTCGATG GAGAATTAAT CTCAGCCAGT AGTATTGAAC ATATTGTTAA TGATTTCAAG GCAGGTAAGC AAGTTCTAGT TTGTGCCAAT ACAGTACGAC GTTCGCAAAC GATTTTAGCG CTTTTGCGCG ATGCTGGGGT TGCTGAATCC GATTTATTGC TGATTCATAG TCGTTTTACC ATGAAAGATC GCAGCGCCTT AGAACAACGA GTTGTTCAAC GTTGTCAATT AGGTTTAGAT CAACCAGAAC CATTTATATT AATTACAACA CAAGTGATTG AAGTTAGTTT AAATATCGAC CTTGATACAT TGTATAGCGA TCCTGCCCCA CTTGAGGCAT TGCTACAACG TTTTGGGCGG GTAAATCGGT CGCGTAAAAA AGGTATTGTT CCTGTCCATG TTTTTCGCGA ACCACGCGAT GGCCAAGGAG TATATGGCCG AAGTAAAGAT CCAAAACAGC AAGGACGTAT TGTTCAAGTA ACGTTAAGCG AATTGGAAAA ACATAATGGT GAAATCATCG ACGAATCAAT GATTGATCAA TGGCTTGATA CGATTTATGC TGATGCAATT TTGGCTAAGC AATGGCAGGA TGAGTATCAA AAAATCTATG ATAATGCTCA GTGGATTGCT CATAATTTGC GGCCATTCGA GAGCGATAAA ACCACTGAAG ATCAATTCGA TGAGCTTTTT GATAATGTTG ATGTTATTCC ACAATCACTG GTACAAACCT ATCTTGATTT GCTCAATAAC CATGAATATG TTGAATCTAG TCGTTATTTT GTTGGAATCA GTAAACAAAA ATACGCCCAA TTCAAACAAA ACGGTTTAAT TCTCGCCTTA GAAGATGCAG CATTAAAACA TCCACGTTGG ATTATCAATC TTCCCTACAG TAGCGAAAGT GGTTTGTCAT TTGAACAAAC TACAACGGAT GACGATTGGA GCTGA
|
Protein sequence | MRDGFVEPDF LMAILAKSAD RSEHNIPETL ARHTWLVVTK VAELAKIRPD LTVVANTADL WHLLYWGCFL HDFGKATGGF QQQLQGVSWN GHRHEVISLV FLDWIAANFP KHQQAWLSAA IASHHRDRGI IQEKYVVDSV LAEAWSTFDL THVPLMWRWL TEYANQWIGL LGLDAVGVRP LQFPNADQAI QHVQTDACSR IRYWLCQYYR LKQVFDDQPA HAPVPLLILL RGLTTTADHM ASAHLAAIPQ PIQENWQALA KRILNADQQP YSHQQQSADA HNTSCLLIAP TGSGKTEAAL YWALGEGEQP VPRIFYALPF QASMNAMFDR LRQPAKGFGE QAVGLQHGRA LQVLYLRLLE SENGLDSRTA AEGERWERNI NSLHARPLKV FSPYQMLKAL FQIKGFEAIL SDYAQARFIF DEIHAYEPQR LALIICLINY LSEHFAATFF VMSATFPTII REHLANALGR HQVIHASASL FQAFARHQLQ LLDGELISAS SIEHIVNDFK AGKQVLVCAN TVRRSQTILA LLRDAGVAES DLLLIHSRFT MKDRSALEQR VVQRCQLGLD QPEPFILITT QVIEVSLNID LDTLYSDPAP LEALLQRFGR VNRSRKKGIV PVHVFREPRD GQGVYGRSKD PKQQGRIVQV TLSELEKHNG EIIDESMIDQ WLDTIYADAI LAKQWQDEYQ KIYDNAQWIA HNLRPFESDK TTEDQFDELF DNVDVIPQSL VQTYLDLLNN HEYVESSRYF VGISKQKYAQ FKQNGLILAL EDAALKHPRW IINLPYSSES GLSFEQTTTD DDWS
|
| |