Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1937 |
Symbol | |
ID | 5733826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2347018 |
End bp | 2350092 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641279081 |
Product | superfamily II DNA/RNA helicase |
Protein accession | YP_001544708 |
Protein GI | 159898461 |
COG category | [R] General function prediction only |
COG ID | [COG4889] Predicted helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00156892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTCGA TCAAGCCAAA TCATAAGGCG GTGCGCGAGT ATTACGCCAG TTTGCGCAGT TTGGCCGAGG CCCGTGCCCA ACACGAGGGC GCAGTTGCTC CAGCCTTTGC GGCCTTGCTC CGCGCTTGCG CAAGCCAGAT GGGCTGGACA TTGGTCGAAC AGTATTCGAT CCGCTTGAAA AAAAGCTCGA TTCGCGCCGA TGGTGCTTTG CTCGATAGTT TTACCTTGAT TCGCGGGGTC TGGGAAGCCA AAGATAGTAA CGATGATTTG GCAACTGAGG TGCGTAAAAA GTTCGCCGCT GGCTATCCCG CTGAAAATAT CTTGTTTCAG GCTCCGCAAC GGATTATTCT TTGGCAAAAT CAGCGCCAAG TGCTCGATGT GGATATTAGC CAGCCCGATG CGTTGATCGA TGCGCTGCTG CTGTTTTTCA ATTATCAGCC GCCCCAATAT TTGCAATGGG ATCACGCGGT CGTCGAGTTT CGCGAGCGCG TGCCCGAACT CGCCCAAGGC GTGCTGCGTT TGATCGAGCG CGAAATTAGC GAGAAGAATC AGCGTTTTAT CACGGCGCTT GAGCGCTTTA TGGCCTTGGT GCGCGAGGCG ATCAACCCCA ATATTTCGGT TTCAGCGGTC GAGGAGATGT TGATTCAGCA CTTGTTGACC GAGCGGATTT TTCGCAAGGT CTTTAATAAT CCCGATTTTG TTAATCGCAA TGTGATTGCC CGCGAAATTG AAACCGTGAT TCAGGCATTA ACTTCGCGTT CGTTCAATCG TAATGATTTT TTGCGTGAAC TCGACCGTTT TTATGGGGCA ATCGAATCGA CCGCCGCCAC CATCGAGAAT TTCAGCCATA AACAGGATTT CTTGAACACG GTGTATGAGA ATTTTTTCCA AGGCTTTTCG ATTAAGGTCG CTGATACGCA TGGGATTGTT TACACGCCTC AGCCGATTGT CGATTTTATG GTGCGTTCGG TCGAGGAGTT GTTGCGGCGC GAATTTAATA CCTCGCTCGG CAACGCGGGC GTGCATGTGC TCGACCCATT TGTTGGCACT GGCAACTTTT TGCTGCGGGT GATGCACGAA ATTCCGCGCA GCAAATTGCG CCAAAAATAT GCCGAGGAAT TACACTGCAA CGAGGTGATG TTGTTGCCCT ACTACATCGC TTCGATGAAT ATCGAGCATT TGTATTATGA ATTGACCAAT AGCTATCAAG AATTCAATGG CATTTGTTTG GTCGATACCT TTGAATTAGC TCAAGTTGGC GCAGGCCAGC AATTGGGCTT GTTTGTGCCC GAAAACACCG AGCGCGTGCT CAAACAACAA CAACAAGATA TTTTTGTAAT CATCGGCAAC CCACCTTACA ACGCCCGCCA AGTTAACGAG AATGATAATA ATAAAAATCG TAAGTATGAA ATTATCGATC AACGGGTGGC TATGACCTAT AGCCGCGATT CACAACAAAC CAACAAAAAT GCATTGAACG ATCCATATGT TAAATCCTTT CGCTGGGCGG CTGATCGAAT TATACGCAAC GGTGATGAAG GTATTGTAGC CCTTGTTACC AACAATAGTT TTATTGATGA TTTATCGTTT GATGGCATGC GCAAGCATTT AGCACAGGAT TTTGATGCAA TCTATGTGCT TGATCTTGGT GGCAATGTGC GCAAAAATCC CAAACTTTCC GGCACAACAC ACAATGTGTT TGGCATTCAG GTTGGAGTCA GTATCATTTT TCTGATCAAA AAGCGTGGTT CAACTAAAGC AAGTGATGCC AAAATTTGGT ATGCGCGAGC TGGCGAGATG TGGAAAAAAC AAGAAAAGTT TAATTTGCTT AATCAGGCTG AAACAATAGA TAAAATCGAA TGGCAAGAAA TTTTACCTGA TAAAAAGCAT ACTTGGCTCA CGGATGGCTT AGAAAATGAT TTTGATAATT TTATTCCCTT AGGTACAAAG GAAGCGAAAA AAGGATTTGG TCAAGCAATT TTCACACAAT TTACAAATGG TGTAAAAAGT AATCGTGATG CTTGGGTTTG GAATTTTGAT TCTGATACCT TATCAAACAA CATCAAAACC ACTATTGATT ATTACAATGA TCATGTTTCT CGCTGGCAAA GATTGGTGAC CAAACAAGAA ATTGATAGTT TTATTTCTAC TGATGACAAA AAAATTAGTT GGAGCGGTGA TCTAAAAAGT AATATCCAAC GAGGGCGTTA TATCCAATAT GATGCAAATA AAGTTATAGA TGGCATCTAT CGACCATACA CAAAACAGAA AATTTACTTT GAACGGCTTC TCAATGAGCG AGTCTACCTG ATACCATCTC TATTCCCAAC AGCTAGTGAA AATCGGGTAA TTTGCGTTGT CAACGAGGCA CAAATCCCAT TTTCAGCCCA AATCACTAAC GTCATTCCTT GTTTGCATTA TGGTGGGCGG CAAACTCAAT GCTTCCCATA TTATGTCTAT GATGACGATG GCAGCAACCA GCGCGAAAAC ATCAGCGATT GGGCGCTTGA GCACTTCCGC AGCCAACTTG GCGAGCCAAG CATCGAAAAA TGGGATATTT TTTATTATGT GTATGGGCTG CTGCACTCGC CGCATTACCG CGAACGCTAC GCCGCCAACT TACGCCGCGA ACTGCCGCGC ATCCCAATTG TGGCCTTGGC CGATTTTCAG GCCTTGGCCC AAGCAGGCCG CGAGCTGGCC GAACTGCACA TTAATTACGA AAGTCAGCCT GAATATAATT TACAATGGCT CGAAAACCGC GACGAACCAC TGAATTGGCG GGTTGAAAGC ATGAAACTCA GCAAAGATCG CACAACCCTG CGCTACAACA ACTTTCTGAG TTTGGCGGGC ATTCCGGCGG CGGCCTTTGA ATATAAGCTG GGCAACCGCT CGGCGCTCGA TTGGGTGATT GATCAATATC GGGTCAGCAC CGATGCGCGG TCGGGCATCA CCAACGATCC CAATCGCCAC GACGATCCTG AGTATATCGT GCGGCTGATC GGCAAAATCA TCACCATCAG CCTCAAAACC GTCGAGATTG TGACGCGAAT TGGGGATGTG GCCCTCACCC CCTAA
|
Protein sequence | MLSIKPNHKA VREYYASLRS LAEARAQHEG AVAPAFAALL RACASQMGWT LVEQYSIRLK KSSIRADGAL LDSFTLIRGV WEAKDSNDDL ATEVRKKFAA GYPAENILFQ APQRIILWQN QRQVLDVDIS QPDALIDALL LFFNYQPPQY LQWDHAVVEF RERVPELAQG VLRLIEREIS EKNQRFITAL ERFMALVREA INPNISVSAV EEMLIQHLLT ERIFRKVFNN PDFVNRNVIA REIETVIQAL TSRSFNRNDF LRELDRFYGA IESTAATIEN FSHKQDFLNT VYENFFQGFS IKVADTHGIV YTPQPIVDFM VRSVEELLRR EFNTSLGNAG VHVLDPFVGT GNFLLRVMHE IPRSKLRQKY AEELHCNEVM LLPYYIASMN IEHLYYELTN SYQEFNGICL VDTFELAQVG AGQQLGLFVP ENTERVLKQQ QQDIFVIIGN PPYNARQVNE NDNNKNRKYE IIDQRVAMTY SRDSQQTNKN ALNDPYVKSF RWAADRIIRN GDEGIVALVT NNSFIDDLSF DGMRKHLAQD FDAIYVLDLG GNVRKNPKLS GTTHNVFGIQ VGVSIIFLIK KRGSTKASDA KIWYARAGEM WKKQEKFNLL NQAETIDKIE WQEILPDKKH TWLTDGLEND FDNFIPLGTK EAKKGFGQAI FTQFTNGVKS NRDAWVWNFD SDTLSNNIKT TIDYYNDHVS RWQRLVTKQE IDSFISTDDK KISWSGDLKS NIQRGRYIQY DANKVIDGIY RPYTKQKIYF ERLLNERVYL IPSLFPTASE NRVICVVNEA QIPFSAQITN VIPCLHYGGR QTQCFPYYVY DDDGSNQREN ISDWALEHFR SQLGEPSIEK WDIFYYVYGL LHSPHYRERY AANLRRELPR IPIVALADFQ ALAQAGRELA ELHINYESQP EYNLQWLENR DEPLNWRVES MKLSKDRTTL RYNNFLSLAG IPAAAFEYKL GNRSALDWVI DQYRVSTDAR SGITNDPNRH DDPEYIVRLI GKIITISLKT VEIVTRIGDV ALTP
|
| |