Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2539 |
Symbol | |
ID | 5734417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3253308 |
End bp | 3256703 |
Gene Length | 3396 bp |
Protein Length | 1131 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279679 |
Product | helicase domain-containing protein |
Protein accession | YP_001545305 |
Protein GI | 159899058 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATTC CCTATGTGAT TGATAATATT GAGCAGCGCC TCGCCGACGT GCTGCGCTAC TTACTGAGCC GAGCGAGCGA CCAATCGCTA GATATTGCCA CGGCCTATTT CAGCATTCGC GGCTTTGAAC AGCTGCGCCA CGAGCTGGAA ACCACAGGCC AAGTGCGTTT ACTGCTGGGC GATAAACCAG TGGCAGCTGA TGATTTGGGT ATGCGACCCG ATTCGAGCGC GTTTTTGCGC CATGAACTGA ACGCCGAAGC CTTTACTGAA CGCACCTTGC GCTTAGTCGA ACAGCTGATT CGCTTTTTGC GCCGCGACGA GGTTGAGGTG CGTTTGTATA CGGGCTTGTT GGCTGGCGAA AAAGGCCGCC GCGCATTTTT GCATGCCAAA TGCTATTTAT TGTATGGTCG CTTGCCCGCT GCCAATGTGT TTGATCCGCT CACGCCGATT GTTGGCATTG TGGGCAGCAG CAATTTCACT GGGCCAGGCT TGGTCAGCAA TCATGAACTA AATTTGGTGC ACAAAACCCT GCTCACCGAC GCTGAGCTTG ATGATGCTGA AGCTAAAGCT ACGGTTGCCC AGCATGCCGC CCAACTGAGT GCTGCGCCGC TTGAACTTGA ACATGAACGC ATGCTCAAAA GCGAAATTGG CACGCGAGCT ATTCTCGATC TTTCGCGTTG GTATGAGCAA CAATGGCAGC ATGCGCTTGA TTATAAAGAA CAATTGATCG AACTGTTGGA AAACTCCAAA TTTGGCGGGC GCGAATATAG CCCCTACGAA ATTTACATGA AAGCGCTGTA TACCTATTTT AAAGATGATT TAGCTGAGCG CGACCAACAA CCAGCCACCC GTTCGGCGGT TGAGCTAGCC GAATTTCAAG AAGATGCAGT GCGCAAGGCG CGGCGCATTT TAGCCCAATA CGATGGCGCG ATTGTCGCCG ACTCGGTGGG CTTGGGCAAA ACTTGGATTG GCAAAAAACT GCTGGAAGAT TACGCCTACC ATCAGCGCCA AAAAGCCTTG GTGATTTGTC CGGCCTCGCT GCGTGGCATG TGGGAGCGTG AATTGCACAG CGCCACGATC GCCGCCCAAG TGCTGACCCA AGAACGGTTG GGCCTCGACG AATTTGATGG CCGCGAGTTT TTCGATGTTG ATCTGATTTT GGTCGATGAA GCCCACAACT TTCGCAACAA ACGCGCCAAA CGCTACCAAC AACTTGAGCT TTTACTGGCG GCCAATGGGC GACGCGGGCG CAGCGGTAGC CGCAAAAAGC TGATTTTGCT GACCGCCACC CCCATTAACA ACAATATTTT CGATTTATAC AACCAAATTA ATCTGTTTAC CGGCAACGAC CGCAGCTACT TTGCTTCGGC TGGCATCGGC GATTTATACA AATATTTCTT GGCAGCGCGG TGCGAATCGT TGGAAGCTGG TTCGATTCGC ATCTTTAACC TGCTCGAAGA AGTGGTGATT CGCCGCACGC GCCAATTTAT TCGGCGAGCC TACCCCGAAG CGCTGATTCA TGGCGAGCCA ATTCGCTGGC CAACCCGCCA ACTGCACAGC GTCGAATACG ATTTACAAGC GGCCTACAGT GGCTTATATC AAAATATTAT TGGTTCAATC GAAGGCCTAC ATCTCGCTCA CTACAACCTT GAGGCCTACA AACTGCGCCC CAGCGACCAA GATGAATTTG AGCTGGGGCG ACAAGCAGCG CTGGTGGGTA TTTTCAAAAG TCGCTTTCTC AAGCGGCTCG AATCGAGTAT CGAAGCCTTT CGCATTTCAA TTCGCCGTTC GTTGGCCTTT GTCAAAACCT TTGCCGAATA TGTGCAAGAT GGGATTGTGC TCGATTCGGT GTCGTTTCAG CAGGCCATGC GCTTGCTCGA AGCCGACGAA GAAGATAGCG ACGATAGCAC GCCCAGTTCG CAGGCCTCGG CGCTTGATGA GCATGCCGCC GCCAGCCTGA TTATTGCCAA ACTGCCCAAA CTTGAGGCCA GTAAATACGA TCGGCGGCGT TTGCATCGGG CTTTGCAAGC AGATATTGAT GCGCTGAACG AAATTTGGCA TGCGATTAAA CACATTCAAG CCAGCCACGA TGCCAAATTG CTGCAATTGC AAAGCCTACT TGCCAGCCAA CTCAACGGCT GCAAAGTGGT GATTTTTACC TATTACAAAG ATACAGCCCG CTATGTCTAT CAAGCCTTGA CTGAGGCAAG CAATGCTGAA TGGCTGGCCA CGCTTGGCAA TCCAACAATT CGGCGCATTG ATAGCAGCGT CAAAACGACC GATCGGACGC GAATTGTGAG CCATTTTGCG CCACGCGCCA GCAACCAACC CGAATTGGTT GGCACAAGCG ACGAAGTACA AATTTTGATC GCCACCGATG TGCTTTCCGA GGGCCATAAT TTGCAAGATT GCGGCCATTT GCTCAACTAC GATTTGCACT GGAACCCGAC GCGCATGGTC CAACGGGCAG GCCGGATCGA TCGGCTTGGC TCGAATTTCG ATCTGCTGCA TGTGTACAAT ATGTTTCCTG AGCGCGAACT CGAAGCTTTG CTGGGCTTGG TGCGCAGCCT GACCAGCAAA ATCGATTTGA TCAACCAAAC GGGCTTTTTA GATGCTAGCG TGTTGGGCGA AGTCGTTACG CCGCGCGATT TCAACACGCT CAAGCGCATC GCCGACGAAG ATCATAGCGT GATCGAGGAA CAAGAATCAT TTTTGGAGCT AGCGAGCAGC GAATCATTAT TTGCCGAATT GCAAAATGTG CTGGCGACCG ATGCCCAACG CTGGCTGACC GACCTTGATG ATGGGATTCA TTCGGGGATT GAGCGACGTA ATGCCAAAGG CCTGTTTTTC TATTTCACTG CGCCGCGTGA TGGCAGCACC GCCCATTTTT GGCGCTACTA CGACCTTGAA AAACAAACGA TTACCGATAA TCGCTACACC ATTATGCAGC TGATTGCCTG TAGCCCAGAT ACGCCACGCT TTGCCCCGCC CTATAGCGAA GTTGATATTT TCGCCATCCA CGACACCATT CTCAACAGCA TTTTGCGCGA TGTTCAGCAA CAAGTCACGG CGACGGTGGT CGATAAAATC GTTGCACCGG AACAAAACGT GATCGCCCAA TTGCTCCACA ACAATCTTGC ACAGCCTGGC GTTGATCGCG GCGAAGTGCG CGAATTGCGC AAAATGCTCA AAGAACCCCA AGTTGGCGCG GTCGTGCAAC GTTTGCGTAA AACGCTCAAT CGCTACAACG CCGACAACGA TTTGCTTGCG CTGCTTGAGG TGCTACGCGA ACTTTACCAA ACCCAAGGTC GCAGCAATCT TGAGCCGAGT GGCCCCGTTA GCATGATTAC CCGCGATGAT CTGACCTTGG TGTGCTATGA GTATATTTAT GCATAG
|
Protein sequence | MQIPYVIDNI EQRLADVLRY LLSRASDQSL DIATAYFSIR GFEQLRHELE TTGQVRLLLG DKPVAADDLG MRPDSSAFLR HELNAEAFTE RTLRLVEQLI RFLRRDEVEV RLYTGLLAGE KGRRAFLHAK CYLLYGRLPA ANVFDPLTPI VGIVGSSNFT GPGLVSNHEL NLVHKTLLTD AELDDAEAKA TVAQHAAQLS AAPLELEHER MLKSEIGTRA ILDLSRWYEQ QWQHALDYKE QLIELLENSK FGGREYSPYE IYMKALYTYF KDDLAERDQQ PATRSAVELA EFQEDAVRKA RRILAQYDGA IVADSVGLGK TWIGKKLLED YAYHQRQKAL VICPASLRGM WERELHSATI AAQVLTQERL GLDEFDGREF FDVDLILVDE AHNFRNKRAK RYQQLELLLA ANGRRGRSGS RKKLILLTAT PINNNIFDLY NQINLFTGND RSYFASAGIG DLYKYFLAAR CESLEAGSIR IFNLLEEVVI RRTRQFIRRA YPEALIHGEP IRWPTRQLHS VEYDLQAAYS GLYQNIIGSI EGLHLAHYNL EAYKLRPSDQ DEFELGRQAA LVGIFKSRFL KRLESSIEAF RISIRRSLAF VKTFAEYVQD GIVLDSVSFQ QAMRLLEADE EDSDDSTPSS QASALDEHAA ASLIIAKLPK LEASKYDRRR LHRALQADID ALNEIWHAIK HIQASHDAKL LQLQSLLASQ LNGCKVVIFT YYKDTARYVY QALTEASNAE WLATLGNPTI RRIDSSVKTT DRTRIVSHFA PRASNQPELV GTSDEVQILI ATDVLSEGHN LQDCGHLLNY DLHWNPTRMV QRAGRIDRLG SNFDLLHVYN MFPERELEAL LGLVRSLTSK IDLINQTGFL DASVLGEVVT PRDFNTLKRI ADEDHSVIEE QESFLELASS ESLFAELQNV LATDAQRWLT DLDDGIHSGI ERRNAKGLFF YFTAPRDGST AHFWRYYDLE KQTITDNRYT IMQLIACSPD TPRFAPPYSE VDIFAIHDTI LNSILRDVQQ QVTATVVDKI VAPEQNVIAQ LLHNNLAQPG VDRGEVRELR KMLKEPQVGA VVQRLRKTLN RYNADNDLLA LLEVLRELYQ TQGRSNLEPS GPVSMITRDD LTLVCYEYIY A
|
| |