Gene Information |
Locus tag | Haur_1766 |
Symbol | |
ID | 5733654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2054305 |
End bp | 2057472 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278909 |
Product | non-specific serine/threonine protein kinase |
Protein accession | YP_001544537 |
Protein GI | 159898290 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
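The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon; neither convention is stated in the record, though the listed values are consistent with both. The function names are hypothetical and chosen for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2054305
END_BP = 2057472
GENE_LENGTH_BP = 3168
PROTEIN_LENGTH_AA = 1055


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Because the record lists the gene as not spliced, the protein length follows directly from the nucleotide length; a spliced gene would need its exon lengths summed first.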
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGGCCG CCGATTTACT GAATATTTCA GCGCTCCGAG CTGAGGCCGG CCAGCGAGTA GTACAAGCGG GCGAGGCGTA TTATCGAGCA GGTCAGGTTG AAATGCTTGA CCTTGATCAC GATCAGGCAC TCTTTGCTGT CCATGGTAGC CAAGAAAACC CGTACACCGT CACAATTATC AGTGATCAAC ATTGGTTAAT TTCTGATTGC ACATGCCCTT ATGCAGCCAA AGGCGTGATT TGTAAGCATG TCGTTGCAGC CGCGCTCTAC CTCTATGATC AGCTTGTAGC GCATCCACCT AGTATTTGGC GCTCGATTTT TGCCAATATT CAACCGCCTA CCCGCCGCCG TCAAAACAGC ATGCTGTTGA TATTTAGCTT GATAGAGCGC GGGGCTAGTT GGACACTCGT ACCCTATACG TTCGCTGAGC GTGTGATTCC ACAGCCAGTG ATGCATGATC CTGTTGCGCT ACACAAGCAC TTAAATACCT CAGCTATGCT CAAACATGCC AAAATTCCCC ATACCCAACT TCAGCCTCAG CAATTTCCCC ATAGCCCAGA AGCAGCGTTA ACCTTGGCAA ACATGATCGT AGCTTTTTCG CAAAACTATT TCTATGGCAT AACCCTTCAG TTGCCAATAA TACTCAACGC CTTAGTTCAA CTTTTGCCCG ATGCCGCTAT CTTTTGCGGC GATGATCTGC ACCCATTTCA ACGAACACTG CAAGTAGCCG TAAATCGTGG GCAAATTCAA CTCCATGGCA CTGCCACAAC CGATGGAATG ACGCTCACGC CAATGCTGCG CTTCGGCGAA ACAAGCACCA ACCTCGATAC AGACGAGGTG AGGATTCTAC TCTTTAATGC TGAATGGGCT ATCTATCGCG ATTGGCTCGT GCGCTATGAT GATCCTGGTA ATATACTTAG CTTATTTCGC CGCCATAACC ATCTCAATAT TCCGGCGGCT GACCACACTG AATTTATCGA GCAGTATCTC GTGCCGCTAG CGGAGCACAC CGCACTTAGT GGCGATCTAC CGCACCAAGA ATTGGTTGTG GCAGATCCTC AGCCACGCTT GTATTTGAGC GACCATGAAT CGACCTTGCG AGCTGAGTTA CGTTTTGGCT ATGCTGACCA TGAATTGGTC TATGACCTCC AGTTGCCAGC AGAAACCATC CGCTATAGTC ACGAGCATGC GACAATTCTG CGCATCCAGC GGCAACCAAG CATCGAGGAG CAATGCTGGG GTGAGTTGAT GCGTCATGGG CTTAAACGTG GACAGCAACC AAGCGTTGCA ACTTTGCGCA GTGGAACAAC GAGCGCTACA TTTTTGCTCA ACCATGTGCC CAAACTAGCG GCCATGAATT TCACAATTTA TGGAGAGGAA TCATTGCTTG GTGCGCGAGT TAACCGTCAC ACGCCCAGCA TCACCTTGCG GGTTTCATCG GGAATCGATT GGTTTGACCT TGAGGCCGTT GTCCGTTTTG GCGAAACCGA GCTTGAGCTG GCTGAGCTAC GGCGGGCAAT TCGCAAGCGT AAACGCTATG TAAAGTTGGC TGATGGAACA CTTGGGGCAA TTCCTGAGCT ATGGCTAGAG CGCTATCGTC ATCTCTTTAC GCTTGGTGAA ATGCATGCAG AAACGCTACG GTTTGCTCCG ACACAAATTA CCTTGTTGGA TGGACTGCTA CATAACACTG ATCAGGTTGA TCCGACCTTC AAACAGCGCC TCCAAGGCCT AAAAACTATC AATGGAATCG CACCACAGCC ACTTCCCTCA GGCTTCGCTG GAGTGTTGCG TTCTTATCAA 
AAAGCGGGCT ACGATTGGCT TCACTTCCTC TATAAGTATG GATTTGGCGG GTGCTTGGCC GATGATATGG GCACTGGCAA AACAATTCAG ACACTCGCCT TTCTCCAGTC GTTGAAAGCT CGTGGTCAAG CGTCTGCTAG CAGTCTAATC GTCATGCCAC GCTCATTAAT CTTCAACTGG CAACGTGAAA TCGCCCGCTG GACTCCCGAT TTGCAAGTGC TGGTTCATAC CGATCAAGGC CGACCTGACA CCGTCGCAGC CTTTAGCGAT TACGATCTCG TGCTTACAAC CTATGGAACG CTGTTGCGCG ATATCGATCT ATTTGCTACC TATCAGTTTC ACTGTCTTGT GCTCGATGAG GCCCAAGCGA TTAAAAACCC ATCCTCGCAA ACTGCGCGAG CCGCCCGCGC CCTGCATGCC GACCATCGCC TTACGTTAAC AGGTACGCCC GTTGAAAACT CAATCTTAGA GTTATGGTCA CAGTTTGCCT TTCTCAACCC AAGCATGCTT GGCAGCCTTG AGCATTTTCG TAGCGAATTT GCCACCCCAA TCGAGCGCGA CGGCGATCAG CAAACGGCGC AGTTGCTCCG CCGAATGGTC AATCCCTTTA TCCTGCGGCG CACCAAAGAC CAAGTTGCCC CTGAATTGCC GCCCCGCAAT GAACGGCTCA TGTATTGTGA TATGGAGCCA GCTCAACAAA AGCTCTATCA GCGTTATCGC GACCAATATC GCGCCATGTT GCTATCATTG ATCGATGATC AAGGGATCAA TGATAGTCGG ATTAAAGTTC TGGAGGGTTT ACTCCGACTC CGCCAGATTT GCAACCACCC ACAACTAGTT GAAGCAACAT TTCGAGGGCA CTCCGCCAAA TTTGATCAGT TACTGGAAAC CCTTGAGGTT CTTCATGCCG AAGGTCACAA AGCCCTGATT TTCTCGCAAT TCGTTCAGAT GTTGACGCTG CTGTGGAAAG AACTTGATCG ACGAAACCTG AGCTATGCCT ATCTTGATGG TAAAACAAAC AATCGAGCTG CGGTCGTGGA TCGTTTCCAA ACCGACCCAC AAATTCATTT CTTCCTGATC AGCCTTAAAG CTGGCGGAGT TGGCCTGAAT CTCACCGCAG CCGACTATGT AATTCACATC GACCCATGGT GGAACCCTGC CGTCGAACAG CAAGCAACGG ATCGCACCCA TCGGATCGGT CAAGATAAGC CTGTTTTTAT CTATAAATTA ATCGTGCGCA ATAGTGTTGA AGAAAAGATT TTGCAACTGC AAGAACGGAA ACGCGCCTTG GCAAACAATA TCATTACCAG CGAACAAGGA ATCGTCAAAT CACTGACCCG CGAAGATGTG GTAGATCTCT TCTCCTAA
|
Protein sequence | MLAADLLNIS ALRAEAGQRV VQAGEAYYRA GQVEMLDLDH DQALFAVHGS QENPYTVTII SDQHWLISDC TCPYAAKGVI CKHVVAAALY LYDQLVAHPP SIWRSIFANI QPPTRRRQNS MLLIFSLIER GASWTLVPYT FAERVIPQPV MHDPVALHKH LNTSAMLKHA KIPHTQLQPQ QFPHSPEAAL TLANMIVAFS QNYFYGITLQ LPIILNALVQ LLPDAAIFCG DDLHPFQRTL QVAVNRGQIQ LHGTATTDGM TLTPMLRFGE TSTNLDTDEV RILLFNAEWA IYRDWLVRYD DPGNILSLFR RHNHLNIPAA DHTEFIEQYL VPLAEHTALS GDLPHQELVV ADPQPRLYLS DHESTLRAEL RFGYADHELV YDLQLPAETI RYSHEHATIL RIQRQPSIEE QCWGELMRHG LKRGQQPSVA TLRSGTTSAT FLLNHVPKLA AMNFTIYGEE SLLGARVNRH TPSITLRVSS GIDWFDLEAV VRFGETELEL AELRRAIRKR KRYVKLADGT LGAIPELWLE RYRHLFTLGE MHAETLRFAP TQITLLDGLL HNTDQVDPTF KQRLQGLKTI NGIAPQPLPS GFAGVLRSYQ KAGYDWLHFL YKYGFGGCLA DDMGTGKTIQ TLAFLQSLKA RGQASASSLI VMPRSLIFNW QREIARWTPD LQVLVHTDQG RPDTVAAFSD YDLVLTTYGT LLRDIDLFAT YQFHCLVLDE AQAIKNPSSQ TARAARALHA DHRLTLTGTP VENSILELWS QFAFLNPSML GSLEHFRSEF ATPIERDGDQ QTAQLLRRMV NPFILRRTKD QVAPELPPRN ERLMYCDMEP AQQKLYQRYR DQYRAMLLSL IDDQGINDSR IKVLEGLLRL RQICNHPQLV EATFRGHSAK FDQLLETLEV LHAEGHKALI FSQFVQMLTL LWKELDRRNL SYAYLDGKTN NRAAVVDRFQ TDPQIHFFLI SLKAGGVGLN LTAADYVIHI DPWWNPAVEQ QATDRTHRIG QDKPVFIYKL IVRNSVEEKI LQLQERKRAL ANNIITSEQG IVKSLTREDV VDLFS
|
| |
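The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a sketch under stated assumptions, not the method the database itself uses: it assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the record lists the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# differs from the standard code only in which codons may act as starts,
# which does not matter here because this gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_codons = "ATGTTGGCCG CCGATTTACT GAATATTTCA"
print(translate(first_codons))  # MLAADLLNIS, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` should yield the listed protein sequence once its spaces are removed, and `gc_fraction` on the same input can be compared against the GC content field.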