Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3472 |
Symbol | |
ID | 5735333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4374361 |
End bp | 4377111 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280619 |
Product | TPR repeat-containing serine/threonin protein kinase |
Protein accession | YP_001546236 |
Protein GI | 159899989 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00045069 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATC CCCAGTTGAT TGGGCGCATG CTTAATCATT TCAAAATTGT CGATAAACTT GGCCAAGGCG GCATGGCTAT GGTTTATCGT GCTTACCAAG AAAACCTGAA TCGTACTGTG GCGCTCAAAT TGTTGCCACC AGAGATGACG TTTGATCAAA GCTATATTGC GCGGTTCCAG CAAGAGGCGC GGGCCGCCGC AGGGCTTGAA CATAGCCATA TTGTGCCAAT CTACGAAGTT GGCCAAGCTG AAGGCTTTTA CTACATCGTC ATGAAATATA TTGAGGGCAA TACGCTCAAG GAAAATATCG AGCAAGAAGC CCCAATGTCG GTTCATCGAG TGCTGGAGTT GCTTGAGCCA GTTGGCAAGG CGCTCGATTA TGCCCATCGC AAGGGTGTCA TTCACCGTGA TATCAAGCCT TCGAATGTGA TGCTCACGCC TGAAGGCTGG GTTTATTTGA CCGACTTTGG CTTGGCTCGC GGTGGGAGCA GCGATTCGGG CTTGACCCAA GTTGGCACGG TGATGGGCAC ACCGGAATAT ATGTCGCCTG AGCAGGCTCA GGGCTTGACA GTTGGCGCTG CTAGCGATCT CTATGCCTTG GCGGTTATGG CTTACGAAAT GCTGACCAAA CAGATGCCAT TTGTCGCCAA TAATGCCCAA GCTGTGCTTT TAGCGCGGGT TATTCGTGCA CCACGCGCTC CCAGCGATCT GATTCCGACC ATGCCATCGG CGGTCGAAGA TGTCTTGATG AAGGCGTTGG CTCGCACGCC TGAGGCACGT TATCCAACAG CGGCGGCTTT TTTCGAGGCC TTGCGGCAAG CGAGTAATGG TGCACGGCCA AATGTGGCTG CGGCTACGCC ATTTGCCCAA AATCAGCCAG CCCAATATGC GCCAACATCG CCGAGTAGCC CACAGGTCTA TCCGCCAACG CCGCTGAGCA ATCAACAGGC GGTTGCGCCG CATTACCCAC CAACGCCGCT GAGCAATCAA CAGGCGGTTG CGCCGCATTA CCCGCCAACG CCAGTGAGCA ATCAGCAGGT AATGCCGAAT TATCCACCGA CCAATCCCAG CAATCAACAA GTGGTGATCC ATAGCCAATC GCCCTATGAT GGCTATGTCG CGGCCAATAC CCAAGCCACT CGGCCAGCAA TCATGCCAAA TGCGGCCCAG CCAGCACAAT ATAACCAACA GCCAATTAGC CAACCCAGTC CAGTTGCCTA TACAGGTGCT ACCTCAGTTC TGCGCAACAA GCAAAAATTG ACGATTTGGG TTGGCCTAGG GGTTTTGCTG TTGGTGGCAG TTGTGGTTGG GGTTATTTTG GCTTCTGGCA GTGATGCCGA AGATATTATT GCTCAAGGCG ATGCTGCCTT TGAACGCCGT GGCGGCTTGA TCGAAGCAAT TAATCTCTAC AAAGAAGCGA CCGCTGCTGA TGATGAGAGC TTCGAGGCCC ACGAAAAACT AGCCATTACC TATCTGATGC GTGGCCAAAC GCCTGATGCC GATCAAGCAA TTCGTCAAGC AATTGCGATT GATGCCAACC AAGCCAGTGC CCATGCTTGG CTCAGCCAAG TGCATTCCGA TAATCGTCAA TTTAATGAAT CTTTGGCTGA AGCCGAAGAA GCCGTCCGTT TGGATGCGAA TCATCCTTTG GCCTATATGG CGCGGGCTAC TGCACGAGCT GATGTCGGCA ACGAGCAAGG TGATAGCGAG TTGCTGGCCG ATGCCTTGGC CGATACCAAT AAAGCAATCG AGCTGGCAAC CAATCGTTCA CGCTTTGAGC AAGCTATGGC CTACAGCGCC AAAGGCTACG TTCAATGGGT CACCTATCAA GATCAAACCA GCCGCGACGC TGGCGCTGGC AAAGAATTTG TCGTCGATGG GATTGATAAT TTCAATCGGG CGATTGGTTT GCAAGAGCAA TTGCCGTTGC TGCGTAATAA TATTGGCTAT TTTTACGCCG AACAAGCGCG GGTTGCCCTG CACCTCGGCG AAGATGAAAC CGCAGCTCAG CGCTTTGAAA AAGCCTATCA ATCATTCGAC GATGCCCTAG CGCTTGACCC AAATTATGGT TTGGCCTTTG CAGGCAAGGG TTGGACGCAA ATTTACGAGC GTAAATATGA AGAAGCCCAG CAGTTTTTCG ATCTAGCGCT TGAGCGTAAC CAGCGTGACC CCAACGCCTT GAATGGACGA GCATTGACCA ACTGGTGGCT GGGTCGCAAC AACTCCAGCG ATCCCCAAAG CGATTATGCC GCAGCGATTC GCGATTATGA AGCGGCGATT GCCGAAGCTC CATCGTGGCT ATCGGTCTAT GTCGATTTGG GTTATGTCTA TCTCTACGAC ACCAAAGATA CTGATAAAGC CATCGAAACC TTTAAAAAGG CTTTGGAACG TGACCCGGAA TACCCAAATG CAATTGCTGG CTTAGCCGAT ACCTACTACG ACACGCGCTA CTATGATGAA GCGTTGAAGC TCTACGAACA AACGATTAAT CTCCAGCCTG ATTATGCGAC GGCCTACCTC GGCAAAGCCA ATATCTTGTA CAATAACAAA GATTATGATG CAGCGATCGA TCAATATAGC ACGGCGCTTG ATTATAATCC CTCGTTGAAA AATGCTTATA TTGGCAAAGC CTATTGCTAT CAAGCCAAAG GCGATATCGA CGAGGCTCGC CAAGTTTTGC AAGATGGATT AGAATCAGTG GCCTATGTTG ATCAATCCGA ATTGCAAACT ATTTTGGATA AGATGAAGTA A
|
Protein sequence | MQDPQLIGRM LNHFKIVDKL GQGGMAMVYR AYQENLNRTV ALKLLPPEMT FDQSYIARFQ QEARAAAGLE HSHIVPIYEV GQAEGFYYIV MKYIEGNTLK ENIEQEAPMS VHRVLELLEP VGKALDYAHR KGVIHRDIKP SNVMLTPEGW VYLTDFGLAR GGSSDSGLTQ VGTVMGTPEY MSPEQAQGLT VGAASDLYAL AVMAYEMLTK QMPFVANNAQ AVLLARVIRA PRAPSDLIPT MPSAVEDVLM KALARTPEAR YPTAAAFFEA LRQASNGARP NVAAATPFAQ NQPAQYAPTS PSSPQVYPPT PLSNQQAVAP HYPPTPLSNQ QAVAPHYPPT PVSNQQVMPN YPPTNPSNQQ VVIHSQSPYD GYVAANTQAT RPAIMPNAAQ PAQYNQQPIS QPSPVAYTGA TSVLRNKQKL TIWVGLGVLL LVAVVVGVIL ASGSDAEDII AQGDAAFERR GGLIEAINLY KEATAADDES FEAHEKLAIT YLMRGQTPDA DQAIRQAIAI DANQASAHAW LSQVHSDNRQ FNESLAEAEE AVRLDANHPL AYMARATARA DVGNEQGDSE LLADALADTN KAIELATNRS RFEQAMAYSA KGYVQWVTYQ DQTSRDAGAG KEFVVDGIDN FNRAIGLQEQ LPLLRNNIGY FYAEQARVAL HLGEDETAAQ RFEKAYQSFD DALALDPNYG LAFAGKGWTQ IYERKYEEAQ QFFDLALERN QRDPNALNGR ALTNWWLGRN NSSDPQSDYA AAIRDYEAAI AEAPSWLSVY VDLGYVYLYD TKDTDKAIET FKKALERDPE YPNAIAGLAD TYYDTRYYDE ALKLYEQTIN LQPDYATAYL GKANILYNNK DYDAAIDQYS TALDYNPSLK NAYIGKAYCY QAKGDIDEAR QVLQDGLESV AYVDQSELQT ILDKMK
|
| |