Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3486 |
Symbol | |
ID | 5735347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4390729 |
End bp | 4393698 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280633 |
Product | serine/threonine protein kinase |
Protein accession | YP_001546250 |
Protein GI | 159900003 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00873447 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAT TTGAACATAC CCAACTGGGT GAGTATACAT TGCAAGAAGA GATTGGCCGT GGCGGGATGG CGCAAGTCTA TCGAGCGCAG CATCAAGCCT ATGGTCTGGT TGCCTTCAAA GTATTACCAC CCTATTTTGC CCACGATAAC GATACACTTC AGCGTTTTAT GCGTGAGGCG CGGGCCAATC GTACCTTGCA CCACCCGCAC ATTGTGCAAT TGTATGAAGC CAGCGACATT GCCCAAGCCC AAAACCCCTA CCAGCCGATC CACTATATCG CAATGGAATA TATCGCTGGG GGCACGCTGA CCGATCGCTT GCGTCAACAG CCGCAACAGC CATTAAACTC GACGCTTGAA ATGGGTGAGC AAATTGGCTC AGCCTTGGAT TATGCCCATG GCAAGGGTTT TATTCACCGC GATATCAAGC CGAGCAATAT TTTGTTTCGC AGCAACGGCC ATGCTGTGCT CGCCGACTTT GGGATTGCCC TCGCCAACAA CGAGGCCCGC ATGACCAAGG CTGGCGGTTT TGCTGGTACG GTCGCTTACA CTGCGCCCGA AATTTTCGAG GGCGAAACCG CCGATGTGCG CTCGGATATT TATGCTTTGG GCTTGATTTT GTATGAATCG TTGGCTGGGC ATAACCCCTA TGCCAACATC AGCACTAATG CCCAAATTGC CATGAGCAAA ATTTTAACGA CACCGCTGCC ACCATTGCAA GATGTTGCGC CGCATGTGCC TCCGTTGACG GCTGAAATTC TGGCTCAAGC AACCGCCAAA GATCCAATTC GGCGCTTTGC TACAATGTCG GATTTTGTTG AGGCCTTGAA ACAAGCTAAA TTCAATCGGG TCAGCGATCG ACCTGCGCAA CAATTGAATG CCAGTGGTCG CCCGATGATT CCAATTGCTG GTCGGCCTGG AGCGCCAAAT CGCCCACCAC CACGCAATCA GCCAAGTGGC GATAACCCAA CGCAAGCCTT TGTAGCTGGG GTTGCTGCTT CATCTGCGGC GGCTGCGGCC ATGCCTGAGT CTAATGAAGG CACGATGATC TACACGCCAC CAACCCCAAA TCCGGTGGTG AATCAACAGG CTGCGCCGCC ACCAAACGAG CCAACCCAAA TCTACACGCC ACCAACCCCC AATTCGGTGG TGAATCAACA GGCTGCGCCG CCACCAAACG AGCCAACCCA AATCTACACG CCACCGACCC CGAATCCGGT GGTGCAACAG GCTGCGCCAG CGGCCAAGCC GCCAGTTGAT GCGACCCAAA TCTATACGCC GCCGACCCCG AATCCGGTGG TGCAACAGGC TGCGCCAGCG GCCAAGCCGC CAGTTGATGC GACCCAGATT TATACGCCAC CAGCGGCTAA CCCACCGTTA CGGCAAGCAC CGATTACCCA GCCCAATCAA CCCTTGCCAA GTGTCCAATC GCAGGCCAAC CAGCCGTTGC CAGCTGCGCC AACCATGCCG CTGGATAACC AAAGTACAAT GATGTACACG CCAACTGCAA GCGTGCCCAA TCGGCCAATT CCAAGCCGAA CCAACCAGCC AGTTGATAAC GAAAGCACCG TGATGTATCC GGCCTATCCG CCGAATCAGC CACAAGCCTA TCCACCGCCA GCAACTGGTA ACCAAACCTA TATGCCGCCG ACTGGCAATC AGGCCTATCC ACCAATGGCT GGGGCTTCGC AGCCAAACCA ACAACTGCCC AAAAAAGGGC CAGCGCGGCC AATTAACGCC CAACGGATTG AACTGGCGCG ACCTGCGGCT CCTGTTCAAG CAACTCAGGC GGCGGAAACT GACTATGCGC CTGTTCCACC GCCCTATGCC ATGCCGCCAA AAGCATCGCC ACTTGCCAAA GCCAAGGATT TCTTCCGTTC GCCAAAGGTT CTCGTGGCCT TAGGTAGCGC ACTTTTAGCC TTGATTGTGA TTGCATTTTT GAGCTTTGGC CAAGGTGAAG ATGACCCAAA TCTTGCTAGC GGCGCTACAG CTACCGCAGA AGTTGCGGTT GATGAGAGTA CCCCAACGGT TGCGGCAACC AATGCGGCCA CTCCAACTGT GGCCGAAACT GCCGATCCCC AAGAGGCCAA ACGCTTAGAA TATATCGTTG GGCAAGAAGC TTACGCTAAA CAAGATTGGC CAAATGCAGC AGCGGCCTTT GACAAAGTGC TCGCCATCGA TCCAGCCTAT CTTGATTTGA GCAACATTGG CTCGGCAACT TACTACAATT GGGCGGTCAG CGAATTAACT GGCCCTGAAA ACGTTGCAGA AAGCCTTGAA ATTCTCAACA AAACCTTTGG TTTCAAGCCC GACCATCAAC CAGGTGGCAA CTTAGCCAAA GTGCTGAATT TCTATCGCAA TGGCCAAACT GCCGCCGAAC AACAAGATTG GCAAGCAGCG ATTAATGGCT ATAAAGAAGC CCAAACTGCT GGCAGCGGCG AGTTTGGCGA AATTATGGGT AAGCTACAAA CGGTCAAGCA ATTGTATGAA GCCTATCTTG GGCGCGGGCG CGAGCTAGAG GAGGCTGGTA ACGAAACCGA AGCCAATGCA ATCTATCGTG AAGCTGCCGC CCTCAAGGAT CTGGACAACA GTTTAGATGT TGCAGCAGCT AATACAGGCA TTGCGGCAAC CCAACCAACT GCGGTGCCAG CGACCCCAAC GAAGCCAGCA CCACCAACGA CTGCCCCGGT AGCTCAACGC TTGTACTTCC AAAAATATGC CGAAAACGCG GTTGATCCAA CCTGTTTTGC GGTGCATATC CGTGGGGTTA ACACGGGCGG CTGGTTTGTG ACGGTCGATG GACTGGGCAA TCGCGGCAAT GTCGATGGCG CTGGCAACAC CAACGTTTGC GGATTGTCCC CCAGCCAAGA AGTGACCTTC ACGGTTTACA ATGGCTCTGG GCAGGCTGTG CCAGGTGGCG GCGGCATCCC GACCCGTGGC GGCGATTTGA TGACCGGCTA TTGGCAATAA
|
Protein sequence | MKSFEHTQLG EYTLQEEIGR GGMAQVYRAQ HQAYGLVAFK VLPPYFAHDN DTLQRFMREA RANRTLHHPH IVQLYEASDI AQAQNPYQPI HYIAMEYIAG GTLTDRLRQQ PQQPLNSTLE MGEQIGSALD YAHGKGFIHR DIKPSNILFR SNGHAVLADF GIALANNEAR MTKAGGFAGT VAYTAPEIFE GETADVRSDI YALGLILYES LAGHNPYANI STNAQIAMSK ILTTPLPPLQ DVAPHVPPLT AEILAQATAK DPIRRFATMS DFVEALKQAK FNRVSDRPAQ QLNASGRPMI PIAGRPGAPN RPPPRNQPSG DNPTQAFVAG VAASSAAAAA MPESNEGTMI YTPPTPNPVV NQQAAPPPNE PTQIYTPPTP NSVVNQQAAP PPNEPTQIYT PPTPNPVVQQ AAPAAKPPVD ATQIYTPPTP NPVVQQAAPA AKPPVDATQI YTPPAANPPL RQAPITQPNQ PLPSVQSQAN QPLPAAPTMP LDNQSTMMYT PTASVPNRPI PSRTNQPVDN ESTVMYPAYP PNQPQAYPPP ATGNQTYMPP TGNQAYPPMA GASQPNQQLP KKGPARPINA QRIELARPAA PVQATQAAET DYAPVPPPYA MPPKASPLAK AKDFFRSPKV LVALGSALLA LIVIAFLSFG QGEDDPNLAS GATATAEVAV DESTPTVAAT NAATPTVAET ADPQEAKRLE YIVGQEAYAK QDWPNAAAAF DKVLAIDPAY LDLSNIGSAT YYNWAVSELT GPENVAESLE ILNKTFGFKP DHQPGGNLAK VLNFYRNGQT AAEQQDWQAA INGYKEAQTA GSGEFGEIMG KLQTVKQLYE AYLGRGRELE EAGNETEANA IYREAAALKD LDNSLDVAAA NTGIAATQPT AVPATPTKPA PPTTAPVAQR LYFQKYAENA VDPTCFAVHI RGVNTGGWFV TVDGLGNRGN VDGAGNTNVC GLSPSQEVTF TVYNGSGQAV PGGGGIPTRG GDLMTGYWQ
|
| |