Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3068 |
Symbol | |
ID | 5734940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3873148 |
End bp | 3875013 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280212 |
Product | serine/threonine protein kinase |
Protein accession | YP_001545834 |
Protein GI | 159899587 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [S] Function unknown [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000268642 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACTG TTATGAACGC ATTAGCGTGT ACAAATTGTC AGATGTTGCT AGCACCAGGT GCACGATTCT GCTCGCATTG TGGAACGGTT GCGCCAACGC CACCAGCTAT CAACCTTCCA CGCGAAGCAA GCGGCACGCT GCGACGTGGG GTGATTCTCC AAGAGCGTTA TAAAATTATG ACCTTTATTG GAGCTGGTGG TTTTAGTCGG GTTTATGGTG CAATCGATTT GCGGCTCAAA ACGCCCTGCG CCATCAAAGA AAACAATGAA TTTGATCAAT CGGCTCATGC TCAGTTTTTG GCCGAAGCCC AATTGTTGGC GCGTTTGCGC CACCCCAGCA TGACCAAAGT CACCGATTAT TTCACTGATC CCAGCGGGGC GCAGTTTTTG GTGATGGAAT ATGCGCCTGG TAAAGATCTC GAAAAGATGA TGGATGAAGC CGAAGAGATG GTTTCATGGC GCAAGGTGGC CGAATGGGGT CAAATTGTCT GTGATGTACT GACCTATTTG CACACCCAAG AGCCGCCAAT TATCCACCGC GACATTAAAC CCGCTAATTT GCGTTTGACT CCCCACGGCG ACTTGATGGT CATCGACTTG GGAATTGCTA AAGAATATCG CGATGGCTCA GCCACAACTC GCGCCGCCCA AGCATATTCG GGCGGCTACT CACCAATCGA GCAATATTTG GGGCAAGGCA CTGACCCACG TTCAGATTTA TATGCCTTGG GTGCAACGCT CTATCATTTG TTGGTTGGCA AGATGCCACC CGAAGCACCA AACCGCTTGC GCGGCATTTC GATGCAAACC GTTGAGCAAG CTCGCCCCGA TATTCCAGTG TTGTTGGCGC GGGCAATCGA TCGCGCGATG GCGATTGAGC CTGATCATCG CCCACCGAGT GCTGCGGCTT TGCATATGGT CTTCGAACGA GTGCTTGAGC AAGATCAAGC GGTCAGCATC AGCCAACCTG CGGTGATTGC CCAAGCGGCA ACCGCCCGAA CCCTGCAAGC TGCGGCGCAT GCGGTTGCTA CGCCTGTGGC AATCAGTCAA CCACGCATCG GCACGCAACC GCGCTCAACC ACCAATGCTG GTCAACCCAG CCGCCCATTA CATGTGCCAA GCGAACTGCG TGCTGAACGC TTGCCGCATG GCATTATTTG GGAAGGTGAT CGGCGTGAAA TGGTGCGGGT TATGGCAGGG CCAATGCCAA TGGGCAGCGA GGGCAATGAT CCCGATGAAG TTCCAGTGCA TCGGCTTGAC CTCAAAACGT TTTTGATCGA CCGCTTTCCG GTGACATGTG CCGATTATGC GCGTTTTGTG CAGGAAACCG GAACAACACC GCCACGGTAT TGGGGTGGCC CATTGCCGCC GCATATGATC GAAGATCATC CAGTCGTCGA AATAACGCAC GACGAAGCGC GAGCTTACGC ACGTTGGGCT GGCAAGCGCC TACCAACCGA AGCCGAATGG GAAAAAGCGG CAACATGGGA TGCCACAACT GGGCACAAGC GGGTTTATCC TTGGGGCGAC CAATGGGATG AGCATCGCGC CAATGCGCGG GAAGGCGGAG CTGGCGGAAC CGTGCCAATT GGCGCGTATT CACCACAGGG CGATAGCCCA TGTGGCGCAG CAGAAATGGC TGGCAATATT TGGGAGTGGA CAGATACGGC CTATAAGCGC TATCCTTATG ACCCAAGTGA TGGCCGCGAT TTCCCCAAGA ATGCTGGCTT GCGCGTCACC CGTGGTGGGT CGTGGTCGTG TTCGCCGGAT GCATTACGCG GAGCGAATCG CAACATAGCT GCTGCCAACG ATGCCGATTT TGAGGTGGGC TTCCGTTGTG CCGCTGATGT ACGCGAGGAT TGGTAA
|
Protein sequence | MSTVMNALAC TNCQMLLAPG ARFCSHCGTV APTPPAINLP REASGTLRRG VILQERYKIM TFIGAGGFSR VYGAIDLRLK TPCAIKENNE FDQSAHAQFL AEAQLLARLR HPSMTKVTDY FTDPSGAQFL VMEYAPGKDL EKMMDEAEEM VSWRKVAEWG QIVCDVLTYL HTQEPPIIHR DIKPANLRLT PHGDLMVIDL GIAKEYRDGS ATTRAAQAYS GGYSPIEQYL GQGTDPRSDL YALGATLYHL LVGKMPPEAP NRLRGISMQT VEQARPDIPV LLARAIDRAM AIEPDHRPPS AAALHMVFER VLEQDQAVSI SQPAVIAQAA TARTLQAAAH AVATPVAISQ PRIGTQPRST TNAGQPSRPL HVPSELRAER LPHGIIWEGD RREMVRVMAG PMPMGSEGND PDEVPVHRLD LKTFLIDRFP VTCADYARFV QETGTTPPRY WGGPLPPHMI EDHPVVEITH DEARAYARWA GKRLPTEAEW EKAATWDATT GHKRVYPWGD QWDEHRANAR EGGAGGTVPI GAYSPQGDSP CGAAEMAGNI WEWTDTAYKR YPYDPSDGRD FPKNAGLRVT RGGSWSCSPD ALRGANRNIA AANDADFEVG FRCAADVRED W
|
| |