Gene Information |
Locus tag | Haur_0365 |
Symbol | |
ID | 5732216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 436392 |
End bp | 438029 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277488 |
Product | Dak phosphatase |
Protein accession | YP_001543144 |
Protein GI | 159896897 |
COG category | [R] General function prediction only |
COG ID | [COG1461] Predicted kinase related to dihydroxyacetone kinase |
TIGRFAM ID | [TIGR03599] DAK2 domain fusion protein YloV |
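
The coordinate and length fields above can be cross-checked with a little arithmetic. This is a minimal sketch, not part of the record: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 436392
end_bp = 438029

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1638 bp
print(protein_length, "aa")   # 545 aa
```

Under those assumptions the result agrees with the listed Gene Length (1638 bp) and Protein Length (545 aa).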
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0753407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACAGCG CACCGAAGAA TCAAATTGAT GGCACGCGGC TGTTGGCGGT ATTCCGCGCA GCCGCCGCAT GGTTTAGCCA GAATGTGGCG ACTGTCAACG CCCTAAATGT GTTTCCTGTG CCCGATGGCG ATACCGGAAC CAACATGAAC CTTACGCTGA CAGCCGCCCT CAAGGATGTT CAGAATGATG CCTCGGTCGC CGTGGTGGCG GAACGGGTCT ATCGCGGGGC TTTGATGGGC GCTCGTGGTA ACTCGGGCGT GATTCTTTCG CAAATTCTGC GCGGGCTTTC GCAAGGCATG GTCGGCCAGC AGGTTTGTAC GCCTGAAATT CTGGTAACTG CGCTCGAACA AGCCGCTACC ACTGCCTACA AAGCGGTGAT CAAGCCAGTC GAGGGCACGA TGCTTACCGT TATTCGCGAA ACCAGCGAAG CCGCTCGCGC CGGATTTCAG CCCGAAATGA ATTGGCATGA AGTACTGGAT TTAATTGTTA AAGGTGCGCG AGTTTCGGTC GATAATACGC CCAACCTCAT GAAAATGTTA CGTGATGCTG GCGTAGTTGA TGCTGGCGGC GAGGGCTTGT ATCTGCTCTT TGAAGGCGCA CGCGCTTTTG CCCGTGGCGA ACAACTCGAA CAACGAGTTG CACCCGTCGA TCAGTTGGCG ATGGCGTTTG ACGACATTCA TAGCGATGAT GATTTTGGCT ATTGCACCAA CTTTATGATC CAAGGCGAAA ATATCCCTTA CGAAGATGTG CGCAACACGA TTGCCGAAAT GGGCACATCG GTGGTGGCGG TCGGCGATGA GCGCTTGGTC AAGGTGCACT TGCACACGTT GCGGCCTGGC GATGCGCTCA ATTATGCCGT GCAATGGGGC AGCCTTGGGG CAATCGAAAT CACCAATATG GATAAACAGC GCAGCGATCT GCATGCGGCC CAAGCGCAAC AAGCCAGCCA ACCAGCCCGC GTCAAGCTCG ACGAGCCAGT CAGCGATGTT GGGGTGGTCG CAGTTGCACC AGGTCAAGGC TTCCGTGTGC TGTTCGAATC ATTAAATGTG GGCGAGGTGG TCACTGGCGG TCAAACCATG AATCCTTCGA TTCAAGATTT GGTCACGGCG ATCGATAAGT TGCCACAGCC AGAGGTGATC GTGTTGCCCA ATAATAGCAA CGTGATTTTG GCAGCGCAAC AAGCCCAACA AGTAACCAAT AAAGTTGTGC ATGTGATTCC AACCAAAACC GTGCCTCAAG GTATGGCGGC AATGTTTGCC TTTAATTATG CGGTTGGCGC AAGCGATAAT GTGCAGGCCA TGAGCCGCGC GATCAAAGAT ATTACCACGG CAGAAATTAC CACCGCCGTG CGCGATGCTA CGGTTAATGA TGTTGAAGTG CGCGACGGCC AAACGATTGG CTTGCTCAAT GGCGCACTGG TTGAATCTGG CGATCAGCCC GACGAAGTGA TTGATCGCAT TCTAGCACGG ATGGATTTAG ACGATCATGA GATCGTTACC ATCTATTATG GCGAACAATG TTCGGCGGAA CAGGCTGAAG CACTAGCCCA CAAAATCAAT GCGACCTACC CGGCGCTTGA TGTTGAGGTG CAAAACGGCG GACAACCATT TTATGATTAC ATTCTCTCTG CGGAGTGA
Protein sequence | MDSAPKNQID GTRLLAVFRA AAAWFSQNVA TVNALNVFPV PDGDTGTNMN LTLTAALKDV QNDASVAVVA ERVYRGALMG ARGNSGVILS QILRGLSQGM VGQQVCTPEI LVTALEQAAT TAYKAVIKPV EGTMLTVIRE TSEAARAGFQ PEMNWHEVLD LIVKGARVSV DNTPNLMKML RDAGVVDAGG EGLYLLFEGA RAFARGEQLE QRVAPVDQLA MAFDDIHSDD DFGYCTNFMI QGENIPYEDV RNTIAEMGTS VVAVGDERLV KVHLHTLRPG DALNYAVQWG SLGAIEITNM DKQRSDLHAA QAQQASQPAR VKLDEPVSDV GVVAVAPGQG FRVLFESLNV GEVVTGGQTM NPSIQDLVTA IDKLPQPEVI VLPNNSNVIL AAQQAQQVTN KVVHVIPTKT VPQGMAAMFA FNYAVGASDN VQAMSRAIKD ITTAEITTAV RDATVNDVEV RDGQTIGLLN GALVESGDQP DEVIDRILAR MDLDDHEIVT IYYGEQCSAE QAEALAHKIN ATYPALDVEV QNGGQPFYDY ILSAE
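
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the tool that produced the record: the function names `translate_cds` and `gc_percent` are hypothetical, the codon table is the standard amino-acid assignment that translation table 11 uses, and the first codon is simply assumed to be an initiator read as methionine (here it is GTG). The sequence is used as listed, with no reverse-complementing, since it already begins with the start codon.

```python
from itertools import product

# Amino-acid assignments for codons in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a coding sequence; assumes it starts at an initiator codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if protein:
        protein[0] = "M"  # assumption: the first codon is read as Met
    return "".join(protein).rstrip("*")  # drop the terminal stop, if present


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("GTGGACAGCG CACCGAAGAA TCAAATTGAT"))  # → MDSAPKNQID
```

Passing the full Gene sequence field to `translate_cds` and `gc_percent` would allow a comparison against the listed Protein sequence and GC content fields.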