Gene Haur_0365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0365 
Symbol 
ID5732216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp436392 
End bp438029 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content53% 
IMG OID641277488 
ProductDak phosphatase 
Protein accessionYP_001543144 
Protein GI159896897 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0753407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACAGCG CACCGAAGAA TCAAATTGAT GGCACGCGGC TGTTGGCGGT ATTCCGCGCA 
GCCGCCGCAT GGTTTAGCCA GAATGTGGCG ACTGTCAACG CCCTAAATGT GTTTCCTGTG
CCCGATGGCG ATACCGGAAC CAACATGAAC CTTACGCTGA CAGCCGCCCT CAAGGATGTT
CAGAATGATG CCTCGGTCGC CGTGGTGGCG GAACGGGTCT ATCGCGGGGC TTTGATGGGC
GCTCGTGGTA ACTCGGGCGT GATTCTTTCG CAAATTCTGC GCGGGCTTTC GCAAGGCATG
GTCGGCCAGC AGGTTTGTAC GCCTGAAATT CTGGTAACTG CGCTCGAACA AGCCGCTACC
ACTGCCTACA AAGCGGTGAT CAAGCCAGTC GAGGGCACGA TGCTTACCGT TATTCGCGAA
ACCAGCGAAG CCGCTCGCGC CGGATTTCAG CCCGAAATGA ATTGGCATGA AGTACTGGAT
TTAATTGTTA AAGGTGCGCG AGTTTCGGTC GATAATACGC CCAACCTCAT GAAAATGTTA
CGTGATGCTG GCGTAGTTGA TGCTGGCGGC GAGGGCTTGT ATCTGCTCTT TGAAGGCGCA
CGCGCTTTTG CCCGTGGCGA ACAACTCGAA CAACGAGTTG CACCCGTCGA TCAGTTGGCG
ATGGCGTTTG ACGACATTCA TAGCGATGAT GATTTTGGCT ATTGCACCAA CTTTATGATC
CAAGGCGAAA ATATCCCTTA CGAAGATGTG CGCAACACGA TTGCCGAAAT GGGCACATCG
GTGGTGGCGG TCGGCGATGA GCGCTTGGTC AAGGTGCACT TGCACACGTT GCGGCCTGGC
GATGCGCTCA ATTATGCCGT GCAATGGGGC AGCCTTGGGG CAATCGAAAT CACCAATATG
GATAAACAGC GCAGCGATCT GCATGCGGCC CAAGCGCAAC AAGCCAGCCA ACCAGCCCGC
GTCAAGCTCG ACGAGCCAGT CAGCGATGTT GGGGTGGTCG CAGTTGCACC AGGTCAAGGC
TTCCGTGTGC TGTTCGAATC ATTAAATGTG GGCGAGGTGG TCACTGGCGG TCAAACCATG
AATCCTTCGA TTCAAGATTT GGTCACGGCG ATCGATAAGT TGCCACAGCC AGAGGTGATC
GTGTTGCCCA ATAATAGCAA CGTGATTTTG GCAGCGCAAC AAGCCCAACA AGTAACCAAT
AAAGTTGTGC ATGTGATTCC AACCAAAACC GTGCCTCAAG GTATGGCGGC AATGTTTGCC
TTTAATTATG CGGTTGGCGC AAGCGATAAT GTGCAGGCCA TGAGCCGCGC GATCAAAGAT
ATTACCACGG CAGAAATTAC CACCGCCGTG CGCGATGCTA CGGTTAATGA TGTTGAAGTG
CGCGACGGCC AAACGATTGG CTTGCTCAAT GGCGCACTGG TTGAATCTGG CGATCAGCCC
GACGAAGTGA TTGATCGCAT TCTAGCACGG ATGGATTTAG ACGATCATGA GATCGTTACC
ATCTATTATG GCGAACAATG TTCGGCGGAA CAGGCTGAAG CACTAGCCCA CAAAATCAAT
GCGACCTACC CGGCGCTTGA TGTTGAGGTG CAAAACGGCG GACAACCATT TTATGATTAC
ATTCTCTCTG CGGAGTGA
 
Protein sequence
MDSAPKNQID GTRLLAVFRA AAAWFSQNVA TVNALNVFPV PDGDTGTNMN LTLTAALKDV 
QNDASVAVVA ERVYRGALMG ARGNSGVILS QILRGLSQGM VGQQVCTPEI LVTALEQAAT
TAYKAVIKPV EGTMLTVIRE TSEAARAGFQ PEMNWHEVLD LIVKGARVSV DNTPNLMKML
RDAGVVDAGG EGLYLLFEGA RAFARGEQLE QRVAPVDQLA MAFDDIHSDD DFGYCTNFMI
QGENIPYEDV RNTIAEMGTS VVAVGDERLV KVHLHTLRPG DALNYAVQWG SLGAIEITNM
DKQRSDLHAA QAQQASQPAR VKLDEPVSDV GVVAVAPGQG FRVLFESLNV GEVVTGGQTM
NPSIQDLVTA IDKLPQPEVI VLPNNSNVIL AAQQAQQVTN KVVHVIPTKT VPQGMAAMFA
FNYAVGASDN VQAMSRAIKD ITTAEITTAV RDATVNDVEV RDGQTIGLLN GALVESGDQP
DEVIDRILAR MDLDDHEIVT IYYGEQCSAE QAEALAHKIN ATYPALDVEV QNGGQPFYDY
ILSAE