Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5041 |
Symbol | |
ID | 5737000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 54668 |
End bp | 55738 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641282208 |
Product | aminoglycoside phosphotransferase |
Protein accession | YP_001547799 |
Protein GI | 159901553 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACT TTCATACCCT GCAGGCACGC GGGCAGCTCA ATCGCCTCCG TCAGCTCGCG CGTGCGGCAC TGGCTGACTA CGCCATCGTG AATCCCACAT TTCTGCCGCT CCGCCATGAG ACGAATACGA CCTTCCGGGT TCAGACACCC GATGGAAGCA CATATGTCCT CCGCATTCAT CGGCCCCAAG GACATACGTT TGAGCAGATT CGCTCGGAGC TGCAATGGCT CAGCGCCCTT CGGCACGATC TGAAGGCAGC GGTTCCGGAA CCGATTCCGA CTCGCGATGG CGCGCTCCTT ACCATCGCGT CGGCTCAGGG CGTTCCGGAG CCACGGATCT GCGTGCTCTT TCGCTGGCTC CCAGGGCGCT TCTTCAACGA CACCATCACG CCGGGGCGAA TGGCGCATAT AGGGCGGCTG ACCGCGCTGT TCCACACCCA TACTAGCCAC TGGCAGGCCC CAAGCGATTT TCGCCGTGGC CGCGCAGATG CCCTCACGGA GGAAGGGCGT CAGCGCGATT GGCGTGCGCC TGCGGCGGAC CAGCCCGCCA CGGACGTTCA CCCTGGTGGA TACGATGCGG CCCAGGCCAT TGCCGTGGTG ACCACACTAT GTTCGTCCAG TGATGCGGCG ATTGTGACAG CGGCGCTTGA GCGTATCCGC GCCGTGTTCC ACGAGCTTGG CGAGAGCAGA GAGGTCTTCG GGCTCATTCA TGGCGACCTT CACCAGGAGA ATTATTTCTT CCATGGAGGT TCGGCGGGGG CAATCGATTT TGACGACTGC GGATGGGGCC ATTTTCTCTT CGATCTTAGT ATTACCCTGC GCGAGATCCA GGACCTCCCA TCCTATCCGG CACTTCGAGC GGCCCTCTTG CGCGGGTATC GCGCCGTCCG CCCGCTCCCC AGCGACCACG AACGCCATCT TGAGGCGTTC TTCGCCCTCC GGCATATCCA GATCTTAATG TGGATCCTCG AATCACATGA CCATCCGGCC TTCCGCGACG ACTGGGTGGC ACAAGCACAC TATGAGATAG AGCAACTCCG CCAGTTCGTC ATCAGGGGGC CGATCAGCTG A
|
Protein sequence | MKNFHTLQAR GQLNRLRQLA RAALADYAIV NPTFLPLRHE TNTTFRVQTP DGSTYVLRIH RPQGHTFEQI RSELQWLSAL RHDLKAAVPE PIPTRDGALL TIASAQGVPE PRICVLFRWL PGRFFNDTIT PGRMAHIGRL TALFHTHTSH WQAPSDFRRG RADALTEEGR QRDWRAPAAD QPATDVHPGG YDAAQAIAVV TTLCSSSDAA IVTAALERIR AVFHELGESR EVFGLIHGDL HQENYFFHGG SAGAIDFDDC GWGHFLFDLS ITLREIQDLP SYPALRAALL RGYRAVRPLP SDHERHLEAF FALRHIQILM WILESHDHPA FRDDWVAQAH YEIEQLRQFV IRGPIS
|
| |