Gene Haur_5041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5041 
Symbol 
ID5737000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp54668 
End bp55738 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID641282208 
Productaminoglycoside phosphotransferase 
Protein accessionYP_001547799 
Protein GI159901553 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACT TTCATACCCT GCAGGCACGC GGGCAGCTCA ATCGCCTCCG TCAGCTCGCG 
CGTGCGGCAC TGGCTGACTA CGCCATCGTG AATCCCACAT TTCTGCCGCT CCGCCATGAG
ACGAATACGA CCTTCCGGGT TCAGACACCC GATGGAAGCA CATATGTCCT CCGCATTCAT
CGGCCCCAAG GACATACGTT TGAGCAGATT CGCTCGGAGC TGCAATGGCT CAGCGCCCTT
CGGCACGATC TGAAGGCAGC GGTTCCGGAA CCGATTCCGA CTCGCGATGG CGCGCTCCTT
ACCATCGCGT CGGCTCAGGG CGTTCCGGAG CCACGGATCT GCGTGCTCTT TCGCTGGCTC
CCAGGGCGCT TCTTCAACGA CACCATCACG CCGGGGCGAA TGGCGCATAT AGGGCGGCTG
ACCGCGCTGT TCCACACCCA TACTAGCCAC TGGCAGGCCC CAAGCGATTT TCGCCGTGGC
CGCGCAGATG CCCTCACGGA GGAAGGGCGT CAGCGCGATT GGCGTGCGCC TGCGGCGGAC
CAGCCCGCCA CGGACGTTCA CCCTGGTGGA TACGATGCGG CCCAGGCCAT TGCCGTGGTG
ACCACACTAT GTTCGTCCAG TGATGCGGCG ATTGTGACAG CGGCGCTTGA GCGTATCCGC
GCCGTGTTCC ACGAGCTTGG CGAGAGCAGA GAGGTCTTCG GGCTCATTCA TGGCGACCTT
CACCAGGAGA ATTATTTCTT CCATGGAGGT TCGGCGGGGG CAATCGATTT TGACGACTGC
GGATGGGGCC ATTTTCTCTT CGATCTTAGT ATTACCCTGC GCGAGATCCA GGACCTCCCA
TCCTATCCGG CACTTCGAGC GGCCCTCTTG CGCGGGTATC GCGCCGTCCG CCCGCTCCCC
AGCGACCACG AACGCCATCT TGAGGCGTTC TTCGCCCTCC GGCATATCCA GATCTTAATG
TGGATCCTCG AATCACATGA CCATCCGGCC TTCCGCGACG ACTGGGTGGC ACAAGCACAC
TATGAGATAG AGCAACTCCG CCAGTTCGTC ATCAGGGGGC CGATCAGCTG A
 
Protein sequence
MKNFHTLQAR GQLNRLRQLA RAALADYAIV NPTFLPLRHE TNTTFRVQTP DGSTYVLRIH 
RPQGHTFEQI RSELQWLSAL RHDLKAAVPE PIPTRDGALL TIASAQGVPE PRICVLFRWL
PGRFFNDTIT PGRMAHIGRL TALFHTHTSH WQAPSDFRRG RADALTEEGR QRDWRAPAAD
QPATDVHPGG YDAAQAIAVV TTLCSSSDAA IVTAALERIR AVFHELGESR EVFGLIHGDL
HQENYFFHGG SAGAIDFDDC GWGHFLFDLS ITLREIQDLP SYPALRAALL RGYRAVRPLP
SDHERHLEAF FALRHIQILM WILESHDHPA FRDDWVAQAH YEIEQLRQFV IRGPIS