Gene Haur_3855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3855 
Symbol 
ID5735734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4839330 
End bp4840334 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content51% 
IMG OID641281006 
Productaminoglycoside phosphotransferase 
Protein accessionYP_001546617 
Protein GI159900370 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000520845 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAAG CCCTTGCCCC CGACCCTGCC ACGATTCGTC AACATCTTAC CCAAACCTAT 
GCGATTCAGC CCAACCGCAT CGAGCAAATC AATCGTGGCA ATGACCCACG CGCGGCGATT
TACCATGTGC AAACCAACGA ACAACCCTAC TTTCTCAAGC TCAAGGCTGG CTCAATCTAT
CAGGCAGGGG TGTTATTATC GCGCTATCTC AAGGATCGGG GAGTGGCGGC GGTTGCGCCA
GTCGATACCC GCACCCAGCA GCTTTGGAGC CATTGCCAAC AATTTCATAG CGTGCTCTAC
CCCTATATCG AAGGCGCAAC AGGCATGGAC CAAGGCATGT CGGCGCTGCA ATGGCGGAGT
TTTGGCCAAC AATTGCGCCG AATTCATACG ATGCAAGTGC GTGCGCCGCT ACGCCAGATG
CTGCAATGGG AGCAATTTCG CCCGCTCTGG TTGCCAACGG TTCAAGCAAT TCACAACAGC
ATCAATACTT GGCCAATTGG CGATAGCTAT AGCGCCGAGT TAATTGATTT TTGGCGAGTC
AAATCGGTCG AAATCAGTTA TTTGATCAAG CGAATTAGCG CATTAGGCCA TGAGCTAAGG
GCCAATGCTG GTGATTTTGG CCTGAGCCAT GGCGATATTC ACACCGCCAA CATTGTGCTC
GATCAGATTC AACAGATTAA TATCGTCGAT TGGGATTACC CGATGTTTGC TCCCAAAGAG
CGTGATTTGC GTTTTGTGGT TGGTTCTGTC ATCGGTGTGC CAGTGCAGCA GCATGAAGAA
CAATGGTTTT TTGAGGGCTA CGGCCAACCA ACGATTGACT ACAAAGCCTT GGCCTACTAT
CGTTATGAGC GGGTGATTCA AGACCTTGGC GATTATGCCC AGCGGGTATT GTTACGCCGT
GATGCCCCGC CAGCATTCAA ACAAGCCGCC TTACAATCGT TGCGTTCGCG CTTTTCGACT
GGCAATATCA TCGAATCGGC CTATCAAGCT GATCGCACAA GTTAA
 
Protein sequence
MQQALAPDPA TIRQHLTQTY AIQPNRIEQI NRGNDPRAAI YHVQTNEQPY FLKLKAGSIY 
QAGVLLSRYL KDRGVAAVAP VDTRTQQLWS HCQQFHSVLY PYIEGATGMD QGMSALQWRS
FGQQLRRIHT MQVRAPLRQM LQWEQFRPLW LPTVQAIHNS INTWPIGDSY SAELIDFWRV
KSVEISYLIK RISALGHELR ANAGDFGLSH GDIHTANIVL DQIQQINIVD WDYPMFAPKE
RDLRFVVGSV IGVPVQQHEE QWFFEGYGQP TIDYKALAYY RYERVIQDLG DYAQRVLLRR
DAPPAFKQAA LQSLRSRFST GNIIESAYQA DRTS