Gene Haur_4312 details


Gene Information

Locus tag: Haur_4312
Symbol:
ID: 5736171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5505119
End bp: 5506783
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 51%
IMG OID: 641281472
Product: TPR repeat-containing serine/threonine protein kinase
Protein accession: YP_001547072
Protein GI: 159900825
COG category: [K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
[COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
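
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 5505119
end_bp = 5506783


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(nt_length):
    """Residues in a complete CDS: number of codons minus the stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return nt_length // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1665 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields.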


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCATTC TCGATAAGCT CCTCTCGCGC TCTGATGATG CCGATAATTT AAAGGGCGTT 
GAGTTTAAAG TTGGCGATAT TATCGACCAA CGCTATTTGG TGCGCAATGT GCGCAAAGGA
TTTATGGGCT TAGTCTACAT CGCCCGCGAT TTGCGCTCGG AGCAAACCGT GGCGATGAAA
ACGTTTCAAG CCAAATTTAC ATGGGTCGAT AGCGCAATTG CTAATTTTAC CCGTGAGGCC
GAAGTTTGGA TGCGGCTTGG TTCGCACCCA AATATTGTCG AAGCAACCAG AATTGTGACC
ATCGCTGGCC GTCCGCATAT CGTCATGGAG TTTGTGCCAG GGGTTTCGCT GCGCGAGATG
ATGCGCCGTG GTCGCTTGCG TTTCAAGCAC ATTGTCGATT TTGCCATCCA AATTTGTTGG
GGCATGCAAT ATGCCTACGA TCGCTGTAAT TTGATTCACC GCGATCTCAA GCCCGATAAT
GTGATGGTTA CACCCGAAGG CATCGCCAAA GTCACCGATT TTGGCTTGGC GCAAGGTGCA
AGTGTTTCAA CCAAAGTTCG TTGGGGCCAC GATTCGCAGC ACAACGAATC GAAAATTGTG
ACCCACGCCA GCGATTTGTT TGGCGGTTCG CAGCCCTACA TGTCGCCAGA ACAACGGGCA
CCAGGCACGC CGTTGGGCAC AACTAGCGAT ATTTATGCCA TGGGCGTGAT GCTGTATGAG
ATGATGATCG GCGATTTGCC GTTCAAAGCC CCAACCGTTG CTGAATTAAC GCATCTGCAT
TGTAACGTAC CGCCGCCAAC GCCCTCAGAA GTACGGCCTG ATTTGCGGCG CGGCTGTGAT
CATGTGATTT TGCGCTGTTT GGCCAAAAAA GCCAATGATC GCTATCAATC GTTCGATGAG
CTAGAACACG ATTTGCAATG GCTGCGCAAA TATCACCTTG GCGAAGAATT GGCGCGGCCA
ACCGTGACGA TTAGCCTTGA AACCCAAGCC GCCGATTTGA ATGCCAAGGG GATTGTGCAT
ATGTCGCTGC ACGAATACAG TGCGGCGGTT AAATGTTTTC GCCAAGCCAC CGAGCTTGAA
AGCAGCCGCG CAACCTATTG GCTCAATTTG GGTATGAGCC AAGTGGCCTT GTTGGCCTAC
AACGATGCGC TCAAAAGCTA TGAACATGCC TTGACTTTAA ATCCAAGCCG CGATGAAGAA
GTTCGTTTGC ACTGGCTTGC TGGTGAATCG CACGAATATC TGTATCAATT GCGCGAAGCC
TTGGAAGATT ATGATGCAGC CTTAAACTTG GACAAAAAAG AGCGGCGGGC TTGGCTGGGC
CGTGGTCGGG TCTATAGCGC TTTGGCCTTG CCCAAAGAGG CCTTTCAAGC CTACGAATAC
GCTTCCAAGC TCGACCCGAA CGATCCGGTT ACCTGGCGCA GCATGGGCTA TGCCTCGCTC
GAATTAAACC AAGCCAAGCA GGCCTTATAC TATTTCGATC AAGCGTTGAA GGTCAACCCA
CGCGATGCAC AATCGTGGTG CGGCAAAGGC CAAGCCTTCC TTGATCTCAA GAAAAACAAC
GATGCCTTGC AAGCCTACGA AAAAGCCTAT AAACTCGACG CGAATTTAGC AGAAGCCGTG
ATGGGCTTGG CCAAAGTGCG TGGCACTGGG GCATTACCGA GGTAG
 
Protein sequence
MAILDKLLSR SDDADNLKGV EFKVGDIIDQ RYLVRNVRKG FMGLVYIARD LRSEQTVAMK 
TFQAKFTWVD SAIANFTREA EVWMRLGSHP NIVEATRIVT IAGRPHIVME FVPGVSLREM
MRRGRLRFKH IVDFAIQICW GMQYAYDRCN LIHRDLKPDN VMVTPEGIAK VTDFGLAQGA
SVSTKVRWGH DSQHNESKIV THASDLFGGS QPYMSPEQRA PGTPLGTTSD IYAMGVMLYE
MMIGDLPFKA PTVAELTHLH CNVPPPTPSE VRPDLRRGCD HVILRCLAKK ANDRYQSFDE
LEHDLQWLRK YHLGEELARP TVTISLETQA ADLNAKGIVH MSLHEYSAAV KCFRQATELE
SSRATYWLNL GMSQVALLAY NDALKSYEHA LTLNPSRDEE VRLHWLAGES HEYLYQLREA
LEDYDAALNL DKKERRAWLG RGRVYSALAL PKEAFQAYEY ASKLDPNDPV TWRSMGYASL
ELNQAKQALY YFDQALKVNP RDAQSWCGKG QAFLDLKKNN DALQAYEKAY KLDANLAEAV
MGLAKVRGTG ALPR
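
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), alternative start codons such as GTG are read as methionine when they initiate translation. The sketch below illustrates this on the first 30 nucleotides of the gene. It is a hand-rolled illustration, not a library call: `translate_cds` and `START_CODONS` are hypothetical names, and the start-codon set is deliberately limited to the three most common ones (table 11 permits others).

```python
from itertools import product

# Codon-to-amino-acid mapping shared by NCBI tables 1 and 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common table 11 start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(nucleotides):
    """Translate a CDS fragment that begins at the start codon."""
    seq = nucleotides.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
head = "GTGGCCATTC TCGATAAGCT CCTCTCGCGC"
print(translate_cds(head))  # MAILDKLLSR
```

The output matches the first ten residues of the protein sequence. For real work, an established library such as Biopython would be a safer choice than a hand-written table.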