Gene Haur_3472 details


Gene Information

Locus tag: Haur_3472
Symbol:
ID: 5735333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4374361
End bp: 4377111
Gene Length: 2751 bp
Protein Length: 916 aa
Translation table: 11
GC content: 51%
IMG OID: 641280619
Product: TPR repeat-containing serine/threonine protein kinase
Protein accession: YP_001546236
Protein GI: 159899989
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
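
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4374361, 4377111)
print(length_bp)                  # 2751
print(protein_length(length_bp))  # 916
```

Both results match the Gene Length and Protein Length fields of this record.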


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00045069
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGATC CCCAGTTGAT TGGGCGCATG CTTAATCATT TCAAAATTGT CGATAAACTT 
GGCCAAGGCG GCATGGCTAT GGTTTATCGT GCTTACCAAG AAAACCTGAA TCGTACTGTG
GCGCTCAAAT TGTTGCCACC AGAGATGACG TTTGATCAAA GCTATATTGC GCGGTTCCAG
CAAGAGGCGC GGGCCGCCGC AGGGCTTGAA CATAGCCATA TTGTGCCAAT CTACGAAGTT
GGCCAAGCTG AAGGCTTTTA CTACATCGTC ATGAAATATA TTGAGGGCAA TACGCTCAAG
GAAAATATCG AGCAAGAAGC CCCAATGTCG GTTCATCGAG TGCTGGAGTT GCTTGAGCCA
GTTGGCAAGG CGCTCGATTA TGCCCATCGC AAGGGTGTCA TTCACCGTGA TATCAAGCCT
TCGAATGTGA TGCTCACGCC TGAAGGCTGG GTTTATTTGA CCGACTTTGG CTTGGCTCGC
GGTGGGAGCA GCGATTCGGG CTTGACCCAA GTTGGCACGG TGATGGGCAC ACCGGAATAT
ATGTCGCCTG AGCAGGCTCA GGGCTTGACA GTTGGCGCTG CTAGCGATCT CTATGCCTTG
GCGGTTATGG CTTACGAAAT GCTGACCAAA CAGATGCCAT TTGTCGCCAA TAATGCCCAA
GCTGTGCTTT TAGCGCGGGT TATTCGTGCA CCACGCGCTC CCAGCGATCT GATTCCGACC
ATGCCATCGG CGGTCGAAGA TGTCTTGATG AAGGCGTTGG CTCGCACGCC TGAGGCACGT
TATCCAACAG CGGCGGCTTT TTTCGAGGCC TTGCGGCAAG CGAGTAATGG TGCACGGCCA
AATGTGGCTG CGGCTACGCC ATTTGCCCAA AATCAGCCAG CCCAATATGC GCCAACATCG
CCGAGTAGCC CACAGGTCTA TCCGCCAACG CCGCTGAGCA ATCAACAGGC GGTTGCGCCG
CATTACCCAC CAACGCCGCT GAGCAATCAA CAGGCGGTTG CGCCGCATTA CCCGCCAACG
CCAGTGAGCA ATCAGCAGGT AATGCCGAAT TATCCACCGA CCAATCCCAG CAATCAACAA
GTGGTGATCC ATAGCCAATC GCCCTATGAT GGCTATGTCG CGGCCAATAC CCAAGCCACT
CGGCCAGCAA TCATGCCAAA TGCGGCCCAG CCAGCACAAT ATAACCAACA GCCAATTAGC
CAACCCAGTC CAGTTGCCTA TACAGGTGCT ACCTCAGTTC TGCGCAACAA GCAAAAATTG
ACGATTTGGG TTGGCCTAGG GGTTTTGCTG TTGGTGGCAG TTGTGGTTGG GGTTATTTTG
GCTTCTGGCA GTGATGCCGA AGATATTATT GCTCAAGGCG ATGCTGCCTT TGAACGCCGT
GGCGGCTTGA TCGAAGCAAT TAATCTCTAC AAAGAAGCGA CCGCTGCTGA TGATGAGAGC
TTCGAGGCCC ACGAAAAACT AGCCATTACC TATCTGATGC GTGGCCAAAC GCCTGATGCC
GATCAAGCAA TTCGTCAAGC AATTGCGATT GATGCCAACC AAGCCAGTGC CCATGCTTGG
CTCAGCCAAG TGCATTCCGA TAATCGTCAA TTTAATGAAT CTTTGGCTGA AGCCGAAGAA
GCCGTCCGTT TGGATGCGAA TCATCCTTTG GCCTATATGG CGCGGGCTAC TGCACGAGCT
GATGTCGGCA ACGAGCAAGG TGATAGCGAG TTGCTGGCCG ATGCCTTGGC CGATACCAAT
AAAGCAATCG AGCTGGCAAC CAATCGTTCA CGCTTTGAGC AAGCTATGGC CTACAGCGCC
AAAGGCTACG TTCAATGGGT CACCTATCAA GATCAAACCA GCCGCGACGC TGGCGCTGGC
AAAGAATTTG TCGTCGATGG GATTGATAAT TTCAATCGGG CGATTGGTTT GCAAGAGCAA
TTGCCGTTGC TGCGTAATAA TATTGGCTAT TTTTACGCCG AACAAGCGCG GGTTGCCCTG
CACCTCGGCG AAGATGAAAC CGCAGCTCAG CGCTTTGAAA AAGCCTATCA ATCATTCGAC
GATGCCCTAG CGCTTGACCC AAATTATGGT TTGGCCTTTG CAGGCAAGGG TTGGACGCAA
ATTTACGAGC GTAAATATGA AGAAGCCCAG CAGTTTTTCG ATCTAGCGCT TGAGCGTAAC
CAGCGTGACC CCAACGCCTT GAATGGACGA GCATTGACCA ACTGGTGGCT GGGTCGCAAC
AACTCCAGCG ATCCCCAAAG CGATTATGCC GCAGCGATTC GCGATTATGA AGCGGCGATT
GCCGAAGCTC CATCGTGGCT ATCGGTCTAT GTCGATTTGG GTTATGTCTA TCTCTACGAC
ACCAAAGATA CTGATAAAGC CATCGAAACC TTTAAAAAGG CTTTGGAACG TGACCCGGAA
TACCCAAATG CAATTGCTGG CTTAGCCGAT ACCTACTACG ACACGCGCTA CTATGATGAA
GCGTTGAAGC TCTACGAACA AACGATTAAT CTCCAGCCTG ATTATGCGAC GGCCTACCTC
GGCAAAGCCA ATATCTTGTA CAATAACAAA GATTATGATG CAGCGATCGA TCAATATAGC
ACGGCGCTTG ATTATAATCC CTCGTTGAAA AATGCTTATA TTGGCAAAGC CTATTGCTAT
CAAGCCAAAG GCGATATCGA CGAGGCTCGC CAAGTTTTGC AAGATGGATT AGAATCAGTG
GCCTATGTTG ATCAATCCGA ATTGCAAACT ATTTTGGATA AGATGAAGTA A
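
The gene sequence above is printed in space-separated 10-base blocks, so it has to be cleaned before any computation. The following is a minimal sketch, using only the Python standard library, of how one might strip the layout, compute a GC percentage to compare with the GC content field, and check that the sequence looks like a complete CDS. The helper names are hypothetical, the short sequence in the usage lines is a toy input rather than data from this record, and the start-codon check assumes ATG (as in this gene), although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG (assumed
    start codon) and it ends with a table 11 stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input, not taken from this record.
toy = clean_sequence("ATGGCC AAA\nTAA")
print(toy)                           # ATGGCCAAATAA
print(looks_like_complete_cds(toy))  # True
```

Applied to the full gene sequence, `len(clean_sequence(...))` would be expected to equal the 2751 bp reported under Gene Length.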
 
Protein sequence
MQDPQLIGRM LNHFKIVDKL GQGGMAMVYR AYQENLNRTV ALKLLPPEMT FDQSYIARFQ 
QEARAAAGLE HSHIVPIYEV GQAEGFYYIV MKYIEGNTLK ENIEQEAPMS VHRVLELLEP
VGKALDYAHR KGVIHRDIKP SNVMLTPEGW VYLTDFGLAR GGSSDSGLTQ VGTVMGTPEY
MSPEQAQGLT VGAASDLYAL AVMAYEMLTK QMPFVANNAQ AVLLARVIRA PRAPSDLIPT
MPSAVEDVLM KALARTPEAR YPTAAAFFEA LRQASNGARP NVAAATPFAQ NQPAQYAPTS
PSSPQVYPPT PLSNQQAVAP HYPPTPLSNQ QAVAPHYPPT PVSNQQVMPN YPPTNPSNQQ
VVIHSQSPYD GYVAANTQAT RPAIMPNAAQ PAQYNQQPIS QPSPVAYTGA TSVLRNKQKL
TIWVGLGVLL LVAVVVGVIL ASGSDAEDII AQGDAAFERR GGLIEAINLY KEATAADDES
FEAHEKLAIT YLMRGQTPDA DQAIRQAIAI DANQASAHAW LSQVHSDNRQ FNESLAEAEE
AVRLDANHPL AYMARATARA DVGNEQGDSE LLADALADTN KAIELATNRS RFEQAMAYSA
KGYVQWVTYQ DQTSRDAGAG KEFVVDGIDN FNRAIGLQEQ LPLLRNNIGY FYAEQARVAL
HLGEDETAAQ RFEKAYQSFD DALALDPNYG LAFAGKGWTQ IYERKYEEAQ QFFDLALERN
QRDPNALNGR ALTNWWLGRN NSSDPQSDYA AAIRDYEAAI AEAPSWLSVY VDLGYVYLYD
TKDTDKAIET FKKALERDPE YPNAIAGLAD TYYDTRYYDE ALKLYEQTIN LQPDYATAYL
GKANILYNNK DYDAAIDQYS TALDYNPSLK NAYIGKAYCY QAKGDIDEAR QVLQDGLESV
AYVDQSELQT ILDKMK