Gene Haur_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3946 
Symbol 
ID5735807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4946861 
End bp4948624 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content53% 
IMG OID641281097 
Productserine/threonine protein kinase 
Protein accessionYP_001546708 
Protein GI159900461 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0106209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCCAA ATGGCATGAT CTTGAATGAA ACCTATGAAA TTCGTGAGCG GATTGGCCAT 
GGCGGCGGGG GCGCGGTCTA TGCTGCTTAC GATATTAAGT TGCGCAAAAT GGTGGCGCTC
AAACATTTGC ACCTTGAGGG TGACCATGCG ATCAAAGCCT TTGGCCATGA AGCTAGCCTG
TTGGCGAACT TAGAACACCC AAGCCTGCCC AAAGTCTATT TTAACTTCAA TAATGATGCT
GGCCAATTTT TGGTGATGGA TTTGATCGAT GGTGACGATT TGAGCACGAT GCTTGATCGT
CAAGGTGGGC CGTTGCCGGT TGAGCAAGTG TTGGCTTGGG CCGATCAATT ATTGAGCGTT
TTGACCTATC TGCATACATT TTATCGCCAG CCGATTATTC ACCGCGATAT TAAACCTGCC
AATATCAAAA TTACTAGCCG TGGCGAATTG AAATTGTTGG ATTTTGGCTT GGCAAAGGCC
GAGATTTTTA CCAATATGCG CAGTGCAATG CATAGTGTTC ATGGCTATAC CTTGACCTAT
GCACCACCTG AGCAGATTAA TCAAGAAGGC ACTGATCCAC GGAGTGATTT GTATTCGTTG
GGGGCGACGC TCTATCATTT ATTAACTGGG CGCACTCCAG CTGATGGTGA TGGCAAAACC
GCCGATGCTT TAGCCCGAAC CTTGGCGATG GCTAAACGCA AGCCCGACCC AATTATTGCC
CCCAAAAACT TCAATCCATC AATCCCTGAG CATGTCGATC AAGCGATTAT GCGTTCGTTG
GAGATTGATG CTGATGAGCG TTTTGCGTCG GCTGAGGAAT TTCGACTCGC CTTGGCCCAA
CCCTATGCGC CAGTCAATAA TGCCCAAACC AAAATTTTGC CATCGACAAT TAGCTCACGG
GCGATTACTG GGCCGCCCAA ACAACTAACC ACCCGCGAGC ATAGCCAACC AAGCCCAATC
ATCAGCAAAC CGCGAAGCAG CTATCCTGGG CAACCGTTGC CAGAAGCTGG GCCGCAGGCG
CAGCCAACCC CAAACTATAA ACGCTATTGG CCATTATTGC TGATTGTGTT GCTCGGCGGG
GCTGGTGGTG GTTGGTGGCT TAACCGCGAT CAGCAAACGC CAGTGGTCAC GCTTGAGCAA
AACACGAAAA CAGTGGTAGT AGCTCAGCCA AGCCAGTCCG CACCAACGAA TACCTTGATG
GGCGTGGCGG GACAACCAAC GATTACCTTG GGTGTTGCAA CCAATGAGCC AACGAGTGTC
GCCGTCGAGC AGGCTACAGC AACTAGCGAG CCAGCCCAAC CAACGACAGC GACCCAGCCA
ACTGCTCAGC CCGCTGCAAC CCCTCGCCCT GCAACCGCAA GACCAGCCAC TGCACGGCCT
GCGGCAACCA ACCCGCCTGT GGTGCAGCCG ACTAATCCAC CAGCAGTTCA GCCAACCAAT
CCACCTGCTC CTCCTATCGA TCGTGATGGT GATGGTGTGA CTGATGATGT TGATGGTTGT
CCTGATGTGG CTGGCCCAAA TAATGGTTGC CCTGCCCCAG TTGAGCCGAC CAATCCACCA
GCAGTTGCTG ATAGCGATGG CGATACGATT CCCGATGATC GCGATGCTTG CCCACGCGAG
CCTGGCGACC CATCGCGTAA TGGTTGCCCC AAGCCAGCTA ATACGCCAAC TGATCGACCA
ACTGATCGAC CAACTGATAG GCCAGTGCAA CCAACCGAGC GTCCAACCGA TAAACCAACC
GCCAAGCCGC CAGTAGTGCC TTAA
 
Protein sequence
MFPNGMILNE TYEIRERIGH GGGGAVYAAY DIKLRKMVAL KHLHLEGDHA IKAFGHEASL 
LANLEHPSLP KVYFNFNNDA GQFLVMDLID GDDLSTMLDR QGGPLPVEQV LAWADQLLSV
LTYLHTFYRQ PIIHRDIKPA NIKITSRGEL KLLDFGLAKA EIFTNMRSAM HSVHGYTLTY
APPEQINQEG TDPRSDLYSL GATLYHLLTG RTPADGDGKT ADALARTLAM AKRKPDPIIA
PKNFNPSIPE HVDQAIMRSL EIDADERFAS AEEFRLALAQ PYAPVNNAQT KILPSTISSR
AITGPPKQLT TREHSQPSPI ISKPRSSYPG QPLPEAGPQA QPTPNYKRYW PLLLIVLLGG
AGGGWWLNRD QQTPVVTLEQ NTKTVVVAQP SQSAPTNTLM GVAGQPTITL GVATNEPTSV
AVEQATATSE PAQPTTATQP TAQPAATPRP ATARPATARP AATNPPVVQP TNPPAVQPTN
PPAPPIDRDG DGVTDDVDGC PDVAGPNNGC PAPVEPTNPP AVADSDGDTI PDDRDACPRE
PGDPSRNGCP KPANTPTDRP TDRPTDRPVQ PTERPTDKPT AKPPVVP