Gene Haur_3947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3947 
Symbol 
ID5735808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4948836 
End bp4950644 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content51% 
IMG OID641281098 
Productserine/threonine protein kinase 
Protein accessionYP_001546709 
Protein GI159900462 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0694263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTTC TTCAACCAGG CGTTGTGCTG AACCAACAAT ATGAAATTAT TAAAAAGGTT 
GGTGGTGGTG GTGGCGGCAA TGTGTATGAG GCACTTGATC TTAATTTAGG CCGCTCAGTC
GCGATCAAAC ACTTGATTTT GCCAAGTGAT GATGCAGTCA AGCATTTTAA GAAGGAAGCA
CGTTTATTAG CTCGCCTCGA ACATCAATGT TTGCCACGGG TCTATAACCA ATTTGATGAA
GGCCCAAGCC AGTTTCTGGT GATGGAATAT ATTGATGGCA CTGACCTCGC CGAATACCTC
AATCAGCAAG CTGGCCCCTT AGCGCTTCAA CAGGTCTTGC CATGGGCTGA TCAATTGTTG
AAGCTGCTTG AGTATTTGCA TACGGCGCAC GAACTACCAA TTATTCACCG CGATATTAAG
CCAGCAAATA TTAAATTAAG CGCATCGGGC CAGTTGAAAT TGATCGATTT TGGCTTAGCC
AAAAGTGCTC TATCAACCCA AATTAATAAT GTCTACCAAA GTGTGCGTGG CTATACTTCG
ACCTACGCGC CCTTAGAGCA GCTTGTGCCT AACGATAATG AACACACCGA TGAGCGCAGC
GATATTTTTG CCTTTGGTGT GACACTCTAC CACTTGCTGA CTGGCAAGGT TCCAGTGAAT
GCTCAGCAGA AGCCGATTGA TGCGACCAAT CGTCATGCCA TGGTCATTCA AGCGAAACCC
GACCCTATCG TTCGACCAAA GCTGTTGAAT CCGACGATTC CTGACTATGT TGATGCTGCG
ATTATGAAGG CAATGGCGAT TAATAAGCTT GATCGGTTTG GCTCGGCAGC CGAATTTCGC
CAAGCCTTGA ATCCTTCTGT CGCAGTAAAA CTAAGCTCGG GGATCTCAAG TTTGCTTGGT
TCGCAACGCC GCCCAACCCA ACCTGTTGGC GGGCTATCGA GTCAACCAGC GCCGGCTAAA
ATCAGCCAAC CACTTCGTTC AATCCCTAGT ACGTCGATTG CTGATCAGGC GCAATCGACC
TTGCCAACGC CATTGGTACA AAACCCAAAC GCTCAAGCAA AGCCACTCAA ACGCGTTGCG
CCATGGTTAA TGTTGATCTT GGTTGGTTTG ATTGGTGGTG GGTGGTGGTG GGTTAAGGCT
AATCCAACCG GAGCTTCAGC GAATGGTGCG ACCCAAACCC AAGCAGCTAA TTTGGTTGTT
AATCAAGTTC CGGCTCTGCC ATCAACCACA CCAACCGATA CTGGGAGCAC TGTTGGGCAA
CCGACAATCA CGCTGGGCTT AGCCACGGCA ACCCTCGTTG AGCCAAGTGC GACGGCAACT
GTTGAGCCTG CAACTGCTGT GGTAGTTCAA CCAACTGCAC GGCCTGCAAC GGCGCGGCCT
GTGGTGCAAC CAACCGCGCG ACCTGCGCTA CAACCAACCA ATCCACCAGT TGTCCAGCCA
ACCAATCCAC CGCCACCTCC AAGTGACCGC GATGGCGATG GTTTTATCGA TGATGTTGAT
GCCTGCCCCG ATGTGGCTGG GCCAAATAAT GGCTGCCCTG TGGTCGAGCC AACCGCGCCA
CCAGCAGTTG CTGATACTGA TGGCGATACG ATTCCCGATG ATCGCGATGC TTGCCCACGT
GAGCCAGGCG ATCCTTCGCG CAATGGCTGC CCGAAGCCAG TAGAGCAGGC TACTAATACA
CCGCGGCCAA CCGAAACGCC GAATCTATCA ACCGATCGGG CAACGCCACG GCCAACCAAT
ACGCCAACAC CAAATATTAA TCCAAATAAT CCACCGTCAC CACCAACATC ACCACCAATG
ACTCCATAA
 
Protein sequence
MTVLQPGVVL NQQYEIIKKV GGGGGGNVYE ALDLNLGRSV AIKHLILPSD DAVKHFKKEA 
RLLARLEHQC LPRVYNQFDE GPSQFLVMEY IDGTDLAEYL NQQAGPLALQ QVLPWADQLL
KLLEYLHTAH ELPIIHRDIK PANIKLSASG QLKLIDFGLA KSALSTQINN VYQSVRGYTS
TYAPLEQLVP NDNEHTDERS DIFAFGVTLY HLLTGKVPVN AQQKPIDATN RHAMVIQAKP
DPIVRPKLLN PTIPDYVDAA IMKAMAINKL DRFGSAAEFR QALNPSVAVK LSSGISSLLG
SQRRPTQPVG GLSSQPAPAK ISQPLRSIPS TSIADQAQST LPTPLVQNPN AQAKPLKRVA
PWLMLILVGL IGGGWWWVKA NPTGASANGA TQTQAANLVV NQVPALPSTT PTDTGSTVGQ
PTITLGLATA TLVEPSATAT VEPATAVVVQ PTARPATARP VVQPTARPAL QPTNPPVVQP
TNPPPPPSDR DGDGFIDDVD ACPDVAGPNN GCPVVEPTAP PAVADTDGDT IPDDRDACPR
EPGDPSRNGC PKPVEQATNT PRPTETPNLS TDRATPRPTN TPTPNINPNN PPSPPTSPPM
TP