Gene Haur_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3039 
Symbol 
ID5734911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3837286 
End bp3839493 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content53% 
IMG OID641280183 
Productserine/threonine protein kinase 
Protein accessionYP_001545805 
Protein GI159899558 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000574662 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTCGT TTGAACATAC ACAGTTGGGC GAGTATAGCT TGCAGGATGA AATAGGCCGC 
GGCGGGATGG CACTCGTCTA TCGGGCGCAT CACCCCGATT ATGGCTCGGT AGCATTTAAA
GTGCTACCCC CCTATTTTGC CCACGATACC GACACGCTCC GACGATTCAT GCTCGAGGCA
CGGGCAATTC GTGAACTGCA CCACCCACAT ATTGTGCAGC TCTACGAAGC CAGTGATATT
CCTGACCCAA ATAATAGTCG TCAACGCATC CATTACATTG CAATGGAGTT TATTGGCGGT
GGCACACTCA GCGAACGCTT ACATACGCAA CCACGCCAAC AACTCAATCC AACGATTGAG
ATGGGCTTGC ATATTGGTTC GGCCCTCGAT TACGCTCACC GCAAGCAGTT TATTCACCGC
GATATTAAGC CAAGCAATAT TCTGTATCGC CACGATGGTC ATGCAGTTTT GGCCGACTTT
GGGATTGCCC GGGTCAGCAA TGAAGCCCGT ATGACCAAAA CTGGTGGTTT TGCTGGGACG
GTAGCCTACA CCGCGCCCGA AATTGCTGAA GGCCAAATTG CCGATGCCCG CTCCGATATT
TATGCCTTAG GCTTGATTCT CTATGAAGCC TTGGCAGGCA AAAACCCATA TGCCAACATG
CATGCCAATA TTGCTGTTGC ATTAAGCAAA ATTATTAGCA CACCTCTGCC GCCATTACGC
GAATTGGCTC CGCATGTGCC ACCACTGACC GCCCAAATTA TCGAGCGGGC AACCGCCAAA
GACCCGGAAC GACGCTTTGA GAATATGTCT GATTTTGTTG AAGCCCTCAA ACAGGCAAAG
TTTGGGCGAT CAGCAACCAA ACCTGATGTG CAGTTGAATG CCCATGGCAA ACCCATGATT
CCAATTGCCC GACCAAGTGG CAGTCGGACT GAACGTAGTT CTGAAGGCGA TTCAACCCGC
ACCCAGGTGT TTAACCCAGT GGTTGGGGCT GCCAGCGCCG CTAATGTTGC AATCTCAAAG
CCAAACCAAG GTTTGGCCGG CCCAGCAAGT GGCGCAAACA TGGCCTTGCC TGTTGAAGGA
AGCCAAATTT ATACGCCACC AGCAGGCAGT GGCCCTAACA GCCGCATCAA TCAGCCCTTG
AGTGGCGTAA ATCAACCCTT ACCAATGGGA GCCAACTCAC AGGCCAATGT GGCGTTACCC
AACGACGGGA CCCAAATCTA TACCCCTGAG CCAGTTGGCG CACTCAGTGG AACCAATATG
GCACTCACTG GCGAGAGCAC GCAAATGTAT GCACCAGTGT CAGGAGTAAA CCAACCCTTA
TCAGGGGTGA ATCAACCACT CACTGGCCAA AATCCAGTGC GTGGGCCAAG TTCACGGCCT
AATCGCACCG TCAATGCACG GCCCATTGAG GTTGCCCGGC CTGGCACGGC GGTCAATCAA
AATCTTGGCC CTAGCTCACA ACCAAATCTC GGTTTTGACG GCGGCTCAGG CACATTTGCG
GTTAATCGGC CAAATAACAA TAAAAAGAAG GCCTTGATCG CAGGCATCGG CGGCGGTATT
CTCGGCCTGA TATTAATTGG CGTGTATATG CTATCAAGCA GCAACACTGT CAATCCCAAC
GATACCAATG GTACGACAAA CGCCGTGCAA TTTACCACCG ATGGCAGTGC TACAGCCACC
AGTGCAAGCA CCAGCAATGG CAATGGCACA CCTGTTGGTA CTCAGCCAGC GGTTGTGGTT
GATCCAGGTG TTGCAACCAG TACCTTGGCT CCAACACCTG AAAATAGCCC AACGCCTGAA
CCAACCGATA CACCTGTAGC CACGGCAACC TCCAATGCAG TTGTTGTAGC AACGACGCGT
CCGCAAGCAA CGAATCGCCC ACGCCCAACT AATACGCCAA GCCAGCAACA AGCATCTCCT
GTCCCACCAA CCAATACACC ACCACCAGCC GATAGTGATG GCGATGGCGT GCCCGATGAA
GTTGATGGCT GTCCTGGTGT GGCTGGGCCA AATAATGGTT GTCCAATACC AACCGAAGTT
CCACAGCCAA CACCAATCCC TGACTCGGAC GGTGATACAA TTCCCGATAA CGTCGATAAT
TGTCCCAATG AGCCTGGTGA TCCAGCTCGG GGCGGGTGTC CAAAACCACC AGCAACCAAT
ACACCACGGC CAACCGAAAA ACCAACACCA CCGCCGATTA ATCCATAA
 
Protein sequence
MESFEHTQLG EYSLQDEIGR GGMALVYRAH HPDYGSVAFK VLPPYFAHDT DTLRRFMLEA 
RAIRELHHPH IVQLYEASDI PDPNNSRQRI HYIAMEFIGG GTLSERLHTQ PRQQLNPTIE
MGLHIGSALD YAHRKQFIHR DIKPSNILYR HDGHAVLADF GIARVSNEAR MTKTGGFAGT
VAYTAPEIAE GQIADARSDI YALGLILYEA LAGKNPYANM HANIAVALSK IISTPLPPLR
ELAPHVPPLT AQIIERATAK DPERRFENMS DFVEALKQAK FGRSATKPDV QLNAHGKPMI
PIARPSGSRT ERSSEGDSTR TQVFNPVVGA ASAANVAISK PNQGLAGPAS GANMALPVEG
SQIYTPPAGS GPNSRINQPL SGVNQPLPMG ANSQANVALP NDGTQIYTPE PVGALSGTNM
ALTGESTQMY APVSGVNQPL SGVNQPLTGQ NPVRGPSSRP NRTVNARPIE VARPGTAVNQ
NLGPSSQPNL GFDGGSGTFA VNRPNNNKKK ALIAGIGGGI LGLILIGVYM LSSSNTVNPN
DTNGTTNAVQ FTTDGSATAT SASTSNGNGT PVGTQPAVVV DPGVATSTLA PTPENSPTPE
PTDTPVATAT SNAVVVATTR PQATNRPRPT NTPSQQQASP VPPTNTPPPA DSDGDGVPDE
VDGCPGVAGP NNGCPIPTEV PQPTPIPDSD GDTIPDNVDN CPNEPGDPAR GGCPKPPATN
TPRPTEKPTP PPINP