Gene Haur_3486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3486 
Symbol 
ID5735347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4390729 
End bp4393698 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content54% 
IMG OID641280633 
Productserine/threonine protein kinase 
Protein accessionYP_001546250 
Protein GI159900003 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00873447 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCAT TTGAACATAC CCAACTGGGT GAGTATACAT TGCAAGAAGA GATTGGCCGT 
GGCGGGATGG CGCAAGTCTA TCGAGCGCAG CATCAAGCCT ATGGTCTGGT TGCCTTCAAA
GTATTACCAC CCTATTTTGC CCACGATAAC GATACACTTC AGCGTTTTAT GCGTGAGGCG
CGGGCCAATC GTACCTTGCA CCACCCGCAC ATTGTGCAAT TGTATGAAGC CAGCGACATT
GCCCAAGCCC AAAACCCCTA CCAGCCGATC CACTATATCG CAATGGAATA TATCGCTGGG
GGCACGCTGA CCGATCGCTT GCGTCAACAG CCGCAACAGC CATTAAACTC GACGCTTGAA
ATGGGTGAGC AAATTGGCTC AGCCTTGGAT TATGCCCATG GCAAGGGTTT TATTCACCGC
GATATCAAGC CGAGCAATAT TTTGTTTCGC AGCAACGGCC ATGCTGTGCT CGCCGACTTT
GGGATTGCCC TCGCCAACAA CGAGGCCCGC ATGACCAAGG CTGGCGGTTT TGCTGGTACG
GTCGCTTACA CTGCGCCCGA AATTTTCGAG GGCGAAACCG CCGATGTGCG CTCGGATATT
TATGCTTTGG GCTTGATTTT GTATGAATCG TTGGCTGGGC ATAACCCCTA TGCCAACATC
AGCACTAATG CCCAAATTGC CATGAGCAAA ATTTTAACGA CACCGCTGCC ACCATTGCAA
GATGTTGCGC CGCATGTGCC TCCGTTGACG GCTGAAATTC TGGCTCAAGC AACCGCCAAA
GATCCAATTC GGCGCTTTGC TACAATGTCG GATTTTGTTG AGGCCTTGAA ACAAGCTAAA
TTCAATCGGG TCAGCGATCG ACCTGCGCAA CAATTGAATG CCAGTGGTCG CCCGATGATT
CCAATTGCTG GTCGGCCTGG AGCGCCAAAT CGCCCACCAC CACGCAATCA GCCAAGTGGC
GATAACCCAA CGCAAGCCTT TGTAGCTGGG GTTGCTGCTT CATCTGCGGC GGCTGCGGCC
ATGCCTGAGT CTAATGAAGG CACGATGATC TACACGCCAC CAACCCCAAA TCCGGTGGTG
AATCAACAGG CTGCGCCGCC ACCAAACGAG CCAACCCAAA TCTACACGCC ACCAACCCCC
AATTCGGTGG TGAATCAACA GGCTGCGCCG CCACCAAACG AGCCAACCCA AATCTACACG
CCACCGACCC CGAATCCGGT GGTGCAACAG GCTGCGCCAG CGGCCAAGCC GCCAGTTGAT
GCGACCCAAA TCTATACGCC GCCGACCCCG AATCCGGTGG TGCAACAGGC TGCGCCAGCG
GCCAAGCCGC CAGTTGATGC GACCCAGATT TATACGCCAC CAGCGGCTAA CCCACCGTTA
CGGCAAGCAC CGATTACCCA GCCCAATCAA CCCTTGCCAA GTGTCCAATC GCAGGCCAAC
CAGCCGTTGC CAGCTGCGCC AACCATGCCG CTGGATAACC AAAGTACAAT GATGTACACG
CCAACTGCAA GCGTGCCCAA TCGGCCAATT CCAAGCCGAA CCAACCAGCC AGTTGATAAC
GAAAGCACCG TGATGTATCC GGCCTATCCG CCGAATCAGC CACAAGCCTA TCCACCGCCA
GCAACTGGTA ACCAAACCTA TATGCCGCCG ACTGGCAATC AGGCCTATCC ACCAATGGCT
GGGGCTTCGC AGCCAAACCA ACAACTGCCC AAAAAAGGGC CAGCGCGGCC AATTAACGCC
CAACGGATTG AACTGGCGCG ACCTGCGGCT CCTGTTCAAG CAACTCAGGC GGCGGAAACT
GACTATGCGC CTGTTCCACC GCCCTATGCC ATGCCGCCAA AAGCATCGCC ACTTGCCAAA
GCCAAGGATT TCTTCCGTTC GCCAAAGGTT CTCGTGGCCT TAGGTAGCGC ACTTTTAGCC
TTGATTGTGA TTGCATTTTT GAGCTTTGGC CAAGGTGAAG ATGACCCAAA TCTTGCTAGC
GGCGCTACAG CTACCGCAGA AGTTGCGGTT GATGAGAGTA CCCCAACGGT TGCGGCAACC
AATGCGGCCA CTCCAACTGT GGCCGAAACT GCCGATCCCC AAGAGGCCAA ACGCTTAGAA
TATATCGTTG GGCAAGAAGC TTACGCTAAA CAAGATTGGC CAAATGCAGC AGCGGCCTTT
GACAAAGTGC TCGCCATCGA TCCAGCCTAT CTTGATTTGA GCAACATTGG CTCGGCAACT
TACTACAATT GGGCGGTCAG CGAATTAACT GGCCCTGAAA ACGTTGCAGA AAGCCTTGAA
ATTCTCAACA AAACCTTTGG TTTCAAGCCC GACCATCAAC CAGGTGGCAA CTTAGCCAAA
GTGCTGAATT TCTATCGCAA TGGCCAAACT GCCGCCGAAC AACAAGATTG GCAAGCAGCG
ATTAATGGCT ATAAAGAAGC CCAAACTGCT GGCAGCGGCG AGTTTGGCGA AATTATGGGT
AAGCTACAAA CGGTCAAGCA ATTGTATGAA GCCTATCTTG GGCGCGGGCG CGAGCTAGAG
GAGGCTGGTA ACGAAACCGA AGCCAATGCA ATCTATCGTG AAGCTGCCGC CCTCAAGGAT
CTGGACAACA GTTTAGATGT TGCAGCAGCT AATACAGGCA TTGCGGCAAC CCAACCAACT
GCGGTGCCAG CGACCCCAAC GAAGCCAGCA CCACCAACGA CTGCCCCGGT AGCTCAACGC
TTGTACTTCC AAAAATATGC CGAAAACGCG GTTGATCCAA CCTGTTTTGC GGTGCATATC
CGTGGGGTTA ACACGGGCGG CTGGTTTGTG ACGGTCGATG GACTGGGCAA TCGCGGCAAT
GTCGATGGCG CTGGCAACAC CAACGTTTGC GGATTGTCCC CCAGCCAAGA AGTGACCTTC
ACGGTTTACA ATGGCTCTGG GCAGGCTGTG CCAGGTGGCG GCGGCATCCC GACCCGTGGC
GGCGATTTGA TGACCGGCTA TTGGCAATAA
 
Protein sequence
MKSFEHTQLG EYTLQEEIGR GGMAQVYRAQ HQAYGLVAFK VLPPYFAHDN DTLQRFMREA 
RANRTLHHPH IVQLYEASDI AQAQNPYQPI HYIAMEYIAG GTLTDRLRQQ PQQPLNSTLE
MGEQIGSALD YAHGKGFIHR DIKPSNILFR SNGHAVLADF GIALANNEAR MTKAGGFAGT
VAYTAPEIFE GETADVRSDI YALGLILYES LAGHNPYANI STNAQIAMSK ILTTPLPPLQ
DVAPHVPPLT AEILAQATAK DPIRRFATMS DFVEALKQAK FNRVSDRPAQ QLNASGRPMI
PIAGRPGAPN RPPPRNQPSG DNPTQAFVAG VAASSAAAAA MPESNEGTMI YTPPTPNPVV
NQQAAPPPNE PTQIYTPPTP NSVVNQQAAP PPNEPTQIYT PPTPNPVVQQ AAPAAKPPVD
ATQIYTPPTP NPVVQQAAPA AKPPVDATQI YTPPAANPPL RQAPITQPNQ PLPSVQSQAN
QPLPAAPTMP LDNQSTMMYT PTASVPNRPI PSRTNQPVDN ESTVMYPAYP PNQPQAYPPP
ATGNQTYMPP TGNQAYPPMA GASQPNQQLP KKGPARPINA QRIELARPAA PVQATQAAET
DYAPVPPPYA MPPKASPLAK AKDFFRSPKV LVALGSALLA LIVIAFLSFG QGEDDPNLAS
GATATAEVAV DESTPTVAAT NAATPTVAET ADPQEAKRLE YIVGQEAYAK QDWPNAAAAF
DKVLAIDPAY LDLSNIGSAT YYNWAVSELT GPENVAESLE ILNKTFGFKP DHQPGGNLAK
VLNFYRNGQT AAEQQDWQAA INGYKEAQTA GSGEFGEIMG KLQTVKQLYE AYLGRGRELE
EAGNETEANA IYREAAALKD LDNSLDVAAA NTGIAATQPT AVPATPTKPA PPTTAPVAQR
LYFQKYAENA VDPTCFAVHI RGVNTGGWFV TVDGLGNRGN VDGAGNTNVC GLSPSQEVTF
TVYNGSGQAV PGGGGIPTRG GDLMTGYWQ