Gene Haur_5226 details


Gene Information

Locus tag: Haur_5226
Symbol:
ID: 5737184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 326722
End bp: 329787
Gene Length: 3066 bp
Protein Length: 1021 aa
Translation table: 11
GC content: 55%
IMG OID: 641282390
Product: non-specific serine/threonine protein kinase
Protein accession: YP_001547981
Protein GI: 159901735
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
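
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record, so both are labeled as assumptions in the code.

```python
# Values copied from the Gene Information fields above.
start_bp = 326722
end_bp = 329787
gene_length_bp = 3066
protein_length_aa = 1021

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```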


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGTGCTC TCACGCACGA TCAGGCCGTT TTTGAGGTCG CTGGGAGCCA AAAGAAACCC 
TACACCGTGA CGATCACGAG TGATCCCTAT GGGGTGATCA CCGATTGTAC CTGTCCGCAC
GCCGCCAAAG GGGCGCTGTG CAAACATGTG GTGGCTGCCG CGCATACCCT CGCTGATCAC
CTTATCGCGC ATCCCCTCAA CCCATGGCGA TCCGTATTTG CCAGCTTCCA AGCGCTGCCG
CGCCGTCGCC CCACCCCGAT GCGGCTCGTC TTTAGCTTGC TCGAACGGGG GGCAACATGG
ACGCTCATGC CCTATACCAT TGCTGAGCGC GTGATTCCCC AAGCCGTGAT GGACGATCCG
GTGGCGTTGC ACCGGCACTT AATGACAAAG GATGTGCTCA TCCAAGCCAA AATTCCCCGT
ACCCCACTCC ACCCATCACA CGTTCCCTCC ACCGCAGAAG CAGCATTAGT CGTGGCAAAC
ATGATCATCA CGGTTTCCCA TCAGTTTTTC TATGGAGCGG CCTTTCCGCT TTCCACCGTG
CTGGATAGTG TGGTGCAATT CCTTCCCAAT ACCATTCTTT TCATGGGCGA TGATTACAAC
CCATTCCGTC GCATGGTGCA GGTTTGTTCA GACCATGCGC AGATCCACCT CCATGGCACT
GCCACTGCCG ATGGAATCAC GATCACGCCC ATGCTGCACG TCGGCGACAC GCGTGCTACC
CTTGATACCG CCGATGTGCG CATGCTCCAC AGTGCCCCGC CATGGGCGAT CTATCGCACC
ATGCTCGTGC GCTATGACGA TCCGGGGAAC ATCCTCACCT TGTTTGGGCG CTATCAACAC
CTGCATATTC CAGCAGCCGA TTATCCTGAA TTTATCGAAC GCTATCTCAT GCCGCTCGCT
GAGCATACCC CTTTGAGCGG CGATTTACCC CAGCAAGCAT TGGTTGCAGC TGATCCCCAG
CCACGGGTGT ATCTTCGTGA ACACGAATCG ACCGTTTTTG CCGAATTACG CTTTGGGTAT
GCCGATCATG AAGTGGCATA CGACCAACAG CTTCCCGCAG AAACCATGCG GTATCACCCC
GACCACGCAA CTATCCTGCG TATCCAACGC CACTCAACCA GCGAGGAGCA CGCGTGGGGC
GAGTTGCTAC GCCATGGCCT CAAACGTGGG CCACAAGCCG GGGTTGCCAC CCTCCGCAGT
GGGACAACCG TCGCGACCTT TTTGCTCACC CATGTTCCCG CATTAGTGGC CATGAATTTC
AGCATTTATG GTGAGGAATC ATTACTTGGT GCACGCATAA ACCGTCACAC GCCCAGCATC
AGCTTGCGGG TGTCATCTGG TATCGATTGG TTTGATCTGG AGGCGGTGGT GCGTTTTGGC
GAAACGGAGC TGCATCTGGC TGAGTTCCGG CGGGCAATTC GCAAACGCGA ACGCTATGTC
CAACTGGCCG ATGGCTCGCT TGGGGCCATT CCTGAGGCAT GGCTCGATCG CTACCGCCAT
CTCTTTGCGT TTGGCGACCT GCATGCCGAA ACCTTGCGGT TCGCGCCAGC ACATAGTACG
GTGGTGGATG CGGTTCTCGA GGAAACGGAT CAGGTTGATC AGCGCTTCAA AGAGCGCAGC
GAACGCTTAA AGACGATCAA TGGCATCGAA GCGCAGGCGC TTCCCGCAGG TTTTGAGACG
GTGTTGCGTC CCTATCAAAA AGCGGGCTAC GATTGGCTGC ACTTTCTGTA TGCGTATGGG
TTTGGGGGAT GTTTGGCCGA TGATATGGGA ACAGGGAAAA CTATCCAAAC ACTCGTGTTC
CTTCAGTCAT TGGTATCGCG AGGGCAAACG TCGGCCAGCA GCCTGATTGT CATGCCACGG
TCGTTGATTT TTAACTGGCA ACGGGAAATC GCCCGTTGGA CTCCCGAGCT GCGGGTCTTG
GTTCACACCG ATCACGGTCG GGCTGACACC ACCGCAGCCT TTGCCAATGC TGATCTGGTG
CTCACGACCT ATGGGACGCT GTTACGCGAT CATGACCTGC TTGCTACGTA TCAGTTTTCC
TATGTGGTGC TTGACGAAGC CCAAACGATC AAAAACCCCG TTTCCCAAAC AGCGCGTGCG
GTTCGCGCCT TGCGGTCAGA GCATCGCCTC ACCTTAAGCG GGACACCCGT CGAAAACTCG
ATCATGGAGC TTTGGTCACA GTTTGCCTTT TTGAATCCCG GCATGCTGGG AAGCCTCGAT
CATTTTCGCA CCGAATTTGC CACACCCATT GAGCGGGACG GTGATGCGCA CGCCGCGCAG
TTGCTGCGCC GTATGGTCAA TCCATGTATC CTGCGGCGCA CCAAAGATCA GGTTGCCCCG
GATCTCCCAG CGCGGAACGA ACGCATCCTC TATTGCGCTA TGGAGGCGGC GCAACACAAA
CTGTATCAAC GCTATCGCGA CCAATATCGT GCGCAGTTGC TTTCGTTGAT TGATGATCAC
GGGATGAATG ACAGCCGCAT GAAAGTTTTA GAGGGGCTAT TGCGGCTCCG CCAGATTTGT
AATCACCCGC GTTTGGTTGA ATCGACGTTT CGTGGCCGCT CGGCCAAGTT TGAACAGCTT
CTCGAAACCT TGGCCATTTT GCAGGCTGAA GGGCATAAAG CCTTAATCTT TTCGCAATTT
GTCCAGATGT TGACGATCCT GCGGGAACAC CTTGATCAGC AGAACGTAAG CTATACCTAT
CTTGATGGGA AAACCCAGAA TCGAGCCGCC GTGGTGGATC GCTTCCAAAC CGATCCCCAT
GTCCACTTCT TTTTGATCAG CCTCAAAGCG GGCGGGGTTG GACTGAATCT GACGGCTGCC
GATTATGTGA TTCACATCGA CCCATGGTGG AATCCTGCGG TTGAGCAACA AGCAACGGAT
CGGACGCACC GCATTGGTCA AGAGAAGCCC GTTTTTATCT ATAAATTGAT TGTCCGTGAG
AGTGTTGAAG AGAAGATGGT GCACTTGCAA GAACGCAAGC GAGCCTTGGC CGATAGCATT
ATTACGAGCG AACAAGGCAT TGTCAAAGCG TTGACCCGCG ATGATGTGGC CGATCTCTTT
TCATAA
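
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in blocks and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "TTGCGTGCTC TCACGCACGA TCAGGCCGTT TTTGAGGTCG CTGGGAGCCA AAAGAAACCC"

print(len(clean_sequence(first_line)))   # number of bases on that line
print(f"{gc_fraction(first_line):.0%}")  # GC content of that line only
```

Passing the whole gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.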
 
Protein sequence
MRALTHDQAV FEVAGSQKKP YTVTITSDPY GVITDCTCPH AAKGALCKHV VAAAHTLADH 
LIAHPLNPWR SVFASFQALP RRRPTPMRLV FSLLERGATW TLMPYTIAER VIPQAVMDDP
VALHRHLMTK DVLIQAKIPR TPLHPSHVPS TAEAALVVAN MIITVSHQFF YGAAFPLSTV
LDSVVQFLPN TILFMGDDYN PFRRMVQVCS DHAQIHLHGT ATADGITITP MLHVGDTRAT
LDTADVRMLH SAPPWAIYRT MLVRYDDPGN ILTLFGRYQH LHIPAADYPE FIERYLMPLA
EHTPLSGDLP QQALVAADPQ PRVYLREHES TVFAELRFGY ADHEVAYDQQ LPAETMRYHP
DHATILRIQR HSTSEEHAWG ELLRHGLKRG PQAGVATLRS GTTVATFLLT HVPALVAMNF
SIYGEESLLG ARINRHTPSI SLRVSSGIDW FDLEAVVRFG ETELHLAEFR RAIRKRERYV
QLADGSLGAI PEAWLDRYRH LFAFGDLHAE TLRFAPAHST VVDAVLEETD QVDQRFKERS
ERLKTINGIE AQALPAGFET VLRPYQKAGY DWLHFLYAYG FGGCLADDMG TGKTIQTLVF
LQSLVSRGQT SASSLIVMPR SLIFNWQREI ARWTPELRVL VHTDHGRADT TAAFANADLV
LTTYGTLLRD HDLLATYQFS YVVLDEAQTI KNPVSQTARA VRALRSEHRL TLSGTPVENS
IMELWSQFAF LNPGMLGSLD HFRTEFATPI ERDGDAHAAQ LLRRMVNPCI LRRTKDQVAP
DLPARNERIL YCAMEAAQHK LYQRYRDQYR AQLLSLIDDH GMNDSRMKVL EGLLRLRQIC
NHPRLVESTF RGRSAKFEQL LETLAILQAE GHKALIFSQF VQMLTILREH LDQQNVSYTY
LDGKTQNRAA VVDRFQTDPH VHFFLISLKA GGVGLNLTAA DYVIHIDPWW NPAVEQQATD
RTHRIGQEKP VFIYKLIVRE SVEEKMVHLQ ERKRALADSI ITSEQGIVKA LTRDDVADLF
S
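
The gene sequence starts with TTG while the protein starts with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first 30 bases of the gene; `translate_cds` is a hypothetical helper written for this example, not part of any library.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M
    and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":    # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCGTGCTC TCACGCACGA TCAGGCCGTT"))  # MRALTHDQAV
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (TCATAA) translate to the terminal S followed by a stop codon.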