Gene Haur_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2539 
Symbol 
ID5734417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3253308 
End bp3256703 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content52% 
IMG OID641279679 
Producthelicase domain-containing protein 
Protein accessionYP_001545305 
Protein GI159899058 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATTC CCTATGTGAT TGATAATATT GAGCAGCGCC TCGCCGACGT GCTGCGCTAC 
TTACTGAGCC GAGCGAGCGA CCAATCGCTA GATATTGCCA CGGCCTATTT CAGCATTCGC
GGCTTTGAAC AGCTGCGCCA CGAGCTGGAA ACCACAGGCC AAGTGCGTTT ACTGCTGGGC
GATAAACCAG TGGCAGCTGA TGATTTGGGT ATGCGACCCG ATTCGAGCGC GTTTTTGCGC
CATGAACTGA ACGCCGAAGC CTTTACTGAA CGCACCTTGC GCTTAGTCGA ACAGCTGATT
CGCTTTTTGC GCCGCGACGA GGTTGAGGTG CGTTTGTATA CGGGCTTGTT GGCTGGCGAA
AAAGGCCGCC GCGCATTTTT GCATGCCAAA TGCTATTTAT TGTATGGTCG CTTGCCCGCT
GCCAATGTGT TTGATCCGCT CACGCCGATT GTTGGCATTG TGGGCAGCAG CAATTTCACT
GGGCCAGGCT TGGTCAGCAA TCATGAACTA AATTTGGTGC ACAAAACCCT GCTCACCGAC
GCTGAGCTTG ATGATGCTGA AGCTAAAGCT ACGGTTGCCC AGCATGCCGC CCAACTGAGT
GCTGCGCCGC TTGAACTTGA ACATGAACGC ATGCTCAAAA GCGAAATTGG CACGCGAGCT
ATTCTCGATC TTTCGCGTTG GTATGAGCAA CAATGGCAGC ATGCGCTTGA TTATAAAGAA
CAATTGATCG AACTGTTGGA AAACTCCAAA TTTGGCGGGC GCGAATATAG CCCCTACGAA
ATTTACATGA AAGCGCTGTA TACCTATTTT AAAGATGATT TAGCTGAGCG CGACCAACAA
CCAGCCACCC GTTCGGCGGT TGAGCTAGCC GAATTTCAAG AAGATGCAGT GCGCAAGGCG
CGGCGCATTT TAGCCCAATA CGATGGCGCG ATTGTCGCCG ACTCGGTGGG CTTGGGCAAA
ACTTGGATTG GCAAAAAACT GCTGGAAGAT TACGCCTACC ATCAGCGCCA AAAAGCCTTG
GTGATTTGTC CGGCCTCGCT GCGTGGCATG TGGGAGCGTG AATTGCACAG CGCCACGATC
GCCGCCCAAG TGCTGACCCA AGAACGGTTG GGCCTCGACG AATTTGATGG CCGCGAGTTT
TTCGATGTTG ATCTGATTTT GGTCGATGAA GCCCACAACT TTCGCAACAA ACGCGCCAAA
CGCTACCAAC AACTTGAGCT TTTACTGGCG GCCAATGGGC GACGCGGGCG CAGCGGTAGC
CGCAAAAAGC TGATTTTGCT GACCGCCACC CCCATTAACA ACAATATTTT CGATTTATAC
AACCAAATTA ATCTGTTTAC CGGCAACGAC CGCAGCTACT TTGCTTCGGC TGGCATCGGC
GATTTATACA AATATTTCTT GGCAGCGCGG TGCGAATCGT TGGAAGCTGG TTCGATTCGC
ATCTTTAACC TGCTCGAAGA AGTGGTGATT CGCCGCACGC GCCAATTTAT TCGGCGAGCC
TACCCCGAAG CGCTGATTCA TGGCGAGCCA ATTCGCTGGC CAACCCGCCA ACTGCACAGC
GTCGAATACG ATTTACAAGC GGCCTACAGT GGCTTATATC AAAATATTAT TGGTTCAATC
GAAGGCCTAC ATCTCGCTCA CTACAACCTT GAGGCCTACA AACTGCGCCC CAGCGACCAA
GATGAATTTG AGCTGGGGCG ACAAGCAGCG CTGGTGGGTA TTTTCAAAAG TCGCTTTCTC
AAGCGGCTCG AATCGAGTAT CGAAGCCTTT CGCATTTCAA TTCGCCGTTC GTTGGCCTTT
GTCAAAACCT TTGCCGAATA TGTGCAAGAT GGGATTGTGC TCGATTCGGT GTCGTTTCAG
CAGGCCATGC GCTTGCTCGA AGCCGACGAA GAAGATAGCG ACGATAGCAC GCCCAGTTCG
CAGGCCTCGG CGCTTGATGA GCATGCCGCC GCCAGCCTGA TTATTGCCAA ACTGCCCAAA
CTTGAGGCCA GTAAATACGA TCGGCGGCGT TTGCATCGGG CTTTGCAAGC AGATATTGAT
GCGCTGAACG AAATTTGGCA TGCGATTAAA CACATTCAAG CCAGCCACGA TGCCAAATTG
CTGCAATTGC AAAGCCTACT TGCCAGCCAA CTCAACGGCT GCAAAGTGGT GATTTTTACC
TATTACAAAG ATACAGCCCG CTATGTCTAT CAAGCCTTGA CTGAGGCAAG CAATGCTGAA
TGGCTGGCCA CGCTTGGCAA TCCAACAATT CGGCGCATTG ATAGCAGCGT CAAAACGACC
GATCGGACGC GAATTGTGAG CCATTTTGCG CCACGCGCCA GCAACCAACC CGAATTGGTT
GGCACAAGCG ACGAAGTACA AATTTTGATC GCCACCGATG TGCTTTCCGA GGGCCATAAT
TTGCAAGATT GCGGCCATTT GCTCAACTAC GATTTGCACT GGAACCCGAC GCGCATGGTC
CAACGGGCAG GCCGGATCGA TCGGCTTGGC TCGAATTTCG ATCTGCTGCA TGTGTACAAT
ATGTTTCCTG AGCGCGAACT CGAAGCTTTG CTGGGCTTGG TGCGCAGCCT GACCAGCAAA
ATCGATTTGA TCAACCAAAC GGGCTTTTTA GATGCTAGCG TGTTGGGCGA AGTCGTTACG
CCGCGCGATT TCAACACGCT CAAGCGCATC GCCGACGAAG ATCATAGCGT GATCGAGGAA
CAAGAATCAT TTTTGGAGCT AGCGAGCAGC GAATCATTAT TTGCCGAATT GCAAAATGTG
CTGGCGACCG ATGCCCAACG CTGGCTGACC GACCTTGATG ATGGGATTCA TTCGGGGATT
GAGCGACGTA ATGCCAAAGG CCTGTTTTTC TATTTCACTG CGCCGCGTGA TGGCAGCACC
GCCCATTTTT GGCGCTACTA CGACCTTGAA AAACAAACGA TTACCGATAA TCGCTACACC
ATTATGCAGC TGATTGCCTG TAGCCCAGAT ACGCCACGCT TTGCCCCGCC CTATAGCGAA
GTTGATATTT TCGCCATCCA CGACACCATT CTCAACAGCA TTTTGCGCGA TGTTCAGCAA
CAAGTCACGG CGACGGTGGT CGATAAAATC GTTGCACCGG AACAAAACGT GATCGCCCAA
TTGCTCCACA ACAATCTTGC ACAGCCTGGC GTTGATCGCG GCGAAGTGCG CGAATTGCGC
AAAATGCTCA AAGAACCCCA AGTTGGCGCG GTCGTGCAAC GTTTGCGTAA AACGCTCAAT
CGCTACAACG CCGACAACGA TTTGCTTGCG CTGCTTGAGG TGCTACGCGA ACTTTACCAA
ACCCAAGGTC GCAGCAATCT TGAGCCGAGT GGCCCCGTTA GCATGATTAC CCGCGATGAT
CTGACCTTGG TGTGCTATGA GTATATTTAT GCATAG
 
Protein sequence
MQIPYVIDNI EQRLADVLRY LLSRASDQSL DIATAYFSIR GFEQLRHELE TTGQVRLLLG 
DKPVAADDLG MRPDSSAFLR HELNAEAFTE RTLRLVEQLI RFLRRDEVEV RLYTGLLAGE
KGRRAFLHAK CYLLYGRLPA ANVFDPLTPI VGIVGSSNFT GPGLVSNHEL NLVHKTLLTD
AELDDAEAKA TVAQHAAQLS AAPLELEHER MLKSEIGTRA ILDLSRWYEQ QWQHALDYKE
QLIELLENSK FGGREYSPYE IYMKALYTYF KDDLAERDQQ PATRSAVELA EFQEDAVRKA
RRILAQYDGA IVADSVGLGK TWIGKKLLED YAYHQRQKAL VICPASLRGM WERELHSATI
AAQVLTQERL GLDEFDGREF FDVDLILVDE AHNFRNKRAK RYQQLELLLA ANGRRGRSGS
RKKLILLTAT PINNNIFDLY NQINLFTGND RSYFASAGIG DLYKYFLAAR CESLEAGSIR
IFNLLEEVVI RRTRQFIRRA YPEALIHGEP IRWPTRQLHS VEYDLQAAYS GLYQNIIGSI
EGLHLAHYNL EAYKLRPSDQ DEFELGRQAA LVGIFKSRFL KRLESSIEAF RISIRRSLAF
VKTFAEYVQD GIVLDSVSFQ QAMRLLEADE EDSDDSTPSS QASALDEHAA ASLIIAKLPK
LEASKYDRRR LHRALQADID ALNEIWHAIK HIQASHDAKL LQLQSLLASQ LNGCKVVIFT
YYKDTARYVY QALTEASNAE WLATLGNPTI RRIDSSVKTT DRTRIVSHFA PRASNQPELV
GTSDEVQILI ATDVLSEGHN LQDCGHLLNY DLHWNPTRMV QRAGRIDRLG SNFDLLHVYN
MFPERELEAL LGLVRSLTSK IDLINQTGFL DASVLGEVVT PRDFNTLKRI ADEDHSVIEE
QESFLELASS ESLFAELQNV LATDAQRWLT DLDDGIHSGI ERRNAKGLFF YFTAPRDGST
AHFWRYYDLE KQTITDNRYT IMQLIACSPD TPRFAPPYSE VDIFAIHDTI LNSILRDVQQ
QVTATVVDKI VAPEQNVIAQ LLHNNLAQPG VDRGEVRELR KMLKEPQVGA VVQRLRKTLN
RYNADNDLLA LLEVLRELYQ TQGRSNLEPS GPVSMITRDD LTLVCYEYIY A