Gene Haur_1937 details


Gene Information

Locus tag: Haur_1937
Symbol:
ID: 5733826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2347018
End bp: 2350092
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table: 11
GC content: 46%
IMG OID: 641279081
Product: superfamily II DNA/RNA helicase
Protein accession: YP_001544708
Protein GI: 159898461
COG category: [R] General function prediction only
COG ID: [COG4889] Predicted helicase
TIGRFAM ID:
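
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2347018
end_bp = 2350092

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3075
print(protein_length_aa)  # 1024
```

Both results match the Gene Length and Protein Length fields of the record.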


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00156892
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTCGA TCAAGCCAAA TCATAAGGCG GTGCGCGAGT ATTACGCCAG TTTGCGCAGT 
TTGGCCGAGG CCCGTGCCCA ACACGAGGGC GCAGTTGCTC CAGCCTTTGC GGCCTTGCTC
CGCGCTTGCG CAAGCCAGAT GGGCTGGACA TTGGTCGAAC AGTATTCGAT CCGCTTGAAA
AAAAGCTCGA TTCGCGCCGA TGGTGCTTTG CTCGATAGTT TTACCTTGAT TCGCGGGGTC
TGGGAAGCCA AAGATAGTAA CGATGATTTG GCAACTGAGG TGCGTAAAAA GTTCGCCGCT
GGCTATCCCG CTGAAAATAT CTTGTTTCAG GCTCCGCAAC GGATTATTCT TTGGCAAAAT
CAGCGCCAAG TGCTCGATGT GGATATTAGC CAGCCCGATG CGTTGATCGA TGCGCTGCTG
CTGTTTTTCA ATTATCAGCC GCCCCAATAT TTGCAATGGG ATCACGCGGT CGTCGAGTTT
CGCGAGCGCG TGCCCGAACT CGCCCAAGGC GTGCTGCGTT TGATCGAGCG CGAAATTAGC
GAGAAGAATC AGCGTTTTAT CACGGCGCTT GAGCGCTTTA TGGCCTTGGT GCGCGAGGCG
ATCAACCCCA ATATTTCGGT TTCAGCGGTC GAGGAGATGT TGATTCAGCA CTTGTTGACC
GAGCGGATTT TTCGCAAGGT CTTTAATAAT CCCGATTTTG TTAATCGCAA TGTGATTGCC
CGCGAAATTG AAACCGTGAT TCAGGCATTA ACTTCGCGTT CGTTCAATCG TAATGATTTT
TTGCGTGAAC TCGACCGTTT TTATGGGGCA ATCGAATCGA CCGCCGCCAC CATCGAGAAT
TTCAGCCATA AACAGGATTT CTTGAACACG GTGTATGAGA ATTTTTTCCA AGGCTTTTCG
ATTAAGGTCG CTGATACGCA TGGGATTGTT TACACGCCTC AGCCGATTGT CGATTTTATG
GTGCGTTCGG TCGAGGAGTT GTTGCGGCGC GAATTTAATA CCTCGCTCGG CAACGCGGGC
GTGCATGTGC TCGACCCATT TGTTGGCACT GGCAACTTTT TGCTGCGGGT GATGCACGAA
ATTCCGCGCA GCAAATTGCG CCAAAAATAT GCCGAGGAAT TACACTGCAA CGAGGTGATG
TTGTTGCCCT ACTACATCGC TTCGATGAAT ATCGAGCATT TGTATTATGA ATTGACCAAT
AGCTATCAAG AATTCAATGG CATTTGTTTG GTCGATACCT TTGAATTAGC TCAAGTTGGC
GCAGGCCAGC AATTGGGCTT GTTTGTGCCC GAAAACACCG AGCGCGTGCT CAAACAACAA
CAACAAGATA TTTTTGTAAT CATCGGCAAC CCACCTTACA ACGCCCGCCA AGTTAACGAG
AATGATAATA ATAAAAATCG TAAGTATGAA ATTATCGATC AACGGGTGGC TATGACCTAT
AGCCGCGATT CACAACAAAC CAACAAAAAT GCATTGAACG ATCCATATGT TAAATCCTTT
CGCTGGGCGG CTGATCGAAT TATACGCAAC GGTGATGAAG GTATTGTAGC CCTTGTTACC
AACAATAGTT TTATTGATGA TTTATCGTTT GATGGCATGC GCAAGCATTT AGCACAGGAT
TTTGATGCAA TCTATGTGCT TGATCTTGGT GGCAATGTGC GCAAAAATCC CAAACTTTCC
GGCACAACAC ACAATGTGTT TGGCATTCAG GTTGGAGTCA GTATCATTTT TCTGATCAAA
AAGCGTGGTT CAACTAAAGC AAGTGATGCC AAAATTTGGT ATGCGCGAGC TGGCGAGATG
TGGAAAAAAC AAGAAAAGTT TAATTTGCTT AATCAGGCTG AAACAATAGA TAAAATCGAA
TGGCAAGAAA TTTTACCTGA TAAAAAGCAT ACTTGGCTCA CGGATGGCTT AGAAAATGAT
TTTGATAATT TTATTCCCTT AGGTACAAAG GAAGCGAAAA AAGGATTTGG TCAAGCAATT
TTCACACAAT TTACAAATGG TGTAAAAAGT AATCGTGATG CTTGGGTTTG GAATTTTGAT
TCTGATACCT TATCAAACAA CATCAAAACC ACTATTGATT ATTACAATGA TCATGTTTCT
CGCTGGCAAA GATTGGTGAC CAAACAAGAA ATTGATAGTT TTATTTCTAC TGATGACAAA
AAAATTAGTT GGAGCGGTGA TCTAAAAAGT AATATCCAAC GAGGGCGTTA TATCCAATAT
GATGCAAATA AAGTTATAGA TGGCATCTAT CGACCATACA CAAAACAGAA AATTTACTTT
GAACGGCTTC TCAATGAGCG AGTCTACCTG ATACCATCTC TATTCCCAAC AGCTAGTGAA
AATCGGGTAA TTTGCGTTGT CAACGAGGCA CAAATCCCAT TTTCAGCCCA AATCACTAAC
GTCATTCCTT GTTTGCATTA TGGTGGGCGG CAAACTCAAT GCTTCCCATA TTATGTCTAT
GATGACGATG GCAGCAACCA GCGCGAAAAC ATCAGCGATT GGGCGCTTGA GCACTTCCGC
AGCCAACTTG GCGAGCCAAG CATCGAAAAA TGGGATATTT TTTATTATGT GTATGGGCTG
CTGCACTCGC CGCATTACCG CGAACGCTAC GCCGCCAACT TACGCCGCGA ACTGCCGCGC
ATCCCAATTG TGGCCTTGGC CGATTTTCAG GCCTTGGCCC AAGCAGGCCG CGAGCTGGCC
GAACTGCACA TTAATTACGA AAGTCAGCCT GAATATAATT TACAATGGCT CGAAAACCGC
GACGAACCAC TGAATTGGCG GGTTGAAAGC ATGAAACTCA GCAAAGATCG CACAACCCTG
CGCTACAACA ACTTTCTGAG TTTGGCGGGC ATTCCGGCGG CGGCCTTTGA ATATAAGCTG
GGCAACCGCT CGGCGCTCGA TTGGGTGATT GATCAATATC GGGTCAGCAC CGATGCGCGG
TCGGGCATCA CCAACGATCC CAATCGCCAC GACGATCCTG AGTATATCGT GCGGCTGATC
GGCAAAATCA TCACCATCAG CCTCAAAACC GTCGAGATTG TGACGCGAAT TGGGGATGTG
GCCCTCACCC CCTAA
 
Protein sequence
MLSIKPNHKA VREYYASLRS LAEARAQHEG AVAPAFAALL RACASQMGWT LVEQYSIRLK 
KSSIRADGAL LDSFTLIRGV WEAKDSNDDL ATEVRKKFAA GYPAENILFQ APQRIILWQN
QRQVLDVDIS QPDALIDALL LFFNYQPPQY LQWDHAVVEF RERVPELAQG VLRLIEREIS
EKNQRFITAL ERFMALVREA INPNISVSAV EEMLIQHLLT ERIFRKVFNN PDFVNRNVIA
REIETVIQAL TSRSFNRNDF LRELDRFYGA IESTAATIEN FSHKQDFLNT VYENFFQGFS
IKVADTHGIV YTPQPIVDFM VRSVEELLRR EFNTSLGNAG VHVLDPFVGT GNFLLRVMHE
IPRSKLRQKY AEELHCNEVM LLPYYIASMN IEHLYYELTN SYQEFNGICL VDTFELAQVG
AGQQLGLFVP ENTERVLKQQ QQDIFVIIGN PPYNARQVNE NDNNKNRKYE IIDQRVAMTY
SRDSQQTNKN ALNDPYVKSF RWAADRIIRN GDEGIVALVT NNSFIDDLSF DGMRKHLAQD
FDAIYVLDLG GNVRKNPKLS GTTHNVFGIQ VGVSIIFLIK KRGSTKASDA KIWYARAGEM
WKKQEKFNLL NQAETIDKIE WQEILPDKKH TWLTDGLEND FDNFIPLGTK EAKKGFGQAI
FTQFTNGVKS NRDAWVWNFD SDTLSNNIKT TIDYYNDHVS RWQRLVTKQE IDSFISTDDK
KISWSGDLKS NIQRGRYIQY DANKVIDGIY RPYTKQKIYF ERLLNERVYL IPSLFPTASE
NRVICVVNEA QIPFSAQITN VIPCLHYGGR QTQCFPYYVY DDDGSNQREN ISDWALEHFR
SQLGEPSIEK WDIFYYVYGL LHSPHYRERY AANLRRELPR IPIVALADFQ ALAQAGRELA
ELHINYESQP EYNLQWLENR DEPLNWRVES MKLSKDRTTL RYNNFLSLAG IPAAAFEYKL
GNRSALDWVI DQYRVSTDAR SGITNDPNRH DDPEYIVRLI GKIITISLKT VEIVTRIGDV
ALTP
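
The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that layout, compute a GC fraction, and translate DNA into protein. The helper names (clean, gc_fraction, translate) are hypothetical. The codon table is built from the standard NCBI amino-acid string, on the assumption that translation table 11 assigns the same amino acids to internal codons as the standard code. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard
# amino-acid string; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = clean(
    "ATGCTGTCGA TCAAGCCAAA TCATAAGGCG GTGCGCGAGT ATTACGCCAG TTTGCGCAGT"
)
print(translate(first_row))  # MLSIKPNHKAVREYYASLRS
```

The printed peptide matches the first twenty residues of the protein sequence. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.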