Gene Haur_4537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4537 
Symbol 
ID5736388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5806414 
End bp5809611 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content59% 
IMG OID641281699 
Producthypothetical protein 
Protein accessionYP_001547296 
Protein GI159901049 
COG category[R] General function prediction only 
COG ID[COG1205] Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATAT CGTCTGCCTC AACCCCAATT GGCGACGTGT TGCAGCAAAT TGCCCAACGT 
TTCGCCCAAA GCAGCCGCTC GCCTTTGTTG AGGAGTGGTG AATTAGCCTT ACCCGATGCT
AGCACCCAAC CGCTGCCCTT GGCTCCGGAG TTAGCGGTAG CTTGGCGGGC CTTGCTCGGC
GAAACTGGTT GGCCTTGGCA AGCTGAAGCG TTGGCGACCG TGCGGCGTGG CTTGGGCTTG
GCCTTGGTTG CGCCAGCCCC ACTTGGCCCA GCATGTTTGT TGCTGTTGGC TGCCGAGCAT
GTGAGCACCA ATCAAGGTAG TTTGTTGCTG TTGGCTCCCG ATGCTGCTAG CTTGCACGAT
TTGGCGCAAA CAGCGAATAA TATTGATGCG CTGCTTGGTG GCGGCTTTCC CCATTTGGTC
GTTGAGCAAA GTACGCGCCC ACCTCACAGT CCACCGCGTT TGATTTTGAC TACTCCTACA
ATTTTGCATC AACGCATGCT GCGCTCGCAC CATCGTGGAT GGTCGAATAT CTGGCCGCAT
CTGAATGGGG TGGTGTTGCC AGCTTTCGAT CAAGCCAGTA GTACGATTTT TGGCCATTGC
CGTTGGTTGA TGCGGCGGAT CGAACGCTTG CGCCCGCGTG CCCGACCATT GACGCTCTAT
GCCAGTTTAG CGCCCGTTGC CGAGATTGAT GAATTGCTGG CGCGAGTGTT TGACCACCTG
CCGCCGCTGG TTTATGCCAA TACCGCCCGC ATTCCCTTGA CTTGGGCTTT GTGGAATGGT
GGCACGCAGC CAGTTGATGC AGCTTTGAAA TTGGCCTTGG CCTTGCGCCA AGCAGGTTTA
AGCGTGCAGC TTGATGCGCC CGATAGGCTA GAGCGAGCAA TGTTGGCCCA ACGTGGCGCG
GCTCAAGGCC TGAGTTTAGT GCCACGGGCA GCGGCTCCGG CCCATGTGTT GGTGATGTTG
GGTGGCGTGA ATGCTACCAG CCTGCCAAGT TTACTCGCCA GCGGCCATCG CGCCGTGGTG
TTGGTGACCG ATCAATCGAT TGCTGCCCAA ACCGCCTTGG CTCAACCAGC CTTGCTCACC
CCAACTGTGC CGCCAGCCTT GCCCGTAGCC ACCCAAAACA ACTACATCAG CAGTGGGCAT
TTGCGCTGCG CCGCCGAAGA ACGGCCCTTG CAACAAAGCG AAATTAGCGC GTGGGAAGTG
AGCGATCTGG TTGAGCGTTT GACCCAGCGC AATCAGCTAG CCCAGTTGCC CGATAGCCCA
ACTTGGCAAC CAGTCGCCAA TTTGCAGCAG CGCAGCGATA TTTATGCCAC GCTGCATCCA
ACCACAATTA GTGATATACC AGTACAGATC GTCGATCACG AAGGCACCTT CTTGGCCGAA
CTTGATTCGG TGACGGTTGA GCGGCGTTTA TTTAGCGGCG CTAGCGTGCT CGGCGGGCGA
GTAATTGGCT GGAATGATGA TGGTTCATTG GGTTTGCGTT TGCAGGATGT TGCGCCAACC
TTGGCTGAAC ATCGCTGTAG TGTTGCGGTG CGCGAGCAAT TTGGCCAACG CCCGCTTGAT
GGAGCACGCG CCGAGATCGA GTTGATGATT GGCCGTGTGG TGGCAACCGA AGAAATTGTG
GCACGGCGCA GCCTGGCTGA TGATGGCAGC ATTCGGCGTG TGCCGTTTGA GCCGCCGATT
CAGCTGCAGT GGAATGCCCC AGCGCTGTGG TTGGCTGCGC CAGAAGCTGG GGCTGGCTTG
GGCGAAATCC TGCTTGGCGT GTTGCCATTA TTGCTGCACT GCCAGCCTGA TGCGATGGTG
GCGTGTGTCA GCGAACAACA TTTGTATTTG GTCGAAGCTC AACCAGGCGG GCGCGGCATA
GTTGAGCAGT TGTATAGCCA ATTTGAGGCT TGGCTACATT TGGCTGGCTT GGCGGCCCGC
ACTCTGAGCA AAGACCCGTT GTATGCTAGT TATGCTCAGG CCGAATTGCG TTGGCTTGAA
AAGATTTTGG TACCACTCGC TGCGCCGTTA CGCGCCGATA TGCCACCAGA GCCAGCCCAA
GTTGCACCGC CACGGGTCGA GCGAGCGAGC CGTCAATCGA TGGTAATTAG CACCAACGAT
TTGAATGCCC GCCGCCGTGG ACGTGGCAAT GTGTTTGCGC TGCCGCGTTC GCTACCCAAG
CAAGGCGAAG CGCTCAAGCG TAGCCAAACC AACCCTGTGC CAAACCAACC AGCGTTGCCT
GCTCAGCCCA TGCGTTTGGC CGCCCAACAA CCGCCAGCCA ACAAGCCGCT GACCAACCAA
CGTCCAGCGG TGCGCAACGA AGCCCCGCCA GCGCCGCCTA GCCCTGAAAA AGCCAAGGCC
AACCTGACCC GACCGTCACG GCGCAAGGCT AACGTTGGGC GCAATGAGGC GCAACGCTCA
ACCCAGCCCT TAGTGCAGCC CAAAACGCCA GCGCCAAGCC AGCCTTTACC GCCTGAACGC
GGTTCGGTGG TGATGCCGGT GGCCAACGAG CCGCCGCCCT ACGAACGACC ACCGTTTCAA
CAGCGCAACC CTCCAGAAAA ACCAGCGGCT CAGCGTCCAA GCTCGCGGCC TGTCCAGCGC
GAAAATCAGC CGCAGCAGCG TCCGATTCAG CGTGAGCAAC AACCGACCCG ACCGTATCAG
CGCAACGATC AGCCGACCAA ACCGATGCCG CGCGAAAGCC AGTCGCAGCA GCGTCCGATT
CAGCGCGAGC AACAACCAGC GCGACCGTAT CAGCGCAACG ATCAGCCGAC CAAACCGATG
CCGCGCGAAA TTCAGCCGCC ACAGCGCCCG CTTCAACCTG AGCAACGGCC AGTACAGCCA
AACTTGGCCG AGCCAGTGCG GCCATATCAA CGTAATGAGC AACCAGCCAA GCCCGTGGAA
GCCACTGCTG ATCCACAGAC AATGCTTGAA AAAGCGCGGC GTTTACGTGA GCAACGCGAA
GCCGAAGCCC GCGTAGCGCA GCCAATCACG CGGCCAAGCA CCAATCAAGC CGCCGAGCCA
AGCGAATCGC GCTTCAAACA AGGCGATCGG GTGCATTGCG TGCCCTACGG CGAGGGCGTG
GTGCAAAAAA CGCGCATCCG CGATGGCCGT GAGCTATTGC TGGTACAATT TCCAGAGCTA
GGTGATCTAC GGGTTGATCC AGCAGTCAAT GCTGTACGCA TCCTACGCCC TGAGATTCAA
GCCGAAGACG ACGAATAA
 
Protein sequence
MSISSASTPI GDVLQQIAQR FAQSSRSPLL RSGELALPDA STQPLPLAPE LAVAWRALLG 
ETGWPWQAEA LATVRRGLGL ALVAPAPLGP ACLLLLAAEH VSTNQGSLLL LAPDAASLHD
LAQTANNIDA LLGGGFPHLV VEQSTRPPHS PPRLILTTPT ILHQRMLRSH HRGWSNIWPH
LNGVVLPAFD QASSTIFGHC RWLMRRIERL RPRARPLTLY ASLAPVAEID ELLARVFDHL
PPLVYANTAR IPLTWALWNG GTQPVDAALK LALALRQAGL SVQLDAPDRL ERAMLAQRGA
AQGLSLVPRA AAPAHVLVML GGVNATSLPS LLASGHRAVV LVTDQSIAAQ TALAQPALLT
PTVPPALPVA TQNNYISSGH LRCAAEERPL QQSEISAWEV SDLVERLTQR NQLAQLPDSP
TWQPVANLQQ RSDIYATLHP TTISDIPVQI VDHEGTFLAE LDSVTVERRL FSGASVLGGR
VIGWNDDGSL GLRLQDVAPT LAEHRCSVAV REQFGQRPLD GARAEIELMI GRVVATEEIV
ARRSLADDGS IRRVPFEPPI QLQWNAPALW LAAPEAGAGL GEILLGVLPL LLHCQPDAMV
ACVSEQHLYL VEAQPGGRGI VEQLYSQFEA WLHLAGLAAR TLSKDPLYAS YAQAELRWLE
KILVPLAAPL RADMPPEPAQ VAPPRVERAS RQSMVISTND LNARRRGRGN VFALPRSLPK
QGEALKRSQT NPVPNQPALP AQPMRLAAQQ PPANKPLTNQ RPAVRNEAPP APPSPEKAKA
NLTRPSRRKA NVGRNEAQRS TQPLVQPKTP APSQPLPPER GSVVMPVANE PPPYERPPFQ
QRNPPEKPAA QRPSSRPVQR ENQPQQRPIQ REQQPTRPYQ RNDQPTKPMP RESQSQQRPI
QREQQPARPY QRNDQPTKPM PREIQPPQRP LQPEQRPVQP NLAEPVRPYQ RNEQPAKPVE
ATADPQTMLE KARRLREQRE AEARVAQPIT RPSTNQAAEP SESRFKQGDR VHCVPYGEGV
VQKTRIRDGR ELLLVQFPEL GDLRVDPAVN AVRILRPEIQ AEDDE