Gene Haur_2194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2194 
Symbol 
ID5734081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2781851 
End bp2784916 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content47% 
IMG OID641279335 
Productsuperfamily II DNA/RNA helicase 
Protein accessionYP_001544962 
Protein GI159898715 
COG category[R] General function prediction only 
COG ID[COG4889] Predicted helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAACGA TTGCCCCCAA GGTGCTGAAC ACCTACCGTG TTGAGCTAGC GAGGGTGATG 
AAAGTCGGCG GTCTCAATGA AGGTGCGATT CGTAATGCCT TTCAAAATTT GTTGAGCGAG
GCTGGCCGCG CTTGGGGCAT GACCCTCGTC GCCGAGCAAA CACTCACGCT AGGTTCGCGC
AAGGCGCTGC GCTTCGATGG CGAGTTGCGC GATAGCCTCA AATTGCGCCA TGGAATTTGG
GAAGCCAAAG ATCCTGCCGA TGATTTGGAA CGCGAAATTA GCAATAAGCT CCGCGCAGGC
TACCCCACCA AAAATACGCT GTTTGAAAAT AGCCGCCAAG CAGTGTTATA CCAACATAAT
CAGCGGGTGT TGAGCATCGA TATGCACGAT GATCACCAGT TGGTACGCTT GCTTGAAACA
TTTTTTAGCT ATGCCGAGCC ACAGGTCGAT GATTTTCACT TAGCTGTCGC TCGCTTTCGT
CGCGAAATTC CCGATTTGGC CACCAGCGTC GCTGAAATTA TTGCCAATGA GTTAAAACAT
AGCCGCGATT TTAAATTGGC CTTCGATAGT TTTGTGGCGT TATGCCGCAG TTCGCTCAAC
CCCCAAACCA GCGAGCAGCA AGTCGAGGAG ATGTTGATTC AGCATTTGTT GACTGAACGA
ATTTTTCGCT CGGTCTTCGA TAACCCCGAT TTTGTGCGTC GTAATGCAAT TGCTGCCGAA
TTGGAAAAAG TGATTGCGGC GTTGCCCAAA CGGGCATTTA GCCGCGATAA GTTCTTGGCC
AGCCTCGATT ATTTTTATAA AGCGATAGAG AATTCGGCGC GGACGATCAG CGATTACAGC
GAGAAATCAA CCTTTTTAAA TACGGTCTAT GAGCAGTTTT TTCAGGGCTA TTCAACCGAT
ATCGCCGATA CGCACGGAAT TGTTTATACG CCTGCGCCAA TTGTGCGTTG GATGGTCACG
TCGGTTGAGC AACTATTGCG TGATCAATTC GACTCTAGTT TGAGCGATAA AGGTGTGCAT
GTGCTCGATC CCTGTGTCGG CACGGGCACG TTTATGCTCG AAATTTTGAA TCAATTACAA
AATAGTACAC TTGAGCATAA ATATCGCCAT GAGTTGCATT GTAATGAGTT GTTGTTGTTG
CCCTATTACA TCGCAGCCCA AAATATCGAA CATGAATTTT ATGATCGCAC CCAGAATTAT
GCGCCATTTG AGGGCCTTTG TTTTGCCGAC AATTTAGAGA TGGAAGCCAA TAAGCGCCAA
GCTTCGATGT TTGTGCCGGA AAATGCGCAA CGGGTGCAGC AACAGCAAGA TGCACCAATT
TTTGTGATTA TTGGCAATCC ACCCTATAAC GTCGGACAGC AAAATGAAAA TGATAATAAT
AAAAATCGTA AATATCCGCA TATCGATGCG CGGATTCGTC AAACCTATGC GAAATCGTCG
AAAGCATCGT TGCAAACCAA ACTTTACGAT ATGTATTCGC GCTTTTTTCG CTGGGCCACC
GACCGCCTCG GCGATAACGA TGGGGTGATT GCCTATGTTA GTAATGGCTC GTTTGTTGAG
CAAATTGCCT TCGATGGCAT GCGCAAGGAG TTGCTGAAGG ATTTTACCAG CATCTATGTG
CTTGATTTGG GCGGCAATGT GCGCAAAAAT CCTAAGCTTT CGGGCACAAC CCACAATGTG
TTTGGCATTC AGGTAAGTGT GGCGATTACC TTGTTGATTC GTAATCGTGC CCAATATCCC
CAGCGCCAGC AGGCCGAGCT ACACTACGCC CGCTTGGATG AATGGTGGTG GCGTGGCGAG
AAATATAGTT ATCTCAACCA GCACGCCGAT TATCGGGCGA TTGCGTGGCA ACAGTTGCAG
CCCACCAGCA ACGGCACATG GATCACCGAG GGCATAAGCG ACGATTTCGC CACCTTTGTA
CCAATTGGCA GCAAAGAGTC GCGTTCGGGC AGTGCTGGCG CAGAACCAAC GATTTTCAAT
ACCTATAGTT TGGGCGTTTC AACCAATCGG GATACTTGGG TGTATGATTT CAACCGCGAA
GCGCTGGCCA AACGCATGCA AACCTTCATC ACTACCTACA ATACCGAGGT TGATCGTTGG
CACAATCGCC AAACTGAGGT AGCACTCGAT GATTTTGTGT TGCAAGATGA CACGAAAATT
AAGTGGAGCC GAAATATTAA ACGTGATTTG AAGCGTTCAA AAAAAGTTTC ATTTTACGAA
AATAATGTAT TATTATCATT GTATAGACCA TTTACTCATC GATATATTTA TTTTAGCGAT
GTAATAATTG ATGAAATGAG CAAGATGGGT CTATTCTTCA AAGGAGCAAA CACATCGATA
TGTGTTACTG GTGTTGGTTC AGAAAAACCA TTTTCATTTT TCATAAGTAA TTATATATCT
GATCTTAATT TTTATGGTGG AGGTTCTGCC ACACAATGGT TTCCATTCTA CATTTACGAT
GAGGATGGCA GCAACCGGCG TGAGAATATC AGCGATTGGG CTTTGCAGCA TGTTCAAGCG
CATACTGGCA ACAATAATTT CGATAAATGG GATATTTTCT ACTACATCTA TGGTTTATTG
CATGTGCCAA GCTATCGTGA ACGCTACGCC GCCAACCTCA AACTTGAGCT ACCGCGCATC
CCCTTACTCG CGCCGAGCGT GATCGAACAA TTGAGTGCGG CAGGTCGCCA ATTGGCCGAA
TTGCACCTGA ACTACGAGCA ACAGCGCGAA TATAAGCTCA AGCATAACGA GAATTGCAAT
GTCCCATGGA CGTGGCGGGT CGAGAAAATG CGATTAAGCC GCGACAAAAG TGCGATCATC
TACAACCAAG CCTTGACGCT TGAAGGCATC CCAGTCGAGG TCTACGAGTA TCGGTTGGGC
AACCGCTCGG CGCTCGAATG GGTGATTGAT CAATATCAGG TCAGCACCGA CAGGCGCAGC
GGCATCACCA GCGATCCCAA CGACCTTGAT GATCGCGAGG CGATTGTGCG CTTGCTCAAA
CAAGTGATCA CGGTCAGTCT CAAAACCATA GCGATCATCC AGCAACTGCG GGCAATTAGC
CTCTAG
 
Protein sequence
MPTIAPKVLN TYRVELARVM KVGGLNEGAI RNAFQNLLSE AGRAWGMTLV AEQTLTLGSR 
KALRFDGELR DSLKLRHGIW EAKDPADDLE REISNKLRAG YPTKNTLFEN SRQAVLYQHN
QRVLSIDMHD DHQLVRLLET FFSYAEPQVD DFHLAVARFR REIPDLATSV AEIIANELKH
SRDFKLAFDS FVALCRSSLN PQTSEQQVEE MLIQHLLTER IFRSVFDNPD FVRRNAIAAE
LEKVIAALPK RAFSRDKFLA SLDYFYKAIE NSARTISDYS EKSTFLNTVY EQFFQGYSTD
IADTHGIVYT PAPIVRWMVT SVEQLLRDQF DSSLSDKGVH VLDPCVGTGT FMLEILNQLQ
NSTLEHKYRH ELHCNELLLL PYYIAAQNIE HEFYDRTQNY APFEGLCFAD NLEMEANKRQ
ASMFVPENAQ RVQQQQDAPI FVIIGNPPYN VGQQNENDNN KNRKYPHIDA RIRQTYAKSS
KASLQTKLYD MYSRFFRWAT DRLGDNDGVI AYVSNGSFVE QIAFDGMRKE LLKDFTSIYV
LDLGGNVRKN PKLSGTTHNV FGIQVSVAIT LLIRNRAQYP QRQQAELHYA RLDEWWWRGE
KYSYLNQHAD YRAIAWQQLQ PTSNGTWITE GISDDFATFV PIGSKESRSG SAGAEPTIFN
TYSLGVSTNR DTWVYDFNRE ALAKRMQTFI TTYNTEVDRW HNRQTEVALD DFVLQDDTKI
KWSRNIKRDL KRSKKVSFYE NNVLLSLYRP FTHRYIYFSD VIIDEMSKMG LFFKGANTSI
CVTGVGSEKP FSFFISNYIS DLNFYGGGSA TQWFPFYIYD EDGSNRRENI SDWALQHVQA
HTGNNNFDKW DIFYYIYGLL HVPSYRERYA ANLKLELPRI PLLAPSVIEQ LSAAGRQLAE
LHLNYEQQRE YKLKHNENCN VPWTWRVEKM RLSRDKSAII YNQALTLEGI PVEVYEYRLG
NRSALEWVID QYQVSTDRRS GITSDPNDLD DREAIVRLLK QVITVSLKTI AIIQQLRAIS
L