Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2194 |
Symbol | |
ID | 5734081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2781851 |
End bp | 2784916 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279335 |
Product | superfamily II DNA/RNA helicase |
Protein accession | YP_001544962 |
Protein GI | 159898715 |
COG category | [R] General function prediction only |
COG ID | [COG4889] Predicted helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACGA TTGCCCCCAA GGTGCTGAAC ACCTACCGTG TTGAGCTAGC GAGGGTGATG AAAGTCGGCG GTCTCAATGA AGGTGCGATT CGTAATGCCT TTCAAAATTT GTTGAGCGAG GCTGGCCGCG CTTGGGGCAT GACCCTCGTC GCCGAGCAAA CACTCACGCT AGGTTCGCGC AAGGCGCTGC GCTTCGATGG CGAGTTGCGC GATAGCCTCA AATTGCGCCA TGGAATTTGG GAAGCCAAAG ATCCTGCCGA TGATTTGGAA CGCGAAATTA GCAATAAGCT CCGCGCAGGC TACCCCACCA AAAATACGCT GTTTGAAAAT AGCCGCCAAG CAGTGTTATA CCAACATAAT CAGCGGGTGT TGAGCATCGA TATGCACGAT GATCACCAGT TGGTACGCTT GCTTGAAACA TTTTTTAGCT ATGCCGAGCC ACAGGTCGAT GATTTTCACT TAGCTGTCGC TCGCTTTCGT CGCGAAATTC CCGATTTGGC CACCAGCGTC GCTGAAATTA TTGCCAATGA GTTAAAACAT AGCCGCGATT TTAAATTGGC CTTCGATAGT TTTGTGGCGT TATGCCGCAG TTCGCTCAAC CCCCAAACCA GCGAGCAGCA AGTCGAGGAG ATGTTGATTC AGCATTTGTT GACTGAACGA ATTTTTCGCT CGGTCTTCGA TAACCCCGAT TTTGTGCGTC GTAATGCAAT TGCTGCCGAA TTGGAAAAAG TGATTGCGGC GTTGCCCAAA CGGGCATTTA GCCGCGATAA GTTCTTGGCC AGCCTCGATT ATTTTTATAA AGCGATAGAG AATTCGGCGC GGACGATCAG CGATTACAGC GAGAAATCAA CCTTTTTAAA TACGGTCTAT GAGCAGTTTT TTCAGGGCTA TTCAACCGAT ATCGCCGATA CGCACGGAAT TGTTTATACG CCTGCGCCAA TTGTGCGTTG GATGGTCACG TCGGTTGAGC AACTATTGCG TGATCAATTC GACTCTAGTT TGAGCGATAA AGGTGTGCAT GTGCTCGATC CCTGTGTCGG CACGGGCACG TTTATGCTCG AAATTTTGAA TCAATTACAA AATAGTACAC TTGAGCATAA ATATCGCCAT GAGTTGCATT GTAATGAGTT GTTGTTGTTG CCCTATTACA TCGCAGCCCA AAATATCGAA CATGAATTTT ATGATCGCAC CCAGAATTAT GCGCCATTTG AGGGCCTTTG TTTTGCCGAC AATTTAGAGA TGGAAGCCAA TAAGCGCCAA GCTTCGATGT TTGTGCCGGA AAATGCGCAA CGGGTGCAGC AACAGCAAGA TGCACCAATT TTTGTGATTA TTGGCAATCC ACCCTATAAC GTCGGACAGC AAAATGAAAA TGATAATAAT AAAAATCGTA AATATCCGCA TATCGATGCG CGGATTCGTC AAACCTATGC GAAATCGTCG AAAGCATCGT TGCAAACCAA ACTTTACGAT ATGTATTCGC GCTTTTTTCG CTGGGCCACC GACCGCCTCG GCGATAACGA TGGGGTGATT GCCTATGTTA GTAATGGCTC GTTTGTTGAG CAAATTGCCT TCGATGGCAT GCGCAAGGAG TTGCTGAAGG ATTTTACCAG CATCTATGTG CTTGATTTGG GCGGCAATGT GCGCAAAAAT CCTAAGCTTT CGGGCACAAC CCACAATGTG TTTGGCATTC AGGTAAGTGT GGCGATTACC TTGTTGATTC GTAATCGTGC CCAATATCCC CAGCGCCAGC AGGCCGAGCT ACACTACGCC CGCTTGGATG AATGGTGGTG GCGTGGCGAG AAATATAGTT ATCTCAACCA GCACGCCGAT TATCGGGCGA TTGCGTGGCA ACAGTTGCAG CCCACCAGCA ACGGCACATG GATCACCGAG GGCATAAGCG ACGATTTCGC CACCTTTGTA CCAATTGGCA GCAAAGAGTC GCGTTCGGGC AGTGCTGGCG CAGAACCAAC GATTTTCAAT ACCTATAGTT TGGGCGTTTC AACCAATCGG GATACTTGGG TGTATGATTT CAACCGCGAA GCGCTGGCCA AACGCATGCA AACCTTCATC ACTACCTACA ATACCGAGGT TGATCGTTGG CACAATCGCC AAACTGAGGT AGCACTCGAT GATTTTGTGT TGCAAGATGA CACGAAAATT AAGTGGAGCC GAAATATTAA ACGTGATTTG AAGCGTTCAA AAAAAGTTTC ATTTTACGAA AATAATGTAT TATTATCATT GTATAGACCA TTTACTCATC GATATATTTA TTTTAGCGAT GTAATAATTG ATGAAATGAG CAAGATGGGT CTATTCTTCA AAGGAGCAAA CACATCGATA TGTGTTACTG GTGTTGGTTC AGAAAAACCA TTTTCATTTT TCATAAGTAA TTATATATCT GATCTTAATT TTTATGGTGG AGGTTCTGCC ACACAATGGT TTCCATTCTA CATTTACGAT GAGGATGGCA GCAACCGGCG TGAGAATATC AGCGATTGGG CTTTGCAGCA TGTTCAAGCG CATACTGGCA ACAATAATTT CGATAAATGG GATATTTTCT ACTACATCTA TGGTTTATTG CATGTGCCAA GCTATCGTGA ACGCTACGCC GCCAACCTCA AACTTGAGCT ACCGCGCATC CCCTTACTCG CGCCGAGCGT GATCGAACAA TTGAGTGCGG CAGGTCGCCA ATTGGCCGAA TTGCACCTGA ACTACGAGCA ACAGCGCGAA TATAAGCTCA AGCATAACGA GAATTGCAAT GTCCCATGGA CGTGGCGGGT CGAGAAAATG CGATTAAGCC GCGACAAAAG TGCGATCATC TACAACCAAG CCTTGACGCT TGAAGGCATC CCAGTCGAGG TCTACGAGTA TCGGTTGGGC AACCGCTCGG CGCTCGAATG GGTGATTGAT CAATATCAGG TCAGCACCGA CAGGCGCAGC GGCATCACCA GCGATCCCAA CGACCTTGAT GATCGCGAGG CGATTGTGCG CTTGCTCAAA CAAGTGATCA CGGTCAGTCT CAAAACCATA GCGATCATCC AGCAACTGCG GGCAATTAGC CTCTAG
|
Protein sequence | MPTIAPKVLN TYRVELARVM KVGGLNEGAI RNAFQNLLSE AGRAWGMTLV AEQTLTLGSR KALRFDGELR DSLKLRHGIW EAKDPADDLE REISNKLRAG YPTKNTLFEN SRQAVLYQHN QRVLSIDMHD DHQLVRLLET FFSYAEPQVD DFHLAVARFR REIPDLATSV AEIIANELKH SRDFKLAFDS FVALCRSSLN PQTSEQQVEE MLIQHLLTER IFRSVFDNPD FVRRNAIAAE LEKVIAALPK RAFSRDKFLA SLDYFYKAIE NSARTISDYS EKSTFLNTVY EQFFQGYSTD IADTHGIVYT PAPIVRWMVT SVEQLLRDQF DSSLSDKGVH VLDPCVGTGT FMLEILNQLQ NSTLEHKYRH ELHCNELLLL PYYIAAQNIE HEFYDRTQNY APFEGLCFAD NLEMEANKRQ ASMFVPENAQ RVQQQQDAPI FVIIGNPPYN VGQQNENDNN KNRKYPHIDA RIRQTYAKSS KASLQTKLYD MYSRFFRWAT DRLGDNDGVI AYVSNGSFVE QIAFDGMRKE LLKDFTSIYV LDLGGNVRKN PKLSGTTHNV FGIQVSVAIT LLIRNRAQYP QRQQAELHYA RLDEWWWRGE KYSYLNQHAD YRAIAWQQLQ PTSNGTWITE GISDDFATFV PIGSKESRSG SAGAEPTIFN TYSLGVSTNR DTWVYDFNRE ALAKRMQTFI TTYNTEVDRW HNRQTEVALD DFVLQDDTKI KWSRNIKRDL KRSKKVSFYE NNVLLSLYRP FTHRYIYFSD VIIDEMSKMG LFFKGANTSI CVTGVGSEKP FSFFISNYIS DLNFYGGGSA TQWFPFYIYD EDGSNRRENI SDWALQHVQA HTGNNNFDKW DIFYYIYGLL HVPSYRERYA ANLKLELPRI PLLAPSVIEQ LSAAGRQLAE LHLNYEQQRE YKLKHNENCN VPWTWRVEKM RLSRDKSAII YNQALTLEGI PVEVYEYRLG NRSALEWVID QYQVSTDRRS GITSDPNDLD DREAIVRLLK QVITVSLKTI AIIQQLRAIS L
|
| |