Gene Information |
Locus tag | Haur_1442 |
Symbol | |
ID | 5733306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1674028 |
End bp | 1676694 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641278580 |
Product | transcriptional regulator |
Protein accession | YP_001544214 |
Protein GI | 159897967 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
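The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon which is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1674028
END_BP = 1676694


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)
print(f"Gene length: {gene_len} bp, protein length: {protein_len} aa")
```

Under these assumptions the computed values agree with the Gene Length (2667 bp) and Protein Length (888 aa) rows of the record.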
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.31533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAATG CAGGGAAGGT GCATATGACT GTCGATTCAA TCATTACGAT TGACCAACTT GAGATTAATT TGACTGAGCA GCGAGTCAAG CGGGCGGGAG TTTATGTGCG GCTTGGGCGC AAAGAGTGGG GTTTGTTAGA ACTGTTAGTG CGTAATCGTA ATCACGTGCT CAGCCATCGC CAGTTGTTGC AGCATGTGTG GGGCGATAAT TATGAACGTG ATAGTAGCTA TTTATTGCAT GTAGCGATTG GCCGTTTGCG CGAGAAATTG GGCGATAAAC CGCCGCGCTA TATTGTGGCC GAGGCCGATT TGGGCTATCG TTTTGCCCTG TCAACGCCAA ATTTGCCAAG TGTGCCAACG CCTGAGCCCA GCTCCTTGGT TGTACCATCA ATCAGCTATA CCAATATTCC TGCGCCATTG AATGCCTTTA TTGGGCGCGA AAACGAGCAA CAACGCTTGT TGCAGCTTCT CCGTCAGCCG CATATTCGCT TGATTAGCGT TTTGGGGGTT GGTGGGGTTG GCAAAACCCG TTTAGTGCTG CAAAGTGCCA TGCAATGGCT GCCGATCTTT CATCATGGCG TGCATGTTGT GCATTTAGCT GCTTTGCGCG ATCCTGAGCT ACTAATCCAG ACAATTATTC AATCATTGGC GATTAAAACG ACTAGTCAAT TACCATTGCT CAAACAGCTC AAAGATTTTC TGTATGATAA ACAACTCTTG CTCATTCTCG ATAATTTTGA GCAATTGCTT GATGCTGCGC CGATTGTGAC CGATCTCTTA GCCCATGCAC CGCAATTAAA GTTAATGACG ACTAGCCGCG AGGCTTTGAA TGTGTATGCT GAGCAACAAT TTGAATTAAT GCCCTTTAGC GTTGATTGTT CGCAACAGTT GATGCTGCGC CAACAACCAG CAGTTAATCT TTTTCTCAGC CGTGCTCAAG CGCTACAAGC GACTATTGCT TATAACGATG CTGAATTAGC AACGATTGCT CAAATTTGCC AACGGCTTGA TGGTTTAGCC TTGGCAATTG AGCTGGCCGC CAGCCGAATC AGCTTGTTTT CATTAACAAA TTTACTTGAG CGTTTAAGCC AACGCTTGAG TTTTATCAAT AGTGGACCAC GCGATCTACC AGCTCGCCAC CAAACCTTGC AAGCAGTCAT CGAATGGAGC TATGTTTTAC TTACACCTCA AGAACAAGCG CTATTTGTCC AATTATCAGT GTTTGTCGGC AGCTTTGATC TACCAGCAGC TAGCGCAATT TGTGCGAGGG CTGAACAATC AGCAGTTGAA TTAGTGTTAT TAAGCTTGGT GCAAAAACAC CTGCTTCAGT ATCAGATAAC CGAACAACGT TTTTATTTCT TGGAAACAAT CCGTGAATAT GCTCAAGATC AATTAAATCA ACGAACTGAT AGTCGATGCT ACTATACAGC GCATACTGAT TATTATCGAA CGTTTGTTCA AAATTATGTG CATGAATTAA CTGGCTCAGA GCAACAACAC TGGATGAGTC AATTTCAAGC GAATTATCCC AATATTCGGG CTGCGCTAGC TCAAAGCCTC AAATCTGGGG CGTTTGCGAG TGCTGCTCAT TTGGGTGCTG TGCTCTGGAA TTTTTGGCAT CGCGCTGATC TCGCCCAAGA AGGTAGTTAT TGGATTAAAA AAATTCTCGA TCAGCCAGCC GAATTTGATC CCAAGCATCA GATTATGATG CTACGTGGAT TAAGTACTTT CATGCGTAAT CAGGGAGATT TTCAATTAGC CCAGCAGTAT CTTGCTCAAG GTTTAAGCAT TGCACGCCAA 
GAACAACTTG ATGAATTGGC GATGGGTTTA TTGAATGGAA TCGGTTTGCT CTATAAACGT GAGGGTTTAT TTGAACAAGC AGGCCAAGCA TTCAGTGAAG CGATTCAACT TGCCCGTACA TTCGATAAAA CCCGCGATAT GGCGGCGATT CTCAGCAATC TTGGCGAAAT TCGGCGTAGC CTCGAACAAT ATCAGCAAGC ACAGCTCGAT TTACAAGAAT CATTGGGATT AGCACGCAAA GTCAATGATC ATCATATGGC TGCAAATATT TTGAATAGTC TAGGCTTATT GGCCTATGAT ACCTGCAATT ATCAACAAGC ACAGCAATAC TATCGACAAG GCCTAGAAAT TCATCAACAA TTACAAAATA AACGTGGAAT TGCCCTAATT TACACGAATT TAGCCGATAG CTTAACCAAA TTACTGCAAT TTAGCCAAGC AAATGAATAT TATGATCAAG CATTAATCCA TAGTCATACC ATTGCTGATC CCTATTTTAT CTGTCATATA ACGATTCAAA AAGCCTGGGC TTTGCGCCAA CAAGCCCAAA TTGCCCAAGC TCAACCATTA ATTATTCAAG CATTACACTT GGCCAATCAA CACAAATTTA TCGCTCATAA CCTACGAGCA ACATTGGAGT TGACCGAATG TTATTTAGCC CAACAACAAT TGCCATTAGC AGCATGGTTG GCAGGGTTTA TGTACCAATT GGCCCAAGAC CATCAACAGT TGCTCGATGG CAATGATCGG CGCTATCACC AAACGCGTTT GGCCGAATTG CAAGCAATAC TTGCTGAACA TAATGAATAT TGGCAGCAAG GCTATCAAAG CTCGTTTGAA CAGCTGTTGG CGGCAATTGT GGTTTAA
|
Protein sequence | MANAGKVHMT VDSIITIDQL EINLTEQRVK RAGVYVRLGR KEWGLLELLV RNRNHVLSHR QLLQHVWGDN YERDSSYLLH VAIGRLREKL GDKPPRYIVA EADLGYRFAL STPNLPSVPT PEPSSLVVPS ISYTNIPAPL NAFIGRENEQ QRLLQLLRQP HIRLISVLGV GGVGKTRLVL QSAMQWLPIF HHGVHVVHLA ALRDPELLIQ TIIQSLAIKT TSQLPLLKQL KDFLYDKQLL LILDNFEQLL DAAPIVTDLL AHAPQLKLMT TSREALNVYA EQQFELMPFS VDCSQQLMLR QQPAVNLFLS RAQALQATIA YNDAELATIA QICQRLDGLA LAIELAASRI SLFSLTNLLE RLSQRLSFIN SGPRDLPARH QTLQAVIEWS YVLLTPQEQA LFVQLSVFVG SFDLPAASAI CARAEQSAVE LVLLSLVQKH LLQYQITEQR FYFLETIREY AQDQLNQRTD SRCYYTAHTD YYRTFVQNYV HELTGSEQQH WMSQFQANYP NIRAALAQSL KSGAFASAAH LGAVLWNFWH RADLAQEGSY WIKKILDQPA EFDPKHQIMM LRGLSTFMRN QGDFQLAQQY LAQGLSIARQ EQLDELAMGL LNGIGLLYKR EGLFEQAGQA FSEAIQLART FDKTRDMAAI LSNLGEIRRS LEQYQQAQLD LQESLGLARK VNDHHMAANI LNSLGLLAYD TCNYQQAQQY YRQGLEIHQQ LQNKRGIALI YTNLADSLTK LLQFSQANEY YDQALIHSHT IADPYFICHI TIQKAWALRQ QAQIAQAQPL IIQALHLANQ HKFIAHNLRA TLELTECYLA QQQLPLAAWL AGFMYQLAQD HQQLLDGNDR RYHQTRLAEL QAILAEHNEY WQQGYQSSFE QLLAAIVV