Gene Information |
Locus tag | Haur_2114 |
Symbol | |
ID | 5734002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2654695 |
End bp | 2657313 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279255 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001544882 |
Protein GI | 159898635 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
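The coordinates and lengths in this table are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(2654695, 2657313)
print(gene_len, protein_len)  # prints: 2619 872
```

The result matches the Gene Length (2619 bp) and Protein Length (872 aa) rows.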
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.566849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAA TCGCAGTTAA TTCACCAGCG GCTTTTGGCA AGCAATTGCG CTTGTTGCGC CGCCGTGCTC GCCTAACCCA AGCCGAATTA GGGATTGCCG TAGGCTATAG CGATGCGCAA ATTTGTCGGC TTGAAACAGG CCGCCGTCCC CCCGATTTAA CCACCTTAAT TGCGTTATTT TTGCCAGCGC TCGATATCGA GGCGCAATCA CACGAGGCTC AGCAGCTTTT AGAATTAGCC GCAAGCGCCC GCGAGGAATT AATCGCTGAG CCAAGCCAGA ACAATCATGG TGCACTCAAT CAAGCACTGC CAACCTTGGC CAGCCACCTG CCTAGCCCAA CCAGCTACTT TTTAGGCCGT GGCGCTGAAC AACAACGCAT CTTGCAATGG CTGAATAATC CTACAATTCG CTTAATCAGC ATTGTGGGCT TGGCGGGCAT TGGCAAAACC CAGCTTGGCT TGCAATGTTT GCATCAATTT GCGAGCCAAA GCGAACAGCA GTGTGTTTTT GTTGATCTGG TGACCGCCAA TGACCCTGAA TCGATGGTTC AGGCGATCAA CAAGGCGCTC GAAATCAGCG AAAGCCCCGA TGAGCATCCC TTGAGTTTGG CAATTAGTCA GCTTGAGCAG CAACCAAGCT GTTTACTGTT GGATAATTGT GAACAGATTC AGGATGCTAG CCGGGTGATC AGCCTATTGC TCAGCGAAGT ACCAACCCTC AAATTAATTA TTACCAGCCA AGTAGCCTTG CGTCTCAGTG CCGAACATGT ACTACAACTC ACGCCCTTGG CCGTGCCCAA TTTGTTGGCA TTGCCGCCCT TAGCTGAATT AGCCCAAATC GAAGCTATGG CTTTGTTATT GGCCCGTTTG CAAGTGCATA ACCCCAAACT TGAATTGACC GCGAAAAATG CGTTGGCGCT TGCTGCCTTG TGTGTACGCG TCGATGGTGT ACCGTTGGCG CTAGAGTTAG TTGCTGCTTC AGGCCGTTTG TTCGACCCCG AAGCCTTGTT GAGCGAATTG GCCAGCCATT TTTTGAGCAT GCGACGGCGG GGCCGCGATT TACCGTCGCG CCACTACTCA GTCACAACCG CACTGACATG GAGTTATCAA CAGCTTGATT CAGCTAGCCA ACGTTTGTTT GAGCGGTTGA GCGTATTCGT TAGCGGTTGG ACGGTCGAGG CGGCCTTGGC GGTTTGTGGC CCAGAATACC AACGCCATGA GCTGATTGAA CAATTAAATG TGCTGCTTGA TCATAGCCTG ATTCAACAAC AAACCAACGA TGATTCGACG CGAATGAGTA TGTTGACGAT GGTACGGACG TTTGCTCAAG AGCAAGCCAA CAAGCATGCT GAACATGATT TGCTCAAGAG CCGCATGCTC GATTATTTGA TTGAACTAGC CCAGCAAGCC GAGCAACCAC TGCGTTCAGG CAATAACCAA GCTATGTGGA TTCAGCGGTT AGAGGCTGAG CACGATAATA TTCGGGCTGG CTTAAATTGG GCTTGGCAAC ACAATGCCCA TCAACGCGGC ATTCAGTTGG TTGGCTATGT ATGGCGTTTT TGGTATATGC GCGGCTATTT ACGTGAAGGT CGGCGTTGGT TTGAAAACCT ACTGATCAGC CATGAACCAA CTGCTGACGT TGATTTTGCC CGAGCACTCG ATGGGGTCGG CATTTTGGCT TGGAGACAAA GCGATTATCA GCAAGCTGAA CAATGGTATC AGCAAGCCCT TGCCATCTAC CAAACCACCC AACACACCGC TGGCCAGGCG CAAGTTTTGG GTCATCTGGG CTTAGTGGCG 
ATGGATACCG GAGCCTATGC CCAAGCAGCG GCCTACTACG AACAAAGTTT ACCGCTCTAT CAAGCCGTTG AGGATCAATC CGGAGTGGTT GCCACATTGC ACAATCTCGG CAATCTCTAT TGCCAACAAT CAGAAAATCA ACGGGCAAGC CAACTCTATC AAGAATGCTT ACAGATCTAT CAGGAGATGG GTGATCAATC GGGAGTGGCA TTAATTGCCT TGGGTTTGGG GGTGATCGCC CGTGATGAAC AACGTTTAGA TGCAGCGCAA GCCTCGTTTG AACAAAGCCT CAGCTTGGCG CGTGAGTTGG GCGATGATTG GAATGAAGCG ACGGCCTTAA TCAATCTTGG TAACATCGCG ATTGATACCA GCCAGCCCAA ACTTGGCTTG GAGCATTATC AAACCGCCAA ACAGATTTTC GAGCGTTTAG GCGATCAGCA ATCATTGTGT CTGATCGAAA ACCGAATTTC TAATGCTCAC TGGCTGCTGG GCGACTATGC CCAAGCCCAA GCTGGCTACC GTCAATGCCT CATGTTAGCC CATGCAATCG GCTTTGATGG CGGAATTATT GAAGGTTTAG AAGGACTAGC CCATTGCTTG AGCCAAACAT TGCCAACGAC TGCCGCCCAA CTCATGGCCT ACGCCGCCCA GATGCGTAGC ACCAAAGGCT ACCCAATTAT TCCTGCCGAT GAAGCTGGTT ACAACCAAAT TGGGCAAGAA ATTCAGGCCC ATTTAAGCAC CACCGCATGG CAACAGGCCT ACCAGCAAGG CCAACAGCTG AGTTTGCAAC GAGCAGTAAG TTTGGCGCTG GCCAATTGA
|
Protein sequence | MTQIAVNSPA AFGKQLRLLR RRARLTQAEL GIAVGYSDAQ ICRLETGRRP PDLTTLIALF LPALDIEAQS HEAQQLLELA ASAREELIAE PSQNNHGALN QALPTLASHL PSPTSYFLGR GAEQQRILQW LNNPTIRLIS IVGLAGIGKT QLGLQCLHQF ASQSEQQCVF VDLVTANDPE SMVQAINKAL EISESPDEHP LSLAISQLEQ QPSCLLLDNC EQIQDASRVI SLLLSEVPTL KLIITSQVAL RLSAEHVLQL TPLAVPNLLA LPPLAELAQI EAMALLLARL QVHNPKLELT AKNALALAAL CVRVDGVPLA LELVAASGRL FDPEALLSEL ASHFLSMRRR GRDLPSRHYS VTTALTWSYQ QLDSASQRLF ERLSVFVSGW TVEAALAVCG PEYQRHELIE QLNVLLDHSL IQQQTNDDST RMSMLTMVRT FAQEQANKHA EHDLLKSRML DYLIELAQQA EQPLRSGNNQ AMWIQRLEAE HDNIRAGLNW AWQHNAHQRG IQLVGYVWRF WYMRGYLREG RRWFENLLIS HEPTADVDFA RALDGVGILA WRQSDYQQAE QWYQQALAIY QTTQHTAGQA QVLGHLGLVA MDTGAYAQAA AYYEQSLPLY QAVEDQSGVV ATLHNLGNLY CQQSENQRAS QLYQECLQIY QEMGDQSGVA LIALGLGVIA RDEQRLDAAQ ASFEQSLSLA RELGDDWNEA TALINLGNIA IDTSQPKLGL EHYQTAKQIF ERLGDQQSLC LIENRISNAH WLLGDYAQAQ AGYRQCLMLA HAIGFDGGII EGLEGLAHCL SQTLPTTAAQ LMAYAAQMRS TKGYPIIPAD EAGYNQIGQE IQAHLSTTAW QQAYQQGQQL SLQRAVSLAL AN
|
| |
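The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any downstream analysis. The following is a minimal sketch of that step plus a GC-fraction calculation, applied only to the first two blocks of the gene sequence for brevity. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the GC content row of the record is not recomputed here.

```python
def join_blocks(displayed: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown in this record.
prefix = join_blocks("ATGACTCAAA TCGCAGTTAA")
print(len(prefix), prefix[:3], gc_fraction(prefix))
```

Passing the full gene sequence to `join_blocks` in the same way should yield a string whose length equals the Gene Length row, starting with the `ATG` start codon.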