Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4372 |
Symbol | |
ID | 5736929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5583474 |
End bp | 5587040 |
Gene Length | 3567 bp |
Protein Length | 1188 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281534 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001547132 |
Protein GI | 159900885 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00730956 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACAA ATTCCAATCA GAGTACATAC GATGCCGCCC AGATTCAAAT GTTGCGCGGC TTGGAAGCAG TACGCGAAAA CATGGGGATG TATCTCGGTG GCCAAGACAC GTCAGCATTA CATCACTTGG TCTATGAAGT TGTCGATAAC TCGGTTGACG AGGCCTTGGC TGGCTTCTGC GATACGATCA TCGTCGAGAT GCGGACTGAT GGGTCAATCG CTGTCGTCGA TAATGGGCGC GGGATTCCCA CCGATATTCA CCCAGTCGAG GGGCGTTCGG CCTTGGAAAT TGTGCTGACC GAGCTGCACG CTGGTGGTAA GTTCAAAGGC TCACAGGGCT ACAAGGTTTC TGGTGGTTTG CACGGGGTCG GGGTTTCGGC AGTTAACGCA GTTTCCGAAT TTTTGCGGGC TGAAGTTAAA CGCGATGGCA AGCTGTGGGC GCAAGATTTT CGCCTTGGCA TGCCTCAGGC TCCAGTCAAA GCGGTTGGCG ATGCTGAAGG CACGGGCACA ACGATTATCT TCAAGCCCGA TGCCCAATTA TTTACCACGG TTGATTTCAA CTATCGCACC TTGGCTAACC GTTTACGCGA TATGGCCTAC CTCAACAAGA GCCTGCGCTT CAAGCTCGTC GATTATAACA ACGACCGCGA AGTAACCTAT TATTTTGATG GTGGGATTGT CTCGTTTGTG CGCCATCTGA CGCGCGAAAA AGGCCCAGTG CTGGCTCAGC CGTTCTACGT CGAAAAACCT TATGAAAACG TCAATGTTGA AATTGCAATG CAATACACTG GCGATTTCAA CGAAAATCTT TTAGCCTTCA CCAATAACAT TGCCAACCCC GATGGTGGTA CGCACGTCAC TGGCTTCCGC GCGGCCTTGA CCCGCACGAT CAATGCCTAT GGTCGTAACA AAGGCTTGCT CAAAGAAGGC GATGCACTTT CGGGCGAAGA TGTGCGCGAG GGCTTGACCG CGATCATCAG CATCAAGCTG TTCCGCCCAC AATTCGAAAG CCAAACCAAA TCGAAGCTGG CAACGCCTGA AGCTAAAACC GCCGTCGAAA CCGTGCTCAA CGAAGCACTT TCAGCCTTCT TGGATGAAAA TCCTAACGAG GCCCGCCGGA TTATCGAAAA ATCGCTGTTG GCTTCACGCG CCCGCGATGC CGCCCGTAAA GCCCGCGATT TGGTGCAACG CAAAGGTGCA CTCGAAGGCT TTGCGCTGCC AGGCAAGCTC GCCGACTGCT CAGATAAAGA GCCTGCCCAC TGCGAAATCT TCATCGTCGA AGGCGATAGT GCCGGGGGAA GCGCAAAACA AGGCCGTGAT CGTCGTTTCC AAGCGATTTT GCCGCTGCGC GGTAAAATTC TGAACGTAGA AAAATCACGT TTGGATAAAA TGCTGGCAAA TAACGAAGTT CGAGCATTAA TCACCGCACT TGGCACAGGT ATTGGCGAAA CATTTGATAT TTCGCGTTTG CGCTACCACC GCATTTTGAT TATGAGCGTT GCTGGCGATG AGCCAACCTT GATTCGCAAT GCGCAAGGTC ATACCGAGTT TGTGCGCATC GGCGAGTTTA TCGATCAATG TATTGCAGGC CAACGTAGTG CTAGTGAATA CGAAGTGATC AGCTTCGATC AAAAGCGGCA TGTTGCGCGT TTCCGCCCAC TCAAAGCCGT GATGCGCCAC GCCAACCATG AGCCAATGTA CAAACTGACC ACACGCTATG GCCGCTCGGT CAAAGTAACT GCCTCGCATA GCGTCTTTGT GCTCGAAAAT GGTCAGCCTG TGCTGAAAAA GGGCGATCAA ATTCGACTTG GCGATCAGTT GGTTGCCAGC CGTCGGATTC CACGCCCAGC CAGCCAACCT CGCGAAATCG ACCTTATGAA GTTGTTTGCT GAAGCAGGCT TGATTGATAA CCTGTATTTG CGCGGCGAAA GCGTGCGCAC GATCGCAGCC CAACGGGTGC TCAACCACGT TTCTCAGCCT GAACAATGGA ACGAAGCACG GATCAACTTG CATACTGCTG CATGGGAACG CTTGGTCGAA TATCGCCAAG CCAATGGCCT CAGCCAAAAA GCAGTTGCCC AACGCCTTGG TGTGAAGCAA GCGATTACGA TTAGCCAATG GGAACGCGGC ATGCTGCGGC CTATTCAATC ACAATTTGAC AACTATCTGG CAACGATTGG CTGGGAAGAA CCAATCAGTT ACGAACTTGT GCCATCGAAG ATCGAGCGCT TGTTGTTACA AGATGATAGC AGCGCCAACG CTCGCTGGCG CGAAGTGAGC AACTACAAAG CCTTTGATAG TTTCAGCAAC GATGAGCTAG ATTTGCTTGA TCGTGACGTT GAGCTTGTAC CGCAAGCCCA CGACGATCGA GCTTTCCCAC GCATGTTGAG CATCACCCCA GAATTGCTGT GGTTCTTGGG CTGGTTTACT GCTGAAGGCT CCTTGAGCAA GCATCAAGTC AGTTTGTCGT TAGGTCAAAA AGATGCAGCC TTCTTCGACG AGCTAAAAAC CACGATCGAG CAACTCTTTG GCGAAACGCC ACGCTTCTAT CAATCGCCTG ATAATGGCGG GATCAAATGC TATTTCCATA GCGTGTTGGC GGCGCGACTG ATTCGAGCCT TAGGCTTGGG CGCGGTTGCT CATCAAAAAC GTGTGCCAAA CATGCTGTTC AGCCTGAGCA ACGATTTGCA ACGCAGCTAT CTCGAAGGCT ATTTCCTTGG CGATGGCACA CTCAGCGATA GCACAATCAG CATGACCACC AACTCGACTG AACTCAAAGA TGGCTTACTC TATTTGCTTG GTCAGCTTGG TGTTTTTGCT GGCGTGAGCA AGATCAAGCC CAACTTACCA GCCGATGCAC CAATTCAAAC CGTGCACGAC TACTACAATA TTGCAATTAG CGGCAAGCAA CAGCTAGAGC AATTGAGTGG GGTTTGGCAG CGCCATCATT TGGCTGCCAA AGTCGAGGCA CATTTGGCCA AACCAGCGAC CAAAGCTCAA GCATTCACGC CATTAAGCGA CGATATGGTT GGCTTAGAAG TGTTGGCAGT TGAAGAATTG GCTCCAACTG GCGAGTTCGT CTACGACTTC TCGGTCGAAG AAGACGAAAA CTTCCTGTGT GGCACTGGCG GTTTATACGC TCACAATACC GACGCAGACG TTGACGGCAG CCACATCCGC ACCTTGTTGC TGACCTTCTT CTTCCGCCAT ATGCGCGATT TGATCACCAA TGGGCACTTG TATGTAGCTC AGCCGCCATT GTTCCGCGTA CAACATGGCA AGGCCTACAA ATATGTCTAC GATGAAGCCA CCCGCGATGA GTACATTCGC TCATTGCCAG CTGGCACCAA AGTCACCGTT CAGCGCTTCA AAGGGCTAGG CGAAATGAAT CCCGACCAAC TGTGGGACAC CACGCTCAAC CCGGGCAATC GCATGATTTT ACAAGTGACA GTTGAAGATG CAATGGAAGC CGATGAAACC TTCTCGATGT TGATGGGTGA AATCGTGTTA CCGCGCAAGC GCTTTATCCA AACTCACGCC GCCGACGTGA AGAACTTGGA TGTGTAG
|
Protein sequence | MATNSNQSTY DAAQIQMLRG LEAVRENMGM YLGGQDTSAL HHLVYEVVDN SVDEALAGFC DTIIVEMRTD GSIAVVDNGR GIPTDIHPVE GRSALEIVLT ELHAGGKFKG SQGYKVSGGL HGVGVSAVNA VSEFLRAEVK RDGKLWAQDF RLGMPQAPVK AVGDAEGTGT TIIFKPDAQL FTTVDFNYRT LANRLRDMAY LNKSLRFKLV DYNNDREVTY YFDGGIVSFV RHLTREKGPV LAQPFYVEKP YENVNVEIAM QYTGDFNENL LAFTNNIANP DGGTHVTGFR AALTRTINAY GRNKGLLKEG DALSGEDVRE GLTAIISIKL FRPQFESQTK SKLATPEAKT AVETVLNEAL SAFLDENPNE ARRIIEKSLL ASRARDAARK ARDLVQRKGA LEGFALPGKL ADCSDKEPAH CEIFIVEGDS AGGSAKQGRD RRFQAILPLR GKILNVEKSR LDKMLANNEV RALITALGTG IGETFDISRL RYHRILIMSV AGDEPTLIRN AQGHTEFVRI GEFIDQCIAG QRSASEYEVI SFDQKRHVAR FRPLKAVMRH ANHEPMYKLT TRYGRSVKVT ASHSVFVLEN GQPVLKKGDQ IRLGDQLVAS RRIPRPASQP REIDLMKLFA EAGLIDNLYL RGESVRTIAA QRVLNHVSQP EQWNEARINL HTAAWERLVE YRQANGLSQK AVAQRLGVKQ AITISQWERG MLRPIQSQFD NYLATIGWEE PISYELVPSK IERLLLQDDS SANARWREVS NYKAFDSFSN DELDLLDRDV ELVPQAHDDR AFPRMLSITP ELLWFLGWFT AEGSLSKHQV SLSLGQKDAA FFDELKTTIE QLFGETPRFY QSPDNGGIKC YFHSVLAARL IRALGLGAVA HQKRVPNMLF SLSNDLQRSY LEGYFLGDGT LSDSTISMTT NSTELKDGLL YLLGQLGVFA GVSKIKPNLP ADAPIQTVHD YYNIAISGKQ QLEQLSGVWQ RHHLAAKVEA HLAKPATKAQ AFTPLSDDMV GLEVLAVEEL APTGEFVYDF SVEEDENFLC GTGGLYAHNT DADVDGSHIR TLLLTFFFRH MRDLITNGHL YVAQPPLFRV QHGKAYKYVY DEATRDEYIR SLPAGTKVTV QRFKGLGEMN PDQLWDTTLN PGNRMILQVT VEDAMEADET FSMLMGEIVL PRKRFIQTHA ADVKNLDV
|
| |