Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2207 |
Symbol | |
ID | 5734094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2804459 |
End bp | 2807164 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279348 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001544975 |
Protein GI | 159898728 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGATC GGCCAATCTT CCCTGAATGG ATCAAACAGG CTCGTAAGCA ATTGGGCCTT GGCAGAGCCG AGTTAGCCCA CCAAGTTGGC TGTTCGTTGG CTATGTTACG TCAGCTTGAG TATGGTACGC GCCGCCCTTC GCGCCAATTG GCCGAACGGC TCGCCAGCTT TTTGCAGATT CCAGCCGAAC AGCAATCACT CTTTGTCCAA ACTGCGCGGG TGCATCAACC ACCAAGCCCA ATCAACCCCG TTCCACTCGT GAAACGCCTG CCTGAGCCAA CCCAACCCTT AATTGGTCGC GAGGAATTAT TGGCCCATGC CAGTCAGCTA TTGCTCCAGT CAAGCACCCG TTTGCTGAGC ATTGTCGGCC CAGGTGGTGT TGGTAAAACC CATTTTGCAA GCCAGCTTGC CCAGCAGATT CAAGCACATT TCAGTGATGG GAGCTTTTTT ATTGGCTTAG CTTCGTTGCA CGATGCCGAA CAGCTACCAA CTCTGATCGC CCAAACACTT GAAATTCCCC AACCAACTCA TCAATCAACC TTAGAGCAAT TGATCGGCGC AATTGACCAA CGATCGATCT TGCTGATGCT CGATAATCTT GAGCATGTGA TGATGGTTGT GCCAATTATC AGTCAATTGA TCACCCAATG TAACAAGCTC AAAATGCTGA TAACCAGTCG TTTTGCCTTG AAATTACACG ATGAACACCT GATTGATTTG CCACCGCTCG ATGTACCACA ACACCCACCT GAGCACCAAG CTAGCGCCGA AACCAACTAT TCCGCCGTGG AATTATTTGA GCTTCGCGCC AAAATGGTGC AACCTCAATT TAGCCTAACA GCCCAAAATC GCGCGATCGT TGGCGAAATT TGCCGGCGAT TAGATGGCTT ACCCTTAGCA ATCGAGCTTG CTGCGGCGCG AATTCGTGGC CTACCACCCC AAGCAATGCT CGCGAGGCTC GATCGCCGGC TCGAATTGTT CGATCAAGGC AATAGCGATT TGCCTGAGCG GCATCAAACC TTGCGCAATT TAATTGCCTG GAGCTACACG CTGCTCACGC CCAACGAGCA AACTATTTTT CGCACGCTCA GCCTGTTTGC CAACCATTGG ACATTAGGTG CCGCCGAATA TCTGTGTCAA GCTCAAATTG CCAAACCCCA AGTTCTCACA ATCTTGCTCA ATTTGCTTGA TAAAAGTTTG GTGCGCGAAG AAAGCTCAAA TGATGGCATC GCCACCTTTG CCATGCTCGA AATTATTCGC GAATATGGGC TAGAGCAGCT TGAACAAACC AGCGAGGCTC ACGCATTGCG CTGGCGACAT GCCCATTATT ACATCCAACT GGCCCAACAC GCCGAGCAAC AACTCGGCCA ACAAGAAGCA TTATGGCTTG AGCGCTTGAC GTGGGAGCGT AGCAACCTTT GGGATGCGCT CAATTGGCTT GTGGCGCAGC AAGCAGCCGA AGCATTATTG CAGCTGATTG GTTCGCTCTG GAAATTTTGG CAAATTCGGC ATCTCTGGCG TGAAGGTTTG CACTGGATTG AGTTGGCTTT GGCCTTACCG CTGGTCAATA GCGAAGCTTA TCAACAAGCC CGTGCCAAAG TGCTTTGGGG TGCTGGTTGG TTGGCAGTCG ATTTACATCA GCATGATTTG GCACAAGCCA TGTTCGAGCA AAGCCTGCAC CTCGCCACCA GCCTCGACGA TCAACAGGGA ATTGCCCGAG CGTTGCATGG GGTTGGCTTA TTGGCCGAGT GGGCGGGGCA GCGCGATTTT GCCATGCGAG CCTATCAAGA AAGTTTAAGT CTCTTCCGCC AACTCGACGA TCAAGAGGAG ATTGCTTGGT CGTTGTTCCA TCTGGGAGCA GCCTTACAAT CACAGGGTCA AGCCAAACAA GCGCGGCATT TTTTAGAAGC AGCCTTAGCG ACCTCACGCC GCTTGGCCCA TTCGTGGAGC ATCGTGCATC AAACCAAAGC ACTCGGCCAA ATGGCGATTG ATCAAGGGCG GTATGCTGAT GCCGAGTTAT TGCTCAACGA AGGTCTCAAG CTCGCTCAAA ACCATCAATA TCAACAAATT TATCTTGAGA TTTTGCGGCA TCTTGGGCGT TCGGCATTAG AGCAAGGCCA CTATCAACAG GCCCGTGAAC GGTTTCAAGC CAGCCTTGAG CAAGCCCAAC TACTTAAAGA ACCCTCGGCG ATTCGTTGGG CCAGCATTCA CCTCAATTGG CTGGGTATTC TCGAAGGCGA TTTGACCCAA GCCTATGCGT TTGAGCAACA ATTGGCAATC TTCGAACGCG ACGAGCCTGC TTGGGCAATT GCCTGGCTCA AGGCGCGTTT GGGCACAATT GCTTTACTCA AATGCCAGCC TGAAGTAGCC CAAAGCTGGT TTCTGGCTAG TATTCAAATG TACCAAGCCA ACGATTTGCC ATGGGGCTTG GTCGAATGTC TTGAAGGCTT GGCACATAGC CTATGGTTGA GCAAGCAAGC CCATCAAGCC GCAGCCTGCT TGCAATTGCT GAGCGCTGCC GAACAACAAC GCCAAAGCTT ACCACGCCTG CGCTCCGTAC CGGAACAAGC AGCATGGCAA ACCAGCCTTG CATGGTGTCA ACAACAACTT AGCCCTGAGC AATTTGCCCA CATTTGGCAA ACTGGGGCGA CCCAAACGCT TGAACAATTG CTCAAGCCTT TTAATCACGA AACTCAATCA GCTTGA
|
Protein sequence | MDDRPIFPEW IKQARKQLGL GRAELAHQVG CSLAMLRQLE YGTRRPSRQL AERLASFLQI PAEQQSLFVQ TARVHQPPSP INPVPLVKRL PEPTQPLIGR EELLAHASQL LLQSSTRLLS IVGPGGVGKT HFASQLAQQI QAHFSDGSFF IGLASLHDAE QLPTLIAQTL EIPQPTHQST LEQLIGAIDQ RSILLMLDNL EHVMMVVPII SQLITQCNKL KMLITSRFAL KLHDEHLIDL PPLDVPQHPP EHQASAETNY SAVELFELRA KMVQPQFSLT AQNRAIVGEI CRRLDGLPLA IELAAARIRG LPPQAMLARL DRRLELFDQG NSDLPERHQT LRNLIAWSYT LLTPNEQTIF RTLSLFANHW TLGAAEYLCQ AQIAKPQVLT ILLNLLDKSL VREESSNDGI ATFAMLEIIR EYGLEQLEQT SEAHALRWRH AHYYIQLAQH AEQQLGQQEA LWLERLTWER SNLWDALNWL VAQQAAEALL QLIGSLWKFW QIRHLWREGL HWIELALALP LVNSEAYQQA RAKVLWGAGW LAVDLHQHDL AQAMFEQSLH LATSLDDQQG IARALHGVGL LAEWAGQRDF AMRAYQESLS LFRQLDDQEE IAWSLFHLGA ALQSQGQAKQ ARHFLEAALA TSRRLAHSWS IVHQTKALGQ MAIDQGRYAD AELLLNEGLK LAQNHQYQQI YLEILRHLGR SALEQGHYQQ ARERFQASLE QAQLLKEPSA IRWASIHLNW LGILEGDLTQ AYAFEQQLAI FERDEPAWAI AWLKARLGTI ALLKCQPEVA QSWFLASIQM YQANDLPWGL VECLEGLAHS LWLSKQAHQA AACLQLLSAA EQQRQSLPRL RSVPEQAAWQ TSLAWCQQQL SPEQFAHIWQ TGATQTLEQL LKPFNHETQS A
|
| |