Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1608 |
Symbol | |
ID | 5733510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1866787 |
End bp | 1869195 |
Gene Length | 2409 bp |
Protein Length | 802 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278747 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001544379 |
Protein GI | 159898132 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAAG CACGATCATT TGGTCAACAA TTGCGCGACT ATCGCCATCA ACGCCAACTC ACTCAAGCGG CTTTGGCCGA GGAAGTTGGC TGCGCCATCG AGAGTATTCG CAAAATGGAG GCTAATCGCC AGCGACCATC ACGCAGTTTG GCGGCTCGTT TAGCCAGAAT TTTGCAGTTA TCAGCCGAGC AAAGCCAGAT TTTTTGCGAC CAAGCCCGAA CGGTTGGCAC TGATAGCGCC AATTCAGCGC CAAAACCAAG TGGCTTGCCA TTAACGGCGA CCAAGCTGAT CAATCGCCAA ACTGAGCTGG CAACGCTACA AAACTATCTC AACGCTGAGC ATATTCGGAT GATTACGCTG ACTGGCCCAG GTGGCGTAGG CAAAACCCGC CTTGCGCTGC AAATTGCCCA GCATAGTCAC AAGCATTTCC CCGATGGGGT GTATTTTGTC GATTTAGCTC AAGCAAGCAG CTTGGCGGAT ATTGGTTTAG CCCTCAGTCA AACGCTCAAT CTGCCCAGTA GCAAATACGC TTGGCAACGC CACATTCAAT TGCACTATCA ACAAGCCCGC ATCTTGTTGA TTCTCGATAA TGTTGAGCAA TTGGTCAGCG CTGCCGAGCA TTTCCGTGGT TTGCTTGACC ATACCAGCCA GCTCAAATTG CTCTTGACCA GTCGCACGCT GTTGCATTGC GCTGGTGAAT ATGCGATTCC GCTGACACCG CTGCGCTTGC CAACTGCCGA GGCCAGCCTT AACGAGCTTA AAACCAATCC CGCCGTTCAG CTTTTTGTCC AACGAGCGCA AACGCTCAAC CCACAGTTTG CCCTGACCAA CCACAACGCC GAAGCAATCA AACAGCTTTG TTGGCAAGTT GATGGCTTGC CTTTGGCCTT GGAATTGGCG GCGGCTCGCA CCCGTTTGCT CACGCCTGAA GCCTTGTTGG CTTATTTGCA ACCGCCCTTG GCCTTGCTCA GCACCAATGA TCCAACGGCT CCAGCTCGCC ACCAAAGTAT GTACAACGCC ATTAATTGGA GCTATCAGCA AATTTCGCCC AAGCAGCAAC AGCTTTTGCG CCAACTAGCA ATTTTTCAGG CTGGATGTAC TTTGGATGCA ATTCAGGCTA TCGTGCCAAA CAATAATCAG CTTGATCTGC TTGAACAATT GGCAGGCTTA ATTGACCATA GTTTGCTGAA CATGCAGGCT GAAGCTGAAC AGCCGCAACG TTTTAGCATG CTCAGTTTGA TCCACGAATT TGCCGCGCAG CAATTGGCCG AACAAGCCGA ATTTCCCGAA CTCGCCCAAC AGCATCTCAA TTATTATGTC ATGTACTGCG AATCGCTCAG CCAACAAGTT TTCACGGCAC GCCAAGCGCT TTTATCGGAG CGCGAGAATA TTCGGGCTGC AATTAACTGG GCAATCAGTA CTCAGAATTG GGTTGCAGCC AGCAGTTGCA TTTTGCCCTT GGCCGAATTT TGGTATCGTT ATGGAGCCGC TGAAGAGTTA CAAACGTGGC TGGCTTGGCT CCGCAGCCAA CCAATTGATT TAGCAACTCA AGCCCGTTGC AACGAAATGC AGGGCTATAT TGCAGCCTTT TTGCAAAGCC AATATCGCGC TGGTCAGGCG TGGTATCAAC AAGCGTTGGC GCAACGCCAA GCCCTGCAAC AAGCGGCGGC CATCGCCGAC AATCTCGCCA AATTGGGCGA AGTTGCGATG GAGCAAGGCC ATTATGCCCA AGCGCTTGAA CGCTATCGCC ACGCTTGTAG CATGCATGAA CAACTTGGCG ATCAAGCTTC AGTGTTTGCC ATGCACGATT GCCAAGCTAT GGTCTTGCTG CGTCAAGGTC AATTTGGCCA TGCCCAACAG CTGTTACAAC AAAGCTTAGA TTATTGGCAG CAACAACAGA TTTTGCCCAG CCTTGCATTT AGCCTGAATT ACCTTGGGAT GATTGCCTTT TATCAAATGC GCTTGAGCAA AGCCCAACAG GCGCATGAGC AAGCCTTGGC AATTTGGCAA ACCCTCGATG ATCAACGCGG GATTGCCTCA GCCTTGAATG CTTTGGCTCC AGTCTTGTTG CACCAAAACC AAACCGCTGC TGCACTGGCA GCAATCAAGC AAAGCTTGCA AATTCGCTGG AGCCTGCACG ATTACGATGG CCTCGCTTGG AATTTAGAGC GGTTTGGTGA AATTTTGAGC AAAGTGCATC AAGCTGAATT GGCGATGCAA TGTTGGAGCA AAGCCAAGCA ACTCCGCGAT GAACTAGCCT TGCCCTTGTT TGAGGCCGAA CAAAAACGTT TGCAAATCTA CATTAGGCAA ACTAAGCAAC AATTAACCTC CGCTCAAGTG CAACAGCTTT GGTTGAGCGG CCACAAGGTA GCGTTAGCGC AGCTAATTCA AACCCTCTTA ATCACTTAA
|
Protein sequence | MPEARSFGQQ LRDYRHQRQL TQAALAEEVG CAIESIRKME ANRQRPSRSL AARLARILQL SAEQSQIFCD QARTVGTDSA NSAPKPSGLP LTATKLINRQ TELATLQNYL NAEHIRMITL TGPGGVGKTR LALQIAQHSH KHFPDGVYFV DLAQASSLAD IGLALSQTLN LPSSKYAWQR HIQLHYQQAR ILLILDNVEQ LVSAAEHFRG LLDHTSQLKL LLTSRTLLHC AGEYAIPLTP LRLPTAEASL NELKTNPAVQ LFVQRAQTLN PQFALTNHNA EAIKQLCWQV DGLPLALELA AARTRLLTPE ALLAYLQPPL ALLSTNDPTA PARHQSMYNA INWSYQQISP KQQQLLRQLA IFQAGCTLDA IQAIVPNNNQ LDLLEQLAGL IDHSLLNMQA EAEQPQRFSM LSLIHEFAAQ QLAEQAEFPE LAQQHLNYYV MYCESLSQQV FTARQALLSE RENIRAAINW AISTQNWVAA SSCILPLAEF WYRYGAAEEL QTWLAWLRSQ PIDLATQARC NEMQGYIAAF LQSQYRAGQA WYQQALAQRQ ALQQAAAIAD NLAKLGEVAM EQGHYAQALE RYRHACSMHE QLGDQASVFA MHDCQAMVLL RQGQFGHAQQ LLQQSLDYWQ QQQILPSLAF SLNYLGMIAF YQMRLSKAQQ AHEQALAIWQ TLDDQRGIAS ALNALAPVLL HQNQTAAALA AIKQSLQIRW SLHDYDGLAW NLERFGEILS KVHQAELAMQ CWSKAKQLRD ELALPLFEAE QKRLQIYIRQ TKQQLTSAQV QQLWLSGHKV ALAQLIQTLL IT
|
| |