Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5030 |
Symbol | |
ID | 5736989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 40715 |
End bp | 43885 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641282197 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001547788 |
Protein GI | 159901542 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.311949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCGAC AATTACCCAT ACCCTATTTG CCGCCGGACA CTTCTGCCGC CGCCGACTTT CGGGTGTACG TGCTTGGCAC TCCCGTCTTG CTTTGGGCCA ATACTCCCTT CTCGATTGCT CGGCGCCAAG CCCGCGCGCT GCTGTATCGT TTGGCCTCAG ATCTGAACCC AGTGGCACGC ACCGAGCTAG TTTACCTTTT CTGGCCCGAC ATGCCCGATC TGACCGCACG CCGCCATCTG ACCCATCTGC TGACCCGTCT TCGCCAGGAC ATCCCCGATC CTCGCATACT CATCGCCACA CCTGATCATG TTACCCTTGA TCCTGAACGC GTCTGGATTG ATAGCTCCGT TTTCATGCAT GCGGTTGCCA CCACTGATCC TGAGCACCGA CTGGAAGCCC TTGATCATGC AGTAACCCTG GTACGCGGCC CCTTTCTCCA TGGGGTGGCC CTCGCCGATG CCCCGGAATT TGAGCTGTGG CTGGCCCAGG AACGCAGCAA CTGGGAGAAC CGCACGTTGG CTGTGCTTGA CACCCTACTT GAGCAGGCCA CTGTGGTACG CAACTATCCG CTGGCTATCC GTACTGCCCA ACAATCCCTA GCCTTGAACC CGTTGGCCGA AGATGTCCAT CGACACCTGA TCGGCCTATA TGCAACCATC GGTGATCGCG GTGCTGCCGT GCGGCACTTT GAGCACTGCC GCACCTTACT TGAGCAGGAG CTTGGTGTCA CGCCTCTACC CGAAACGTTG GCCATCTACG AGCAGGTTCG TGCGGGCCAG AGTCCCTTTC CCTCTGAAAC GGCGGGTCAA CGAGGCCTGA TCCTAGCGGG AAACGCGGTG ACTCACCCAG TCGATTTCCC TGCTCGTCCC GAATCAGTGA TCAACCCCGA TTCGGCGGAC GAGGCCCCCG TCTGGGCACC TGATGTCACT GAAAAGGGCC AGATGTTGGT TACTAAACCA TTGTATGGCC GCGATGCCGA GGTAGCGATC ATAACAACCA TGCTCGCGAG TGCGATGCCT CGCCTGATCA CACTGAACGG TCCCGGCGGC AGTGGCAAAA CCCACCTCGC ACAGCAAATC GCCGCCAGCG TCGATTTCCC GGATGGGGTG GTCTGGGTCG CGCTTGGATC TCTGCGCGTA CCGGGATTAC TTCGCGATGC CATCGCCTAC GCCTGCGGGG TTCGCACCAC TGGCTGGAGC GCAGCCAGTG CGGCTGTGGC AGATCAGCTA CATACAGCGC TCCAGCCCAA ACACATGCTT TTGGTGCTCG ACAACGCTGA ACATCTCCTA GACGGGACAG GCGTGATCGC GGAGTTGTTG GTGGCAGCAC CAAACCTACG GGTTCTGGTG ACCAGCAGGG TCGCATTAAA TTTGCCAGGG GAGCAACTAG TTCCTGTGCC GCCACTGCCC GTCCCATCCC TAGCCACATT GCCACCGATC GAGCAACTGG CGCTCCAGCC AGCCGTAGCC CTGCTGATTC ATCGCGTACG CGAACGCCAG CCTTGGTTCA TGTTAAGTGA GGAAAATGCA GCAGATATCG CCGCCATCTG CGTGCGCTTG GATGGACTGC CGCTGGCCCT TGAACTGGCC TCCACCCGGC TTATCACCCT TACACCAGGT GCCGTGCTTG CACGACTGAA CCATCGCCTG ACGTTGCTCA CGCGCGGGCC ACACATTCTT CCAGAGCGCC AGCAAACCTT ACGGGCCACG ATTGACTGGA GCCATCGATT GCTTGATCTG TCGGCGCAGG CGATGTTTGC CAACCTAGCA GTCTTCGCCG GTGGGTGGTC ATTGGCTGCT GCCGCAGCCG TGATGCAGCA CCAGATGCCA GCCGCTGCGC CGGTGCCTGA CGAGATCGCC GTGTTAGATC TGATGCATGG ATTGCTTGAG CACAACATGA TTTTTTCAAT TCCGGGGGAG GAGCCGCGTT TTGACATGTT GGATACCCTG CGCGAGTACG CCCAGGAGCA ACTGAAAGCA CGCGGTGGTG CTGGCACGGC GGACGAGGCG CATGCCACGT TTTATCGTGA GCTCGCGCTT CGTGCCACAC TCCACATCCA GGGCGAACAG AGTACGGCCT GGCTGACTAT GCTTGCACTG GATCATGATA ATTTGCGCGT GGCACTGGCA TGGTTTCTCG GGCAACCCGA TGGCGGTGCC GGCGCTATCG ATATCACTAA CATATTGCAA GTGCTTTGGC GCTGGCGCGG TGAGTACCAT GAAGCCCGCC ATTGGATACC ACAGGTGCTT GATCACAGTC GGGGATTGGC ATCGGAGAAA CATATCCATC TGCTCATGAT TGCTGGGAGT GCTGCTGCGT TTTATGGTGA TCGGGATACG GCGCTGGACT GGTTCACTGA GGGTCTTGAT CTGTGCCGCG ACGTTGCAGC TCCGCTCATT GAGTCAGATT TGCTGATCAA CATGGGGCGA ATCTATTGCT ATCGTGGAGA TTTTGTGCAC GGCTGTGCAC TCTCTGAAGC AGCCCTCGTC CTCAGTCGTA AGACAGAAGA CCCACTGCAT CTCGTCAGGA CGATTCGCAA CCTTGCAGCC GCGCTCTGCG GAGGATTGGT CAATATTGAA CGTGGCATGG CCCTGTTTGA AGAAGGATTA ATCATTGCAC GTGCAACAGT CGATAGCGGT GCCTACAGTC AGGTTACCCT AGCGATCTTC CTCATTGATT TTGGATCTTA TCTGGCGTTG ACAAACCAGC ACACACGCGC GGCGATGCTC CTCGCCGAAG CCTTGGCGTT GGCCGAGCAG AACGATCATG CTATCGCCAG AGTCAACGCC CTGGGAAGTT TGGGGTTTCT TGCGTTGCTG CAGAGTGACC GCGCAACAGC GTGCCGCTAC TTTCTGGACA GCCAGCAGCT CATCAATGGA ATCTCGGCAC CCATGAATAC AGTCATGAAT ATTGAAGGAC TCGCCGAAGT GGCTGCGGAA AAACATCCGC GCCTGGCTAT TCAGCTGCTG GCCGCAACCA GCGCGGCACG TACGGCGCTG GATATGCAGA TCGAACCCAT CGAACAGCGG CATCGTGCCC AAGTCCTCGC TGACTTGCGA CACGAGCTTG GCGACGAAGA GTTCTCGATA GCATGGTATC AGGGTCAGGA CTGGACTGTA GAGCAGGCCC TAGCAGCAGC ACGCACTGAT ACTGGAGAAA AACAAGGGTA A
|
Protein sequence | MLRQLPIPYL PPDTSAAADF RVYVLGTPVL LWANTPFSIA RRQARALLYR LASDLNPVAR TELVYLFWPD MPDLTARRHL THLLTRLRQD IPDPRILIAT PDHVTLDPER VWIDSSVFMH AVATTDPEHR LEALDHAVTL VRGPFLHGVA LADAPEFELW LAQERSNWEN RTLAVLDTLL EQATVVRNYP LAIRTAQQSL ALNPLAEDVH RHLIGLYATI GDRGAAVRHF EHCRTLLEQE LGVTPLPETL AIYEQVRAGQ SPFPSETAGQ RGLILAGNAV THPVDFPARP ESVINPDSAD EAPVWAPDVT EKGQMLVTKP LYGRDAEVAI ITTMLASAMP RLITLNGPGG SGKTHLAQQI AASVDFPDGV VWVALGSLRV PGLLRDAIAY ACGVRTTGWS AASAAVADQL HTALQPKHML LVLDNAEHLL DGTGVIAELL VAAPNLRVLV TSRVALNLPG EQLVPVPPLP VPSLATLPPI EQLALQPAVA LLIHRVRERQ PWFMLSEENA ADIAAICVRL DGLPLALELA STRLITLTPG AVLARLNHRL TLLTRGPHIL PERQQTLRAT IDWSHRLLDL SAQAMFANLA VFAGGWSLAA AAAVMQHQMP AAAPVPDEIA VLDLMHGLLE HNMIFSIPGE EPRFDMLDTL REYAQEQLKA RGGAGTADEA HATFYRELAL RATLHIQGEQ STAWLTMLAL DHDNLRVALA WFLGQPDGGA GAIDITNILQ VLWRWRGEYH EARHWIPQVL DHSRGLASEK HIHLLMIAGS AAAFYGDRDT ALDWFTEGLD LCRDVAAPLI ESDLLINMGR IYCYRGDFVH GCALSEAALV LSRKTEDPLH LVRTIRNLAA ALCGGLVNIE RGMALFEEGL IIARATVDSG AYSQVTLAIF LIDFGSYLAL TNQHTRAAML LAEALALAEQ NDHAIARVNA LGSLGFLALL QSDRATACRY FLDSQQLING ISAPMNTVMN IEGLAEVAAE KHPRLAIQLL AATSAARTAL DMQIEPIEQR HRAQVLADLR HELGDEEFSI AWYQGQDWTV EQALAAARTD TGEKQG
|
| |