Gene Information |
Locus tag | Haur_3683 |
Symbol | |
ID | 5735562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4633929 |
End bp | 4637033 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280835 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001546447 |
Protein GI | 159900200 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
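The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon excluded from the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return gene_len, codons - 1  # drop the terminal stop codon


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(4633929, 4637033)
print(gene_len, protein_len)  # 3105 1034
```

Under these assumptions the result agrees with the Gene Length (3105 bp) and Protein Length (1034 aa) fields.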
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00198659 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCGTTG CCCTGCGTTT CCTCGGGGTT CCTTCAATCC TCTACAATCA GCAAGCACAG TCGCTTCCCA GCAAAGCGGT AGCCCTCTTG GGGTATCTTG CAGCAACCAT CCAACCTCAA CGCCGCGAAC ATCTTCTGGC ATTGCTGTGG GCTGAAAGTA GCGATGAAGC CGCCCGCAAA AATTTACGCA ATACGCTATG GACAATTCGG CGCAGTTTAG GTGGTGAGAT CATCTTAGCC GATGAGGATC GCTTGCGTTT GCATCCGGAT TGTGCGGTTG ATCTTTGGCA GTTACGCAGT TTGGCAGAAG GTTTGGTTTC ACCAACCGTT GAGCGTTTGT TGCAGCTTGC CCAAGGGCCA TTGCTCGATG GCGTGATGTT ACCCGACGCT CCCGATTTTG ATTTATGGCT GGTGACCGAG CGCGAACAGG TGCATTTAAC CACGATGCGC TTATTTCAAA CCGCGATCAG CAATTATCAA AAACAACAGC AATGGGTGCA TGTGTTGACT TTAGCTCGCG CCGCCCTGCG TTTCGATCCG CTCTCCGAAT CGTTGTATCA GGCGATTATC GAGGCCCATT GGCGGCAAGG TGAACGCGCC GAGGCCTTGC GCCAATATGA GATGTTGGCC AATACGCTGC AGCGTGAGTT AGGGATCGAT CCCTTGCCTG AAACCCAAAT GCTGCGCCAG CGCATTTTGC AAACCAACCA ACTGATGAGT ACAAACAACG CATCAGAACC AGCCAAACCT GTGGTCACAG CGCCGATTGC TGAGCCAAAG CCCTTGCTCA AGCCAATTAA ACGCGCCGCC CCCCGCGCCG CGCCCTTTGT TGGGCGGCAA TATGAGCAAT TACTGCTTAA CCAAGCCTTG ATGCGCAGCA ACGAACGGGG CTTGCAAGTG GTGCTCATTA CTGGCGAAAT TGGGGTTGGC AAATCGCGGC TCTGGCAGGA ATGGGCACAG CAATTACCTG CTGATAGCAC CATGTTGGCG ATGCACTGCC TTGAATCGAC CCAAAGCTTA CCGTTTGCCC CGTTGACCGA ACTATTTGGT CAGCGCATCT GCCTATCACG GCTTTTTTCG GGCGATTCAG CGGTTGATCC GATGTGGTTG GCCGAGGTAG CGCGATTACT GCCGCAATTG CGTGGGCATA TTCCCAATTT GCCAGAGCCA CCAGTGTTAC CCGCCGATGA AGAGCGACGA CGTATGTTCG AGGCTTTTGC CCAATGTTTA CGGGCAGTAA TGACCAATCA TCTGGTAATT GGCATCGACG ATTTACACTG GGCCGACCAA GCCACCGCCG AATGGCTCGA TTACGTCGTT GATCGCTTGC ACGATTTGCC GATTGTGTTG GTGGTGACCT ATCGTTCGGA AGAGGCCAAC GCCCGTTTGC AACGCTTGGT GGTTGGTTGG CAACGCGGCA GTTTGGTCAC CCGCATCGAC GTGCCACGGC TCGATCTCAG CGAAGCACGT GAACTTTTGG GTGCATTGGG TCATCCTAAT CCTGATGATC AGACCTTGCT CGAACGTAGC GCGGGTAATC CATTATTTTT GCTCGAACTT TGCCGCACGG CTGATCCGGC TGATGTGCCG CCGATGTTGG TCGATGTGCT CAGCGCTCGT TTGGCGCGTC TGCCAGAACG AGCAGCCCAA ATTGTCCAAG CGGCAGCGGT GCTTGACCCC TTGTTTGGCT ATAGTATTCT GCGCGAAACT GGCGGACGTT CCGATGAGGA AACCCTCGAT GGGCTTGATG GGTTGCTCAG TGCCGCTATC TTAAAAGAGC ACGGCGATAG CTATACCTTT 
AGTCATCCCT TGGTAGCAAC CGTTGTCCGT AATAGCCTCA GTCGCGCCCG CCGCAGCTTT TTGCATCGCC GCGCCGCCCA AGCTCTACAA CACGAATATA GCGATCAACG GGCGATTGCA GGACGCTTGC TATTGCATTA TCGCGAGGCG GGCGAGGCCA AATTGGCTGC CAGTTATGCC GACCAAGCTC TAGAGCACGC GCTTTCATTG GCAGCACCGA ATGAGGCTGT GGCTTTTGGG CAACAGGCGG TTGAGCTTGA TCCAACGCCT GAGCGCTATT GTCGGCTGGG CGATGCCTTG GAGTGGAATA GCGAAGTGCC TGCGGCGCGT GAGGTTTATC ATACGGCGCT GGCGCAATAT CAAGCCCAAG CCAATTGGCT TCGAGTTGTG GCAGTGTGCA CCAAGCTTGG CCGCACCTAT TTGGTCGTCG GCAGGCCTGA GGTGGTGATC GAATGGGCGG AACGGGGCTT GGCAATTTAT CATAAACACA AGATTGATGA TCAATTGATC GAAGCCGATT TGCTGTTGCT CTTAGCGATT AGCCAACGTT TGGCGGGCTA CCCATTAAAC GTAGCCTACG AAAATATTCA AGCCGCCTTA GAGGTTGCGA CCGCCCAGCA AAATCACTCG TTGATTGGGC GTTGTCAATT TGAGTTGGGC AATATTTTGG CGCAACGCGG CGAAATTAAG GCAGCGGTCG AAACTTTTGC CTTGGCGATT GCCAGCACCG CCGCCACCAA CGAGCAATAT CAAATTATCT TGGGCTATAA CAATGCTGCC TATAATGCCA CCTTGATCGG CGATTTGGTC ACGGCGCATA GCCATATTCA AGCAGGCTTG CACTTGGCCG AACGCTTGGC CTTGCGCGTG CCATTGCAAT ATTTGTATAG CACACGCGGC GAAATTGCCT TAGCCGAAAA GCATTGGGAC GAGGCCGAAG AATGGTTTGA ACGCGGAATT AGCGTGGCTC AAGCCAATGG TAATGCGGCT CAAGTTGCCA ATTATCGTGC TAATTTGGGC TTGGTGGCGC GTGGCCGTGG CGATCTTGAT CAAGCGTTGG TCTTGATGCG CACAGCACTC AGCGAAGTTG AATTGCTCAC TGCGCCATTT TTACAAACCC AAATCAACAT TTGGCTGGCC GAAATTTGGC AAGAACGCCA CGATTATCTA GCTGCCAATG CTGCCTTACA ACGGGCACAA CAGGCCGTCA CACCTGAACA AGGCTTTTTG TATCAACGGG TTCAGCAATT ACAACAACAA CTTGGCAGCA TGCAACCAGC AATGGTTCAT CAGCGTTCTA TTTAA
Protein sequence | MTVALRFLGV PSILYNQQAQ SLPSKAVALL GYLAATIQPQ RREHLLALLW AESSDEAARK NLRNTLWTIR RSLGGEIILA DEDRLRLHPD CAVDLWQLRS LAEGLVSPTV ERLLQLAQGP LLDGVMLPDA PDFDLWLVTE REQVHLTTMR LFQTAISNYQ KQQQWVHVLT LARAALRFDP LSESLYQAII EAHWRQGERA EALRQYEMLA NTLQRELGID PLPETQMLRQ RILQTNQLMS TNNASEPAKP VVTAPIAEPK PLLKPIKRAA PRAAPFVGRQ YEQLLLNQAL MRSNERGLQV VLITGEIGVG KSRLWQEWAQ QLPADSTMLA MHCLESTQSL PFAPLTELFG QRICLSRLFS GDSAVDPMWL AEVARLLPQL RGHIPNLPEP PVLPADEERR RMFEAFAQCL RAVMTNHLVI GIDDLHWADQ ATAEWLDYVV DRLHDLPIVL VVTYRSEEAN ARLQRLVVGW QRGSLVTRID VPRLDLSEAR ELLGALGHPN PDDQTLLERS AGNPLFLLEL CRTADPADVP PMLVDVLSAR LARLPERAAQ IVQAAAVLDP LFGYSILRET GGRSDEETLD GLDGLLSAAI LKEHGDSYTF SHPLVATVVR NSLSRARRSF LHRRAAQALQ HEYSDQRAIA GRLLLHYREA GEAKLAASYA DQALEHALSL AAPNEAVAFG QQAVELDPTP ERYCRLGDAL EWNSEVPAAR EVYHTALAQY QAQANWLRVV AVCTKLGRTY LVVGRPEVVI EWAERGLAIY HKHKIDDQLI EADLLLLLAI SQRLAGYPLN VAYENIQAAL EVATAQQNHS LIGRCQFELG NILAQRGEIK AAVETFALAI ASTAATNEQY QIILGYNNAA YNATLIGDLV TAHSHIQAGL HLAERLALRV PLQYLYSTRG EIALAEKHWD EAEEWFERGI SVAQANGNAA QVANYRANLG LVARGRGDLD QALVLMRTAL SEVELLTAPF LQTQINIWLA EIWQERHDYL AANAALQRAQ QAVTPEQGFL YQRVQQLQQQ LGSMQPAMVH QRSI
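The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative example, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 15 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 15 nt of the gene sequence, copied from the record.
print(translate("ATGACCGTTG CCCTGCGTTT CCTCGGGGTT"))  # MTVALRFLGV
print(translate("CAGCGTTCTA TTTAA"))                  # QRSI*
```

The two fragments reproduce the first ten residues (MTVALRFLGV) and the last four residues (QRSI) of the protein sequence, followed by the TAA stop codon.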