Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2467 |
Symbol | |
ID | 5734347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3154083 |
End bp | 3155093 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279606 |
Product | LacI family transcription regulator |
Protein accession | YP_001545233 |
Protein GI | 159898986 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGAA GTAAACGGAT CACCATCCAT GATATTGCAC GCAAAGCTGG CGTATCACCC AGTACTGTCT CGCGGGTCTT GAACAGCACC ACACCTGTAG CCGAAGCCAA ACGCCAAGCT GTAACAACGG CGATTCAACA GTTGGATTAT CGCCCAAATC TGATTGCCCA AGGCTTGGCT CGTGGCACAT CGACGATTAT TGGCGTGCTG ACCCAAGATA TTGGTAGCCC GTTTTATGGC GAGTTACTGC GCGGCATCGA ATATGGATTT CGTGGCAGTC GCTATCACCC GATTTTTGCC GATGGCAACT GGCAACAAGC TGAGGAATAC AACGCATTAA ACATTCTGCG CTCACGCCAA CCTGAGGCAT TAATTATTTT AGGTGGTTTA ATGCCTGATG CCGAAATGTT GGCCGCAGCC CAAGAATTTC CCTTGATCAT TATTGGGCGA AGTGTGCCAA GTTTGGAAGA ATATTGTGTT TTGGTTGATA ATTTCCAAGG AGCTTATCGC GCAACCCAAT ATTTAATTGA AATGGGCCAT CAGCGGATTG CCCATATTAC TGGAATTCGC AGCCATCAAG ATACACTTGA TCGCCAGGCA GGCTACGAAC AAGCCCTGCG CGATGCCAAC TTGCCAATTA ATCCCGACCT GATTGTTGAG GGAACGTTTC AAGAACAATC GGGCTTACTA GCCGTCGAAA CCTTATTAAT GCGAGCAAAC CCGTTTACCG CCCTCTTTGC AGCCAATGAT CAAATGGCCT ATGGTGCTCG CTTAGCGCTC TATCGGCGAG GAATTCGGGT GCCCGAAGAT GTTTCGCTGA TTGGCTTTGA TGATTTGCCA AGCTCAGCCT ATACCACGCC CCCATTAACT ACTGTTCGCC AACCAACCTT CGAAATGGGC ATGAGTGCAG CCAAAGCAAC GCTTAATTTA ATCGATCAAC GGCCATGGCC ATTGCCTCAG CTAACTCCCG ATTTAGTGAT TCGTGAGTCA ACTGGCTTCG CACGGCGTTA A
|
Protein sequence | MTGSKRITIH DIARKAGVSP STVSRVLNST TPVAEAKRQA VTTAIQQLDY RPNLIAQGLA RGTSTIIGVL TQDIGSPFYG ELLRGIEYGF RGSRYHPIFA DGNWQQAEEY NALNILRSRQ PEALIILGGL MPDAEMLAAA QEFPLIIIGR SVPSLEEYCV LVDNFQGAYR ATQYLIEMGH QRIAHITGIR SHQDTLDRQA GYEQALRDAN LPINPDLIVE GTFQEQSGLL AVETLLMRAN PFTALFAAND QMAYGARLAL YRRGIRVPED VSLIGFDDLP SSAYTTPPLT TVRQPTFEMG MSAAKATLNL IDQRPWPLPQ LTPDLVIRES TGFARR
|
| |