Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0872 |
Symbol | |
ID | 5732773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 993339 |
End bp | 994217 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278004 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001543648 |
Protein GI | 159897401 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTGG AATCGGGATT AATCAACGAG GTAAAGGGGA AGCTTGCTGA TCAACGAGGA ACGCTATCGA GTATTCGCCA GCCATCAAGA CCAGACCAGT GGCATGATCA CGATCAAACA ACCCATTATA GCGCGGCGGT TGAGCGGGTT ATTGGCGTAA TGCGGGGCCA ATCCCATGAA TTATTACGGC TGGAACAATT AGCTGATATT GCAAACCTTA GCCCTTTCCA TTTTAATCGT ATCTTCCGCC AAACGGTTGG GCTGCCACCT GGTAAATTTT TATCAACCCT GCGGCTTGAT CGGGCCAAGC GTTTATTGCT TACCACCGAT TTAAGTATTA CTACAATTTG TTTCGAGGCA GGCTATAGTA GTTTAGGCAC GTTTACGACC CAGTTCACCC AGGTTGTTGG GGTTTCACCG CGACGCTTGC GCTTGCTCAG GGCCACATTT GAAACTCCGC GCTTGGATCG TTTACACCAC CAATATGTTC ATGAAACCAA ACCTGATCAA CAAGCATTGA CGGTGCATGG GAGCATCATT GCCCCAGAAA GCTTTAGTGG CTCGATTTTT ATCGGTCTCT ACCCAATTGC AGCGCCTTTA GGCCAGCCTG TTAGTTGTGT TTTTCTTAAT GCACTTGGCG ATTTTCAACT CAAGGCTGAA AAGCCAGGCC GATACCATCT ATTTGCAGGA GCTATACCTT GGTCGAATGA CCCCTTGGCC TATCTCTTAC CTTGTGAACA GACGGCTTGG TTTGCTAGAA CCGCCAGTGT TATTATGCTC CGTTCAGATA CGTTTACGAG CTGTGGCGTA ATGAGTCTAC GCCGTCGCCG CCCAACTGAT CCCCCACTGC TGACGATACC CTCAGCCCTG ACAACCTAA
|
Protein sequence | MSLESGLINE VKGKLADQRG TLSSIRQPSR PDQWHDHDQT THYSAAVERV IGVMRGQSHE LLRLEQLADI ANLSPFHFNR IFRQTVGLPP GKFLSTLRLD RAKRLLLTTD LSITTICFEA GYSSLGTFTT QFTQVVGVSP RRLRLLRATF ETPRLDRLHH QYVHETKPDQ QALTVHGSII APESFSGSIF IGLYPIAAPL GQPVSCVFLN ALGDFQLKAE KPGRYHLFAG AIPWSNDPLA YLLPCEQTAW FARTASVIML RSDTFTSCGV MSLRRRRPTD PPLLTIPSAL TT
|
| |