Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1887 |
Symbol | |
ID | 5733776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2275514 |
End bp | 2276722 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279031 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001544658 |
Protein GI | 159898411 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCGA TTGCAACAAC TCCTAGCCAA TTTGAACTCG CTGCTTGGGC TAAGACCATC ACACCCTCGG CGCTGCAAGA TATGCTTTCG GCAACTGCCA ATCCTGAGGT TATTTCGTTT GCTCTGGGGC TACCCGCCCC AGAGCTGTTT CCTCGCCACC AATTTAGCCA ATTGGCCAGC ACGTTACTCG AAGCCGAACC TTTGGCGTTG CAATATGGCC CGCCAAGTAC CACCCTCAAA ACGGCAATTG TTTCATTGAT GGCGCAACGA GGGGTACGCT GTCGGCCCGA GCAAATCTTT CTGACCAATG GTGCGCAACA GGGCATGAAC CTGTTGGTAC GCTTGCTTTT GGCCGATGGC GGCAGCGTTT TGTTGGAAGA TTGTATTTAT ACGGGCTTTC AGCAAGTGCT TGATCCATTT CAAGCTAAGT TGCTAACCGT GCCAACCAAC CCTGAAACTG GTATGGATGT AGCAGCAGTC GAAGCTCATT TAGCTGCAGG CCAACGCCCA AGCTTAATCT ATGCGATCAG CGATGGACAC AACCCGCTTG GCGTGAGCAT GAGCCTCGCC CAACGCCAGC AGCTCGTTGA ACTAGCCCAA CAGTATCAAA TTCCCATTAT TGAAGATGAT GCTTATGGCT TTTTAAGCTA TCAGGCTGAT ACGATTGCCC CAATGCGAGC CTTAAGTGAC GACTGGGTTT TATATATTGG CTCATTTTCG AAAATTCTAG CCCCATCGTT GCGGGTTGGT TGGTTAGTCG TACCCGAGTG GTTAATCGAA CGCTTGTCGA TCGTCAAAGA GGCGAGCGAT ATTGGTACAG CCACGCTGAG CCAACGTTTA GTCGCAGCCT ATACCCAAAC CCATCAATTA ACTACGCATA TCGACCAATT ATGTCAAATA TATACAACTC GCCGCGATAC AATGTTTAGC GCGTTGGAGC AGCATTTTCC GTCCCAAACC CGTTGGTATC AGCCTAGCCA TGGCATGTTT ATTTGGGTTG AACTGCCTAC AACAGTTGAT CCCTTTAAAC TCCTAGACCG AGCGATCAAC CAAGCGAAGG TGGCGTTTAT CCCAGGCAGT GTGTTTGGTG TGGCGGGCAA ATCGATGAGT ACCAATGGAA TTCGCCTGAA TTTTTCGAAT GCCGATATTG ACCAGATTAA TGCGGGAATT GAGCGTTTAG CCACAATCAT GCAAACCCTC AAAGCCTGA
|
Protein sequence | MAAIATTPSQ FELAAWAKTI TPSALQDMLS ATANPEVISF ALGLPAPELF PRHQFSQLAS TLLEAEPLAL QYGPPSTTLK TAIVSLMAQR GVRCRPEQIF LTNGAQQGMN LLVRLLLADG GSVLLEDCIY TGFQQVLDPF QAKLLTVPTN PETGMDVAAV EAHLAAGQRP SLIYAISDGH NPLGVSMSLA QRQQLVELAQ QYQIPIIEDD AYGFLSYQAD TIAPMRALSD DWVLYIGSFS KILAPSLRVG WLVVPEWLIE RLSIVKEASD IGTATLSQRL VAAYTQTHQL TTHIDQLCQI YTTRRDTMFS ALEQHFPSQT RWYQPSHGMF IWVELPTTVD PFKLLDRAIN QAKVAFIPGS VFGVAGKSMS TNGIRLNFSN ADIDQINAGI ERLATIMQTL KA
|
| |