Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3364 |
Symbol | |
ID | 5736906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4242088 |
End bp | 4243584 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280511 |
Product | transcriptional regulator |
Protein accession | YP_001546128 |
Protein GI | 159899881 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTTC AGCTTGATCG TCAGCACGCC AAACCACTCT ATATTCAATT ATCCGAACAA CTGCAAGAGC GGATTCGGAG TGGCTCGTTG CCAGCTGGCA CGAAACTGCC GCCAGTGCGC GATTTAGCTG AATCACTTGG TCTCACCCGT TTGACGGTTC ACAATGCCTA TAGCGAACTG CAAGCAAGTG GTTGGGTTGA GGCCTACGTT GGTCGCGGCA CCTTCGTCGC CGAGCGGATC AAGCCGATTA TTCCAGCATA TGAGATTCGC CAACGGGTAG TGGATGAGCT ACAAACGCCA TGGTTTAGCC AAGGCATGTT GGCCGATATG CTACGCTTGG CCCAACAGCC AAATTTAATC TCCTTTGCCC AAGCCGCGCC AGCCGAAGAA ACCTTTCCAG TGCGTGAAAT TGGCCGCGCA ATTCAGCAAG CCCTGCGCGA CCCCAGCGCC CTAGGCTATG GCCCAACCCA AGGCGAATTA TGTTTGCGCG AAGCGATTGC CACATGGCTG CTCGACCGCA ATGTTGTAAC CTCGCCCGAC CATGTGCTGG TGACAACTGG TGCTCAGCAG GGCGTAGCCT TGGCATTAAA GGCCTTTGTT CGCCAAGGCG ATGTGGTTTT GGTCGAGGAG CCAACCTATT TGGGCTTTAT CGAGCAGGCT ACGGCCTTGG GTGTGCGCTT AATCGGCATT CCATTGGATG ATCAAGGCTT GCGGTTGGAT ATTTTGCAAC GGGTATTGTG TGAATACAAA CCACGGTTGC TCTATACCGT GCCAACCTTC CACAACCCAA CCGGCGTTTG CCTTTCGACC GAGCGCCAAG AAGCCCTATT GCAATTGGCC CAAGAACATA ACTTAATTAT TTTAGAAGAT GATGTCTATG GGCCGCTGAG CTACGATGCT CAAGCACCAC ACCCAATCAA AGCCCGCGAT ACTAATGGGC AGGTGGTCTA TCTTGGCAGC TTCTCCAAAA TTCTAACTCC GGGCTTACGC CTAGGTTATT TGGTTGCCCG TGACGAATTT TTGCACCCGT TGCTGACTGC CAAGCGTGGC AACGATCTCC ACTGCTCGCC ATTATTGCAA CGAGCTTTGG CCGATTATCT TGGCCGTGGT CAGTTGGCGG CGCATTTGCG CTATGTGCGT GAACTCTATC GTGAGCGTCG CGATGCCATG GAACGAGCGT TGAACCGCTA TTGTCCCCGT GATATTCAAT GGACGCATCC ACGTGGCGGG TTATGCTACT GGCTAACCTT GCCCTCTGGA TTAAATGGCA CCGATATTTA TACCGAGGCG ATTGAAGCAG GCGTTGGCGT GACCCTTGGC AATGTCTTTT TTCCACAACC GCCACGCAAC GCCCACTTAC GGCTCTGTTT TGCCACCCAA TCACCAGAAT TAATTGATCG TGGAATTCGC ATCCTTGGCG ATGTGCTAAC CCGCCATGTC TTGCGTTGTG GTCAACTTGC TGCCCGTGCT TGGCGCGAAA CCACCCCACT GATGTAA
|
Protein sequence | MEFQLDRQHA KPLYIQLSEQ LQERIRSGSL PAGTKLPPVR DLAESLGLTR LTVHNAYSEL QASGWVEAYV GRGTFVAERI KPIIPAYEIR QRVVDELQTP WFSQGMLADM LRLAQQPNLI SFAQAAPAEE TFPVREIGRA IQQALRDPSA LGYGPTQGEL CLREAIATWL LDRNVVTSPD HVLVTTGAQQ GVALALKAFV RQGDVVLVEE PTYLGFIEQA TALGVRLIGI PLDDQGLRLD ILQRVLCEYK PRLLYTVPTF HNPTGVCLST ERQEALLQLA QEHNLIILED DVYGPLSYDA QAPHPIKARD TNGQVVYLGS FSKILTPGLR LGYLVARDEF LHPLLTAKRG NDLHCSPLLQ RALADYLGRG QLAAHLRYVR ELYRERRDAM ERALNRYCPR DIQWTHPRGG LCYWLTLPSG LNGTDIYTEA IEAGVGVTLG NVFFPQPPRN AHLRLCFATQ SPELIDRGIR ILGDVLTRHV LRCGQLAARA WRETTPLM
|
| |