Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0608 |
Symbol | |
ID | 5732506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 700782 |
End bp | 701750 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277735 |
Product | helix-turn-helix type 11 domain-containing protein |
Protein accession | YP_001543384 |
Protein GI | 159897137 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATG AACGTTTAGC CAGTAAAGCA GCACGTTTAC GCTGGATTGA ACGCAAACTC TATAATAATC CCCAAGGCTT GCGGGTGATG GATTTGGCCG AGGCTACGGG CATGGATCGG CGCACCATCT ACCGCGATTT GGGTGCGCTT GAAGACATGG GCGTGCCAAT GTGGCAATTT GAGGGCAAAT TTGGGATTAA TCGCGAAGAT TATCTCTCAA CAGTGCGCTT AAATTTAAAT CAAACAATTT CGCTGTTTTT TGCTGCCCGT TTGCTGGCCC ACCATAGCGA TGAGCAAAAC CCGCATGTGG TGGCGGCGCT TGAGAAAATT GCGGCCTCGC TGCCCGATGA AACAATTGCT AAACATTTGA GCAATGTTGC CGCCCAAATT CAACTGCGCC CAACCCGCCG CGAATATATT TTGGTGCTCG AAACCTTCAC TCGCGCTTGG GCTGATCGGC GTATGGTTAA ATTTTCGTAT TGGGCCTCGA ACCGTGAGGA GCCTGAGGAG CGCACGGTTG CGCCGTATGT GCTCGAAGTG TCGCGGTTTG AACCAGCCTC ATATGTGATT GGCCACGACC CATTACGCAA TGCTTTGCGC ACATTTAAGC TCGAACGAGT CCAACGCGCC GAAATTCTTG ATCTCGAATA TGTGATTCCC GCGGATTTCG ACCCATACTC CATGCTGGCC GATAGCTGGG GCATCATGGA CGAGGGCAAT ACTGTCACGG TGCGCTTACG ATTTAGCGCA GTTGTGGCCC GTCGCGTCAA AGAAAGCACG TGGCATCGGT CGCAAGAAGT GATCGATTTG CCCGATGGTG GCTGTGAGCT TTCGATGAAA CTCGCTGGTA CGCGTGAAAT GCGTTCATGG GTGCTGGGCT GGGGTGCTGA TGTTGAGGTA TTGGCTCCAG CTGATCTACG CGCTGAAGTT GCCGAGCACG CTCAACGGAT GGTTCAGCAA TATCAATAG
|
Protein sequence | MSDERLASKA ARLRWIERKL YNNPQGLRVM DLAEATGMDR RTIYRDLGAL EDMGVPMWQF EGKFGINRED YLSTVRLNLN QTISLFFAAR LLAHHSDEQN PHVVAALEKI AASLPDETIA KHLSNVAAQI QLRPTRREYI LVLETFTRAW ADRRMVKFSY WASNREEPEE RTVAPYVLEV SRFEPASYVI GHDPLRNALR TFKLERVQRA EILDLEYVIP ADFDPYSMLA DSWGIMDEGN TVTVRLRFSA VVARRVKEST WHRSQEVIDL PDGGCELSMK LAGTREMRSW VLGWGADVEV LAPADLRAEV AEHAQRMVQQ YQ
|
| |