Gene Information |
Locus tag | Haur_4797 |
Symbol | |
ID | 5736641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6114581 |
End bp | 6115591 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281962 |
Product | LacI family transcription regulator |
Protein accession | YP_001547556 |
Protein GI | 159901309 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
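The coordinate and length fields in this table agree with one another if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check, using hypothetical helper names that are not part of the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(6114581, 6115591)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Both values match the Gene Length and Protein Length fields listed above.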
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.184791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATATA CGATTAAAGA TGTGGCGAAA CGGGCGGGAG TTGGCATTGC AACCGTTTCG CGGGTGCTCA ACGAATCGCC CAACGTGCTA CCAGAAACTC GTGCCAGAGT TTTGGCAGTG ATCGACGAGC TTGGTTATCG ACCAAATCAT GCTGCTCGCC AACTGGTAAC CGCCAAAACC AATGCGATTG CGATTATTCT GCCCTTCCTT ACGCGACCAT TTTTTATTGA AGTGCTGCGG GGCATCGAAG CGGTTGTGGC CAATTCTGAA TATCAGTTGA TTATTTTTAA TGTTGACTCG CCGGAACAAC GCACGCGCTA TTTTAATACG CTGCCATTTT TGGGGCGTAC CGATGGGCTG TTGATTGTTT CGTTGCCTTT GGCTCAGCCT GAAATCAAAC GCTTGCAAGC GGCCAATTTG CCAGCAGTGA TGATCGACAC TCAAGTTGCC AATTTACCAT CAGTGGTCGT TGATAATGTG GGCGGTGCAT TCAAAGCCGT CGAACATCTA ATCAGCCAAG GCCATCAGCG GATTGGCTTT GTTTCGGGTC AGTTGGAACC AGATTTGGGT TTTACCGTCA ACCGCGATCG GCGGCGAGGC TACGAGGCTG CTCTCACTGC GCATCATCTG CCATTGCAAC CTGAATATCT GCGGCCAGGC TTTGATCGGC GCGATTGGGG CCATCAGGCG GCGCTTGAAT TGCTGGCATT ACCTGAGTCA CCGAGCGCTA TTTTTGCTGC CAACGACGAT TTAGCCTTTG GCGTGATCGA TGCTTTGCGC GAACGCGGCT TGAAGGCAGG CGAAGACATC GCCGTGGTTG GCTATGATGA TCTCGAAATG GCGCAGTTGG TGGGTTTAAC CACAATTCAT CAGCCAATGG AGCAAATGGG CCGTAAGGGA GCTGAGGTTT TGCTGGCCGC GCTGAATGAA GGCACACGCC GCCCAACCCT CTATACCTTG CCCGTTAATC TGATCGAACG CGCCAGCAGC AGCAAACTCA GCCAAGCATA A
|
Protein sequence | MAYTIKDVAK RAGVGIATVS RVLNESPNVL PETRARVLAV IDELGYRPNH AARQLVTAKT NAIAIILPFL TRPFFIEVLR GIEAVVANSE YQLIIFNVDS PEQRTRYFNT LPFLGRTDGL LIVSLPLAQP EIKRLQAANL PAVMIDTQVA NLPSVVVDNV GGAFKAVEHL ISQGHQRIGF VSGQLEPDLG FTVNRDRRRG YEAALTAHHL PLQPEYLRPG FDRRDWGHQA ALELLALPES PSAIFAANDD LAFGVIDALR ERGLKAGEDI AVVGYDDLEM AQLVGLTTIH QPMEQMGRKG AEVLLAALNE GTRRPTLYTL PVNLIERASS SKLSQA
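Both sequences are printed in space-separated blocks of ten characters, so the blocks have to be joined before any downstream use. A minimal Python sketch of that step, along with a GC-fraction calculation; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not the full record:

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence above.
prefix = join_blocks("ATGGCATATA CGATTAAAGA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.3
```

Applying the same two helpers to the full gene sequence would give a value to compare against the GC content field.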
|
| |