Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0892 |
Symbol | |
ID | 5732793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1019600 |
End bp | 1020640 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278024 |
Product | LacI family transcription regulator |
Protein accession | YP_001543668 |
Protein GI | 159897421 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00803151 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGC GGATCACGAT GGAAGACATT GCGCGGCAAA GCGGTGTCTC GTTGGCAACA GTTTCATTAG TATTACGCGA CAAGCCTGGG ATTAACGACG AGACACGCCG CCGCGTGTTG GATATTGCCC GTGATCTTGG TTATCGCAAG CGCTTGAATC ATGAGAAGTT GGTTTCGCAA TCGTTGCACA ACGCAGGCGT AATTGTTAAG GCCTCAATTG GCGACGATAG CCCACTGACC AACCCGTTTT ATGCCCCGAT TGTCGCAGGT ATCGAGGCCG CCTGTCGCAA AATGCATATT AACTTAATGT ATGCCACTGT GCCAGTTGAT ATGGATAATC ATCCTCAAGA GATGCCCCGT TTGCTCTCGG AAGATCACCT TGATGGGGTA TTGTTAGTTG GCGCATTCGC CGATGCAACC ATCACCAAGC TTTTGCAACG TGAGGGCATT CCGGCGGTTT TGGTCGATGG CTACTCACAC GAACATGTCT ACGATTCAGT TGTTTCAGAT AACTTCCGCG CAGCCTATGA AGCAGTCAGC TATCTGATTA GCTACGGGCA TCGTCATATT GGCCTGATTG GGACAACCAA AGAGGCCTAC CCTAGCATTG CCGAACGGCG CAAAGGCTAT ATTCAAGCCT TAACCGATCA TGGCATTCAT GATCAGTATT TTGGTGATTG CTTGCTCACA ATGCACGAAG GCAGCGATAC ATCGAGCATT CTGTTGCAGC GCCATCCGCA AATTACCGCG CTGTTTTGTG CCAACGATAT GATGGCGATT GGTGCAACCC AAGCCGCGCG GGCGTTGCAT CGCCAAATTC CCCAAGATTT ATCAATTATT GGTTTCGATA ATATTGATCT GGCTCAGCAT GTTGCGCCAG CGCTTACCAC AATGCATGTC GATAAAGTCA GCATGGGGCG CTTTGCGGTG CAATTGTTAG CCAATCGAGC CGAATACCCA GACCAGGCTC CGGCGACAGT CTCGCTGCGG CCACGGCTGC TTGAACGCCA ATCAGTTCAA CGTTTGCAAC CACCAAAGTA G
|
Protein sequence | MTQRITMEDI ARQSGVSLAT VSLVLRDKPG INDETRRRVL DIARDLGYRK RLNHEKLVSQ SLHNAGVIVK ASIGDDSPLT NPFYAPIVAG IEAACRKMHI NLMYATVPVD MDNHPQEMPR LLSEDHLDGV LLVGAFADAT ITKLLQREGI PAVLVDGYSH EHVYDSVVSD NFRAAYEAVS YLISYGHRHI GLIGTTKEAY PSIAERRKGY IQALTDHGIH DQYFGDCLLT MHEGSDTSSI LLQRHPQITA LFCANDMMAI GATQAARALH RQIPQDLSII GFDNIDLAQH VAPALTTMHV DKVSMGRFAV QLLANRAEYP DQAPATVSLR PRLLERQSVQ RLQPPK
|
| |