Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4711 |
Symbol | |
ID | 5736947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6017093 |
End bp | 6018097 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281875 |
Product | LacI family transcription regulator |
Protein accession | YP_001547470 |
Protein GI | 159901223 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGCA TCAAAGATGT TGCCAAAGCA GCCAATGTTT CAACGGCGAC AGTTTCGCGG GTTTTGGCCA ATCATCCGCA TGTGCGGCAA GAGGTGCGCG AACGGGTGTT AGCGGCGGTG GCTCAGCTCG AATATCGGCC TAATTTGATT GCCCGCAATT TGCGTTCGCA GCAATCCAAC ACCTTGGGCT TGATCGTTTC AGATATTCGT AATCCCTTTT TTACCGCCGT TAGCCGTGCC GTCGAAGATA CGGCCTATGC CCATGGCTAC AATGTGCTGC TCTGCAACAC CGATGAAAAT CCCGAAAAAG AATTGCTGTA TTTGCAACTG ATGGGCGACG AGCAAGTGGC TGGGGTGATT TTCTCGCCCA CCTTACAAAC CCTCAATCGC TTTCATGAAT TGAATTTGAG CTTTCCGACC GTATTAATCG ACCGTTCGTT GCGCAGTGGC GATGTTGATG CCGTGCTGTT GGATAATGTG AGCGCTGGTT ATACCTTGGC CCAACATCTG ATCAATCAAG GCTATCGGCG GATTGGGGCG ATTTTTGGTG AGGCCAGCAC GACTGGGCGC GAACGCCAAC GCGGCTTTGA AGATGCCTTG CGCGATGCAG GCATCAGCAT GCAGCCCGAA TATTTGCGCT TTGTGCGACC ACGCAGCGAA GCTGGCCATA GCACAACCTT GGATCTCTTG CGTTTGCCGC AACCACCAGA GGCCATTTTT ACTAGCAATA GCTTACTGAC GGCTGGCGCA CTTCAAGCCA TTCGTGAACG GCGTTTGCAG ATGCCTGAGC AAATTGGCTT GGTTGGTTTC GATGATACAG CTTGGGCCAG TTTGGTGCAG CCAGCGATAA CTGTTTTAGC CCAACCAACC GATGAAATTG GCCGCTCAGC GACGGAATTA GTCTTGCAAC GGGTGGCAGA CCCACAACGG CCAACCCGCA AAATCATTTT GCAAGGCGAG CTGATTGTGC GCGAATCGAG TGTTGAGCAG CGGGTAAGAG CCTAA
|
Protein sequence | MSSIKDVAKA ANVSTATVSR VLANHPHVRQ EVRERVLAAV AQLEYRPNLI ARNLRSQQSN TLGLIVSDIR NPFFTAVSRA VEDTAYAHGY NVLLCNTDEN PEKELLYLQL MGDEQVAGVI FSPTLQTLNR FHELNLSFPT VLIDRSLRSG DVDAVLLDNV SAGYTLAQHL INQGYRRIGA IFGEASTTGR ERQRGFEDAL RDAGISMQPE YLRFVRPRSE AGHSTTLDLL RLPQPPEAIF TSNSLLTAGA LQAIRERRLQ MPEQIGLVGF DDTAWASLVQ PAITVLAQPT DEIGRSATEL VLQRVADPQR PTRKIILQGE LIVRESSVEQ RVRA
|
| |