Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1047 |
Symbol | |
ID | 4709803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1133092 |
End bp | 1134048 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639855518 |
Product | LysR family transcriptional regulator |
Protein accession | YP_001002625 |
Protein GI | 121997838 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00687136 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCA CACTACGGCA GCTACAGGTG TTCGAGTCTG TAGCCCGACA CCTCAGTTTC ACGCGCGCCG CCGAGGAGCT CTATCTCACG CAACCGGCGG TCTCCATGCA GATCAAACAG CTCGAGCAAC AGGTCGGGCT GCCACTGTTC GAGCAGATCG GTAAACGAAT CTACCTCACC GAGGCGGGGG AGGAGGTGCG GCGCTATGCT CAACGCATCT CCGCCGAGCT GCGCGAGCTT GCCGACGGCC TCGAGGCCCT GCGCGGGCTC AACAGCGGGC GCCTGCGGCT GACGGTGGCC TCGACCGCCA ACTACTTCGC CACCGATCTG CTCGCCGCCT TCACCCGGCG CCAGCCGGGC GTGACCTTCC AGCTGGAGGT CACCAACCGG GAGGGAGTCA TCCGCCGCAT CCAAGACAAC GAAATGGATC TGGCCGTCAT GGGCCGCCCC CCCGAGGGGC TGGACGTCGC CGCCGAGGCC TTCATGCCCA ACCCGCTGGT GATCATCGCC GCGCCAGACC ACCCGCTGGC CGACGGTTCA CCGATTCCGC TCGAGCGCCT GCAGGACGAA CTCTTCGTGC TCCGCGAGCA GGGTTCGGGC ACCCGCAACG CCGTACAGCG CGTGCTCGAG GAGCGCGGCA TGAACCTGCG CGGCGGGCTG GAGATGAGCT CCAACGAGGC CATCAAACAG TCGGTACAGG CCGGACTCGG CCTCGGCGTC GTCTCGATCC ACACGGTGGC CCTGGAGCTG GAGCTGGGCC GCCTACGCGT GCTCAACGTG GAGGGCTTCC CCCTGGAGCG GCAGTGGTAC CTGGTACACC GCTCGGGCAA GCGCCTGTCG CCGGCCGCCG AGGCATTCCG CCAGTTCATC CTCGACGAGG CCCACCGGCA CTGGCAGGTC CCCGAGACCC CGGGCGTCGC GCCCCCGCAG GAACCACCGC TGCAGGCCGT TCCGTGA
|
Protein sequence | MNITLRQLQV FESVARHLSF TRAAEELYLT QPAVSMQIKQ LEQQVGLPLF EQIGKRIYLT EAGEEVRRYA QRISAELREL ADGLEALRGL NSGRLRLTVA STANYFATDL LAAFTRRQPG VTFQLEVTNR EGVIRRIQDN EMDLAVMGRP PEGLDVAAEA FMPNPLVIIA APDHPLADGS PIPLERLQDE LFVLREQGSG TRNAVQRVLE ERGMNLRGGL EMSSNEAIKQ SVQAGLGLGV VSIHTVALEL ELGRLRVLNV EGFPLERQWY LVHRSGKRLS PAAEAFRQFI LDEAHRHWQV PETPGVAPPQ EPPLQAVP
|
| |