Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1311 |
Symbol | |
ID | 4710819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1421753 |
End bp | 1422976 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639855778 |
Product | helix-hairpin-helix DNA-binding, class 1 |
Protein accession | YP_001002880 |
Protein GI | 121998093 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0682087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCTGT TCAATCTCGG CAAGAAGGAC GCCTACGGCA AGCAGCGGCG CGTTGAGCAC CGCGGCAAGT ACCTGCGGGC CAGTCGCACC GGCGGCGTGG CGCTGCGCGC CCAGGCCCGG GCGGCGGGTG TGAACCTCAC CGCCAACACC CGGCGCGGTG TCCGGGCTTC GATGACGCCG GCGAAGAACA CCCAGGTGGC CTTGCAGAAC GGCCGTTTCA TCCTGCGCGG ACGCTACGGG AACGGGCCGA CCAAGCTGAA CCTCTCCAAA AGCGGTGCGA CGGTCTCCAC GCGCAACCGG CTCGGCTCGT TCAACTGGCT CAAGCCCAAC CGCTCCTCGG CGAAGCTCTT CGGCGTCCAG GTCCGGGGGC AGAAGGCGGC CCAGCTTCAG GTCTTCTATA TGCTCTTCGC CGCCGTTGTC GGTGGCGTGC AGCTGCTCCT GATGCTCATT GGCGGGCTGC TGCGCGGCGC CGTTGCCCTC GGGCAGTGGG TGGGTGACCA CGTGCACGCC CTGCCGCGGC GTTGGCGCAA TGCCCGGTTG CGCCGCCAGC GCGGGCGGAT CGACGAGGCC GTTGAGCAGG CGATCAACCG CTGGGACGCG GACCGGCTCA GTGCTGCAGT CGCCCTGGCG GTTGCCCTAT GGGGGCGCGG TGAAACGCTC AAAGCGGGGT GGCACCGCGT CCAGCAACGT GTCACCCAGA ATCCGGGCTT TGAAGCCCTG CCCCGATCAC CGGAGGTGTT CGAAGAGGTC GCTGCCGAGC TCGAGCGCTG CCGGGCCGCG GTGAAGCTGA CACAGGATGC GCATCGGATC GTGCTCGCGT TGCTGGCGGA GGCCGCTACG CAAGGGATGG ACGGGGGGCG GCGGGCAGAG CTGCTGTTCG ATGCGGACGA TCTGGCCCTG GCCCGGGGTC CGCGGACCGT GCTGCAGGAA GAGCTGCTCG AGATCTTCGC CGACCATGCG CAACTCTGTC TGGAGCCGGC CTTGCCGGTG GACACGACAC AACGGCAGTG CGGCCGATCC CGCCCAGGGC GTGACCTGTC GCAGGGACTG ATCGACCTCA ACACCGCCTC CATCGAAGAG CTTCAGGTGA TCCCGCACAT CGGCCCGGAG CGCGCCGAGG CGATCGTCGC CATGCGACCG ATTCGGCGGA TCGAGCAGCT TGAAGAGGTC GACGGCATCG GACCGAGCCG GCTCGCGGAG ATTGCCGAAC AAACCCGGGT ATGA
|
Protein sequence | MGLFNLGKKD AYGKQRRVEH RGKYLRASRT GGVALRAQAR AAGVNLTANT RRGVRASMTP AKNTQVALQN GRFILRGRYG NGPTKLNLSK SGATVSTRNR LGSFNWLKPN RSSAKLFGVQ VRGQKAAQLQ VFYMLFAAVV GGVQLLLMLI GGLLRGAVAL GQWVGDHVHA LPRRWRNARL RRQRGRIDEA VEQAINRWDA DRLSAAVALA VALWGRGETL KAGWHRVQQR VTQNPGFEAL PRSPEVFEEV AAELERCRAA VKLTQDAHRI VLALLAEAAT QGMDGGRRAE LLFDADDLAL ARGPRTVLQE ELLEIFADHA QLCLEPALPV DTTQRQCGRS RPGRDLSQGL IDLNTASIEE LQVIPHIGPE RAEAIVAMRP IRRIEQLEEV DGIGPSRLAE IAEQTRV
|
| |