Gene Information |
Locus tag | Hhal_1302 |
Symbol | |
ID | 4710794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1412420 |
End bp | 1413349 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855771 |
Product | helix-hairpin-helix DNA-binding motif-containing protein |
Protein accession | YP_001002873 |
Protein GI | 121998086 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
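
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


# Values taken from the record above (Start bp, End bp).
length = gene_length(1412420, 1413349)
print(length)                  # 930, matching "Gene Length"
print(protein_length(length))  # 309, matching "Protein Length"
```

The strand does not affect these lengths; it only determines which end of the interval holds the start codon.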
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTCAGT CTACGTGGCC ACTAACAGCG AAGCTCTTCG GCGTCCAGGC CCGGGGGCAG AAGGCGGCGC AGCTGCAGCT GATCTACATG GTCTTCGCTG CCATCGTCGG CGGTGTGCAG CTGCTCTTGA TGCTCATTGG CGGGCTGCTG CGCGGCGCCA TCGCCCTCGG GCAGTGGGTC GGTGACAACG TGCACGCCCT GCCGCGGTGG TGGCGCAATG CCCGGCTGTG CCGGCAGCGC CGGCGGATCG ACGAGGCCGT CGAGCAGGCG ATCAACCGCT GGGACGCGGA CCGGCTCAGC GCCGCCTTCG CCCTGGCGGT GGCCGTCTGG GGGCGGGGTG AGGCGCTCCA GGACGGCCAG CGGACGCACC GACGGGTCAG CGAGAAGGCG GGATGGGTCG CGCTTCCGCG CTCCCCGGAG GTCTTCGCGG AGGCGGCACA GGGACTTGAA CATTGCCGCG CCGCCGTTCA GCCGAAGGAG GATGCGCACC GGATCCTGAT CGCACTGCTC GCCGAAGTGG CGGCCGAGAA ACTGGAGGGG TCCCGGCGGG CGGCGCTGCT CTTTGAGGCC GATGACCTGG CCCTGATCCA GGGCCCACGG ACGGTGCTCC AGGAGCAGAT GCTGGAGATC TTCGCTGATC ACGCCCAGCT ACAGATCGAG CCGGCCCGGC CGGTGGATGA GGCGACGAAG CCATCCTCAG CGCGGTCGGC TCGTGGTGCA CCGGGGGCGG GGCAGGGCGA CGAGCCGACG GGGCGCATCG ACCTCAACAC GGCCTCCATC GAAGAACTCC AGGCGATCCC CCACATCGGC CCGGAGCGCG CCGAGGCGAT CGTCGCCCTG AGACCGATCC GGCGGATCGA GCAGCTTGAG GAAGTCGACG GGATCGGCAC GAGCCGCCTG GCGGAGATCG CCGATCAGGT GAAGGTGTGA
|
Protein sequence | MGQSTWPLTA KLFGVQARGQ KAAQLQLIYM VFAAIVGGVQ LLLMLIGGLL RGAIALGQWV GDNVHALPRW WRNARLCRQR RRIDEAVEQA INRWDADRLS AAFALAVAVW GRGEALQDGQ RTHRRVSEKA GWVALPRSPE VFAEAAQGLE HCRAAVQPKE DAHRILIALL AEVAAEKLEG SRRAALLFEA DDLALIQGPR TVLQEQMLEI FADHAQLQIE PARPVDEATK PSSARSARGA PGAGQGDEPT GRIDLNTASI EELQAIPHIG PERAEAIVAL RPIRRIEQLE EVDGIGTSRL AEIADQVKV
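
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is one of several alternative start codons read as methionine at the initiation position. The sketch below shows one way this could be checked in plain Python, together with a GC fraction helper. The names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and the short fragment used at the end is only the first six codons of the record followed by its stop codon, not the full gene.

```python
# Codon table for NCBI translation table 11, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for table 11; each is read as M when it initiates.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, given 5'->3' on its coding strand."""
    s = clean(seq)
    if len(s) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    if residues[-1] == "*":
        residues.pop()  # the terminal stop codon is not part of the protein
    return "".join(residues)


# Illustrative fragment only: first six codons of the record plus TGA.
fragment = "GTGGGTCAGT CTACGTGG TGA"
print(translate_cds(fragment))  # MGQSTW
```

Passing the full Gene sequence value to `translate_cds` and comparing the result with the Protein sequence value, or passing it to `gc_fraction` and comparing with the reported GC content, would be a reasonable sanity check on the record.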