Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1153 |
Symbol | |
ID | 4710143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1256434 |
End bp | 1257723 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639855627 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001002731 |
Protein GI | 121997944 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.487263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTCC CGGCGTATCC CGAGTACAAG GATTCTGGGG TCGAGTGGCT GGGGGAGGTG CCGGAGCATT GGTCCGTCAG TGCGCTGAAG CGTGTTGCGC GTCTCGAAAG TGGCGACGCG ATAAGCAGCG ATCACATCAG TGAAGAGGGG GAGTATGCCG TTTACGGCGG GAATGGCATA AGGGGTTTTT CATCTGGATA CACTCACGAC GGTTTTTACC CTTTGATTGG GCGCCAAGGA GCTCTTTGCG GTAACGTCAA TTACGCGAAA GGAAGGTTCT GGGCATCTGA GCATGCGGTT GTTGTTTGGC CTGGAAGACA AATTGACGGT TTTTGGCTCG GTGAGCTTCT TCGCTCAATG AATCTTAATC AATATGCGAC ATCGGCTGCG CAACCGGGTT TGTCGGTTGA GACTATTGAA AATCTTTATG TTCCTGTTCC GCCGGATGAA GAGCAACAAA AGATAGCGGA GCTCCTCGAC CACGAAACCG CCCGTATCGA CGCCCTGATC GAGGAGCAGC AGCGCCTGAT CGAGCTGCTC AAGGAGAAGC GCCAGGCGGT GATCTCCCAT GCCGTCACCA AAGGCCTCGA CCCCGATGTG CCGATGAAGG ACTCCGGCGT GGAGTGGTTG GGGGAAGTGC CGGCGCATTG GGATGTCGTG AAGTTCGTCC GGTGTGCAAA AATTGCTGAG GGTCAGGTTG ATCCAAAGCA GGAGCCATAT AGGAGCATGA TGCTTGTTGC TCCAAATCAC ATTGAGTCAG GGACTGGACG ACTCATGGCT CGTGAGACTG CAGAAGAGCA GGGGGCAGAG AGTGGCAAGT ATTATTGCTA TGCTGGCGAC GTAATATACA GCAAGATTCG ACCGTCATTG AGAAAAGCAT GTGTAGCCTA CGAAGATTGC CTATGCAGCG CTGATATGTA TCCTCTCAGG GCGCAAAGTG GGGTGTATGG CGATTATCTG CGCTGGACGA TTCTGTCTGA ATCGTTCTCG ACGCTAGCTT TTCTGGAATC AGAGCGCGTG GCGATGCCGA AAGTCAATCG GGAGTCGATT GAAGAGATTC GAATCCCTAT GCCGCCACCG GAAGAGCAGC TACAGATATC CCGTACCCTC GAAAAAGAAA CGGCCCGCAT CGACGCGTTG ATGGAGGAGG CTGAATCGGG TATCCAGTTG CTCCAAGAAC GCCGCTCCGC CCTGATCTCC GCCGCCGTCA CCGGCAAGAT CGACGTGCGT GACTGGGCGC CGCCGGCCGC TGCCGAACCG GAGCAGGAAC GCGAAGGAGC GGCGCTATGA
|
Protein sequence | MSFPAYPEYK DSGVEWLGEV PEHWSVSALK RVARLESGDA ISSDHISEEG EYAVYGGNGI RGFSSGYTHD GFYPLIGRQG ALCGNVNYAK GRFWASEHAV VVWPGRQIDG FWLGELLRSM NLNQYATSAA QPGLSVETIE NLYVPVPPDE EQQKIAELLD HETARIDALI EEQQRLIELL KEKRQAVISH AVTKGLDPDV PMKDSGVEWL GEVPAHWDVV KFVRCAKIAE GQVDPKQEPY RSMMLVAPNH IESGTGRLMA RETAEEQGAE SGKYYCYAGD VIYSKIRPSL RKACVAYEDC LCSADMYPLR AQSGVYGDYL RWTILSESFS TLAFLESERV AMPKVNRESI EEIRIPMPPP EEQLQISRTL EKETARIDAL MEEAESGIQL LQERRSALIS AAVTGKIDVR DWAPPAAAEP EQEREGAAL
|
| |