Gene Information |
Locus tag | Hhal_0631 |
Symbol | |
ID | 4709493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 709213 |
End bp | 710424 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639855095 |
Product | CRISPR-associated RAMP Cmr6 family protein |
Protein accession | YP_001002218 |
Protein GI | 121997431 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1604] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01898] CRISPR-associated RAMP protein, Cmr6 family |
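The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Hhal_0631.
length_bp = gene_length(709213, 710424)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields in the record.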
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.296132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCG CCGCCGTTCC CGCCTACCTC GGTGAGGACT TCCGGGAGGC CGCTCCGGGC CACCGCTTTG CGCTCTATCT CAGCGTCTGG GATCAGCAAT GGAAGAAGCA GCGGGGCGGT GCGATCGAGG AGCTGCTGAA GCTGAACGAC GACGATCGGA AGCGGCTGGG CGCTTTGATC GAGCGCCAAC GCCGGCTACT CCAGCATGAG GAGAGCGCAC AACCAGGGAG CACCCTGAAC CTGCCGGCCA CGAGCACGAG CCCCTTCACA ACAGGGCTCG GCATGGCCCA CCCGCTGGAG AACGGCTTCG CTTTTCTCAC CCCCTACGGC CTGCCTTACC TCGCGGGAAG CGGCGTCAAG GGTGTCCTGC GCCAGGCGGC GCGGGAGCTC GCCGAGGGGG GTGAGTTCGA GGATCCCGGG CGCGATTGGG ACTGGCCCGA GATCGAGGCA CTTTTCGGGA GCCCCGGCGA GGACGAAGGC GGCGTGACAG GCCGGCGCCG AGGCGCCCTA AACTTCTGGG ACGTCTTCCC CGAACCAGGG CGAGGGCAAG ACCTGGCCTG GGAGGTCATG ACACCCCATC AGGGCAGCTA CTACCAGGAC GCTACCGGCC AAACCGCACC GCACGATAAC GGCGCGCCCG TGCCGATCTA CTTCCTCGGT ATCCCGGCCG GCAGCGGTTT CCGCTTTCAT ATCCAGTGCA ATCGGGCCCT GTTACGGCAG ACGGGACCGA CACTGCTTGA GCCAGCGGGC GAGGACGGCG ATCGAGCACG CTGGCAGGTT CTGCTCGAAA CCCTCTTCGC GCACGCCTTC GAGTGGCTCG GCTTCGGCGC CAAGACGGCG GTCGGCTACG GCGCGCTGTC CATCGACGAG AAGACCCGGC GGCACGAGGC CGAGGCGCGC GAAAAGATCC GGGCCGAACA GGCACGCAAG GAACAGCTTG CGAGCTTGCC GCCCGGGCAG CGTCGCGCGG AAGAGCTGCT TGAGCAGCGC CAAGACCCCA GCTATCCGGC GCACCGCTTC TTGCTCGAAC AGCTCGAAGC CGGTGCAGTA GCCGCCGAGG AGCAAGCGGC CCTGGCGCAG GTTGCGCTCG ACTACCTCGC GCAAGACCGT GAAAAGGTTC GTAAGACGAA GAAAGCCAAG CAGAAGCTGC CGAAACTGGA CGAGGAAGAG GTGCAACTTC AGCGCTTCGT CGACGAGGAA GGCACAGGGT GA
|
Protein sequence | MSRAAVPAYL GEDFREAAPG HRFALYLSVW DQQWKKQRGG AIEELLKLND DDRKRLGALI ERQRRLLQHE ESAQPGSTLN LPATSTSPFT TGLGMAHPLE NGFAFLTPYG LPYLAGSGVK GVLRQAAREL AEGGEFEDPG RDWDWPEIEA LFGSPGEDEG GVTGRRRGAL NFWDVFPEPG RGQDLAWEVM TPHQGSYYQD ATGQTAPHDN GAPVPIYFLG IPAGSGFRFH IQCNRALLRQ TGPTLLEPAG EDGDRARWQV LLETLFAHAF EWLGFGAKTA VGYGALSIDE KTRRHEAEAR EKIRAEQARK EQLASLPPGQ RRAEELLEQR QDPSYPAHRF LLEQLEAGAV AAEEQAALAQ VALDYLAQDR EKVRKTKKAK QKLPKLDEEE VQLQRFVDEE GTG