Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2478 |
Symbol | |
ID | 5734359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3169087 |
End bp | 3170112 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641279618 |
Product | CRISPR-associated RAMP Csm4 family protein |
Protein accession | YP_001545244 |
Protein GI | 159898997 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1567] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01903] CRISPR-associated RAMP protein, Csm4 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTTA ATCTGATTCG GCTCAAGCCA CAAGCTGCGT TTCACTTTGG CATGCAAGGC ATCGATATGG AAGTGGTCAG CGAAACCTGC CCATCCGACA CGCTCTACGC CGCTTTATTT TGGCAGGCAT TGCAACAAGG CCAGCGCTGG GCCAGCGAGC CGAGCAATCC GCCATTCACA ATCTCGTCGT GTTTTCCGTA TGTTGATGGA ATTCAACTAT TGCCTGTGCC AATGCTGCCA CCCTTGCCCA GCGATAAGCA AAACCCGGGC GAGCGCAAGC AATTCAAAAA AGTGCGGTTT GTTTCCAGCG AGATTTTTAT TAATCTACTG GCCGGAACCC ACTCGCTGAG CCATTATTTT CGGCCAACCA ACGGCGTTGC TTTGCAAAAT GGCAGTGTCT TGGTCAGCCA AGCTGAATTT AGCGCCAACA AATGGGCTGT GGCAGATCCG CTTTGGAAAA TCGAGTCGAT TCCGCATGTG GCAGTTGATC GTTGGAGCAA TGCTTCGGCC TATTACGAAA CTGGCCAAGT GCGCTTTGCC GAAGGTTGTG GCTTAGCAAT CTTGGCACTT GGCGATATCA AACAACTGAT GTCGCTCTTG CACCAAGTTG GCATCGATGG CTTAGGCGGG CGGCGCAGCA AGGGGGTTGG GATGTTTGAG CCAGAGCTGC AAACCGAAAC TCTCGATTTA CCAGCTGCTA CCAGCGATTC GGTAATTGTG CTCTCGCGCT ATTTGCCGAG TGCCGCTGAG CTTGCGGCAG GAGTGCTCGA TCTGCCAGCA GCCTATAGTT TAGAGGATGT GACTGGCTGG ATGTATTCGC CCGCTGCCAA AGCCCAACGC CGCAAGGCAA TTTGGATGAT CGGGGTTGGC TCGCGCTTAA ATCGCACAGG GTTGGCTCAT TCGATCATCG GCTCAAGCGT TGATGTGGCC CCAACCTACG ACACCCCCAA CGCTGGCGTG AATCACCCCG TTTGGCGACA TGGGCTAGCG CTCACGGTCG GTTGTAGCGT AGGAGATTCG CAATGA
|
Protein sequence | MEFNLIRLKP QAAFHFGMQG IDMEVVSETC PSDTLYAALF WQALQQGQRW ASEPSNPPFT ISSCFPYVDG IQLLPVPMLP PLPSDKQNPG ERKQFKKVRF VSSEIFINLL AGTHSLSHYF RPTNGVALQN GSVLVSQAEF SANKWAVADP LWKIESIPHV AVDRWSNASA YYETGQVRFA EGCGLAILAL GDIKQLMSLL HQVGIDGLGG RRSKGVGMFE PELQTETLDL PAATSDSVIV LSRYLPSAAE LAAGVLDLPA AYSLEDVTGW MYSPAAKAQR RKAIWMIGVG SRLNRTGLAH SIIGSSVDVA PTYDTPNAGV NHPVWRHGLA LTVGCSVGDS Q
|
| |