Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0698 |
Symbol | |
ID | 5732599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 801167 |
End bp | 802663 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641277828 |
Product | CRISPR-associated RAMP Cmr1 family protein |
Protein accession | YP_001543474 |
Protein GI | 159897227 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1367] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01894] CRISPR-associated RAMP protein, Cmr1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTA AACCTAAATT CTCACCACTA AAAATAGCAT ACGAGCATCC AAGCTATAAA GCTGGGAAGA ACCCTAGCGC ATTTACCGTT GGCGAACACA CCTATATCAC CGAAACCCGC AGCTATAGGT TGATCACCCC ATTATTTGGT GGCGGCGTTA AGGCTGGGGT GAATGACCCA ATTACGCCAA TTCGCGCCAG TGGCATTCGC GGTCAATTGC GCTTTTGGTG GCGAGCTATT CGCGGCGGCG GCTATCGCTC AATTGATGAT TTACGGGCAA AAGAGGCTCA GATTTGGGGT AGTGCCAATA CTTCATTCTC TTCTGAAAAA GAAAGAGAGG ATGAAGATTC TGGCAATCTG GTAGATGATA AAAATCAAGA AAATAATAAA TTAAAAAATA TCATTCCTAT TCAAATCTCT GTTGAGATAA GTTCTGATGG AGAAGAAAAA TATCCATATG TTATGAAAAA GAATGCTATA GGAAATTATA ACCCAGAGCC TATATCTAGT ACTGCGCCTG CATATGCATC TTTTCCATTA AAACCAGAAA TGAACGCTAT AAGAAAATAT GGACCTAAAA CTCCTATCAA TCCTATTCGT AATAAGATTA TTTTTAGATT AAAAATTACC TACTATCAGC ATCATAAGCT GGAAATTCAG GCTGCACTGT GGGCATGGGA AATGTTTGGT GGAGTCGGTG GTCGTACCCG ACGTGGATTT GGTTCGATCA CTCAGTGTGA TAGTGATAAT AAACCACAAA ATGCAGAATC AGTTGCGCAA TGGTTAAAAC AAAACATTAA AAATTATACA GAAGATATTT CAAAAAGAGG TTTTAATTGG ATTAATGATA TTCCTAATAT TGTAGAAAGT ATTAACGAGA GTAAATTTTT CTGCTCATCA AAAGGCTTAA ATGGTATACG TATTTGGAAT AATATGATAA ATACGTTACA AAAATTTCGC CATCAACGAA ATCAGTATGG CAAAAAATTT GGTAGAAATC ATTGGCCTGA GCCAGATTTT ATTCGAGAAA TCACGGGGGA GCAATCTAAT GATCATCAAG AAGCTATTAT TAAGGTTGAC AAATTCCCAC GGGCAGCCTT TGGATTACCA ATAATATTCC ATTTTATCAA TCGTAAAGAT AAAAAAAATC CTAATGCTCC TAGAGATCCT TATGATACAA CGCTAGTGCC CAAAGGTTTT GAGCGCTTTT CTAGCCCTTT GATTATTAAG ATTATTCAAT GTGCTGATGA AAGTTATGTT GGAATTGCAC TTATTTTGAG CAAAACCCAA GTCCCTAATC AATTACAACT GAAAAAGGGT GGAACTCCAT TGCTCAATCC TAACGATCAA ACCGAAGATT TTCAACATCT CTTGACACCT AATGAAGCAC AACAAATCAA GCAAATTGAA GCTAATAAAG CGTCAGGTTC ACTACTGAAT CAAGGAACCG ATATTCTTAA AGCATTTTTA GCCTATCTTG CGAAGGAGCT ACGATAA
|
Protein sequence | MNRKPKFSPL KIAYEHPSYK AGKNPSAFTV GEHTYITETR SYRLITPLFG GGVKAGVNDP ITPIRASGIR GQLRFWWRAI RGGGYRSIDD LRAKEAQIWG SANTSFSSEK EREDEDSGNL VDDKNQENNK LKNIIPIQIS VEISSDGEEK YPYVMKKNAI GNYNPEPISS TAPAYASFPL KPEMNAIRKY GPKTPINPIR NKIIFRLKIT YYQHHKLEIQ AALWAWEMFG GVGGRTRRGF GSITQCDSDN KPQNAESVAQ WLKQNIKNYT EDISKRGFNW INDIPNIVES INESKFFCSS KGLNGIRIWN NMINTLQKFR HQRNQYGKKF GRNHWPEPDF IREITGEQSN DHQEAIIKVD KFPRAAFGLP IIFHFINRKD KKNPNAPRDP YDTTLVPKGF ERFSSPLIIK IIQCADESYV GIALILSKTQ VPNQLQLKKG GTPLLNPNDQ TEDFQHLLTP NEAQQIKQIE ANKASGSLLN QGTDILKAFL AYLAKELR
|
| |