Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5276 |
Symbol | |
ID | 5737234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 62612 |
End bp | 63868 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641282440 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001548031 |
Protein GI | 159901786 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAATAG AAGATACTAC TACTTCCTCG ATTTGGGATT TACCTTCTCA TTGGGGTGTA AAAAAACTTA AATTGATAGC AAAAGAAATA TCGCAACAGA TAAAACCTGC TGATAATCCA TCCACAGTGT ATAACTATTG GGGGCTAGAT GCAATTACTA AGGGGCAATT TCAAGAACCA AAACAGAATC TTGTTAAAGG TTCTAACATA GAAAGCACAT GCGTCACATT TACAGAAAAT CAGATTATTT ATTCAAAATT ACGCCCTTAC TTAAACAAGG TTATTGTCCC ATCTATTCCT GGTATTGGTA CAACCGAGTG GATTGTTGTC GAACCTGATG CAAATGTTGT GGATAGGAAG TATCTTGCTT ATGTTTTACG ATCACCAGCT TTTTTAAGGT ATGTGTCTCG TGGTGAGAAT ATTAATGGTG CCAGAATGCC GAGGTTAAGA AAGGACAGCT TCTGGAATTT TCCCATTCCT CTGCCATCTC TTTCTAATCC TGCACGATCT CTTCAGATCC AGCAGTCAAT CGTTGTTCGG ATTGAGTCGC TCCTAAGTGA GCTGGGAGAG ATACGTGAGC TTCATCGAAG AATCGATCTT GATGTTTCCA ATGTGATGGA TAGTATATTT CGAGATGTTT ATATAGATTT GGAGAACAAA TACCCCTCTC GTCAACGGAT TGACTCCTTC ACACAAGTGA AAACCGGAGG TACTCCTAGT CGTAAGCATT CAGAGTATTA CAACGGTGAT ATTCCTTGGG TAAAGACTGG AGAACTCAAA GACGGCCTGA TCAAAAAGAC TGAAGAGTAT ATTACCTTAG AAGCAATGCA GAATAGCAAT GCAAAAAAAA TACCGATAGG AACTCTTTTA GTTGCTATGT ACGGTCAAGG TCAAACCAGA GGAAGAACTG GTTTATTAGC GATTGAAGCT ACAACAAATC AAGCCTGCTG TGCAATATTA CCAAATCCCT ATATCTTTAT TCCGCGTTAT CTTCAATTTT GGTTTATTTT TATGTACCAT GATCTTCGCA AAAAGAGCGA TGCAAGAGGA GGAAATCAAG CAAATCTTAA TTCTCAAATA ATAAAGGAAT TAAAACCGCC ATTACCGCCT ATATTCGTGC AACAACAGGT AGTATCTTAT CTAGATGCAG CGTATAACGA ATTGATCGAC ATGCAATCTA TTCAATCTAT CAACAAGCTA TTATTCGATC AAATTGAACA ATCCATTCTT GAGCAGGCGT TTCGTGGAGA GCTATAA
|
Protein sequence | MKIEDTTTSS IWDLPSHWGV KKLKLIAKEI SQQIKPADNP STVYNYWGLD AITKGQFQEP KQNLVKGSNI ESTCVTFTEN QIIYSKLRPY LNKVIVPSIP GIGTTEWIVV EPDANVVDRK YLAYVLRSPA FLRYVSRGEN INGARMPRLR KDSFWNFPIP LPSLSNPARS LQIQQSIVVR IESLLSELGE IRELHRRIDL DVSNVMDSIF RDVYIDLENK YPSRQRIDSF TQVKTGGTPS RKHSEYYNGD IPWVKTGELK DGLIKKTEEY ITLEAMQNSN AKKIPIGTLL VAMYGQGQTR GRTGLLAIEA TTNQACCAIL PNPYIFIPRY LQFWFIFMYH DLRKKSDARG GNQANLNSQI IKELKPPLPP IFVQQQVVSY LDAAYNELID MQSIQSINKL LFDQIEQSIL EQAFRGEL
|
| |