Gene Information |
Locus tag | Haur_1335 |
Symbol | |
ID | 5733227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1546936 |
End bp | 1547901 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278473 |
Product | HNH endonuclease |
Protein accession | YP_001544108 |
Protein GI | 159897861 |
COG category | [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease |
TIGRFAM ID | |
|
|
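The coordinate and length fields in the table above are mutually consistent, and the check is short enough to sketch. The example assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both are assumptions about the record's conventions, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 1546936
END_BP = 1547901


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 966 321
```

Under these assumptions the result matches the listed Gene Length (966 bp) and Protein Length (321 aa).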
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.516631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTTT GCACGCGCTG CCACCAAGAT ACTGGTCTTA TGGGTGCTTT ACGCTTCAAT CGTCAAACCA ATCGCTGCGG CCCCTGCGAC GGCGTTGTCC AACAACAACT CCAGCGCTTT CGCCAGGCGT TTCTGCAATT TTGTAATGAT GGGCTTTTAT CGCCACAAGA ATGGACGATC CTTATTCAAG GCTCGCAGCA AGAAGGCTTG AGCTGGGATG AGGCGCTGAA CTATATTCGT GGTGATGCAC TGAACTTCCT TGAACGCACA CTGGCTTTTG CATCGACCGA TGGCGTAATT ACGAATGAAG AAGCCGCCCA TATTCAGCAA CTCCAACAGG CCTTTCGCAT CCCCGATCAT CAAGCCCAAC CGCTGCTCCA GCGCCTCAAA TACTTGCAAA CCATTAGTAA TGTGCGCCAA GGTCATCTGC CAACGATTCA ACCAACCGTT CGGCTCGATG CTGGCGAAAT CTGCCATCTC GAAACTGATG CCCGCTATCA TAAAGTCACC GCCAAAGGCA CTACCTTGCA ACCTGGGCGT TTTGTGGCTA CCAATAAGAA ACTGCATTTT CTTTCCCAAG CGGGTGGAAG CAGCATCGAC TGGAAGAAGG TCATGCGGAT TAACGCTGAG GCAGGCAGCA TCTACTTAGA ACTATCGGTT AAAAGTGGGA ATGGTCGCTA TAGCGTGGCA GATCCAACCA TGGCTGAAGC AGTATTTGAT GTGTTAGTGC GTATGGCCAA ACGAGAGTTT ATTGCACCCC AAACTGGACA AGAAAGCCGC TATATTCCCC AAGATGTAAA ACTGGCAGTG TGGCAACGTG ATCAGGGCAA GTGTACCCAG TGTGGCGATG CCTCTTATCT TGAGTTTGAT CATATCATTC CGCACAGCAA GGGCGGAGCA AACACCGTCG GCAATGTGCA GCTGCTTTGT CGGAAATGTA ATCTGGCGAA GGGTGATCGG ATTTAG
|
Protein sequence | MPVCTRCHQD TGLMGALRFN RQTNRCGPCD GVVQQQLQRF RQAFLQFCND GLLSPQEWTI LIQGSQQEGL SWDEALNYIR GDALNFLERT LAFASTDGVI TNEEAAHIQQ LQQAFRIPDH QAQPLLQRLK YLQTISNVRQ GHLPTIQPTV RLDAGEICHL ETDARYHKVT AKGTTLQPGR FVATNKKLHF LSQAGGSSID WKKVMRINAE AGSIYLELSV KSGNGRYSVA DPTMAEAVFD VLVRMAKREF IAPQTGQESR YIPQDVKLAV WQRDQGKCTQ CGDASYLEFD HIIPHSKGGA NTVGNVQLLC RKCNLAKGDR I
|
| |
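As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a DNA string and computes its GC fraction. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand). It uses the amino-acid assignments of NCBI translation table 11, which are the same as the standard code. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and ambiguous bases such as N are not handled.

```python
# Codon table built from the NCBI layout (base order T, C, A, G).
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCCAGTTT GCACGCGCTG CCACCAAGAT"
print(translate(first_blocks))  # prints: MPVCTRCHQD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.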