Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1281 |
Symbol | |
ID | 5733174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1491032 |
End bp | 1492228 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641278421 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001544057 |
Protein GI | 159897810 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAC TGATTACCGC GCTCGATCAG TTAGCTGAAA CGCCCCAAGC CATCAGTCGC ATTCGCGAGC TTATCTTGCA ATTGGCTGTG CAAGGCCGCT TGGTCGAGCA AGATCCTAAC GATGAGCCAG CGTATAACAT GTTTAAGCCG CTAATCAAAG AGCAACAAAT GTTGAATATA GACGTTAGAT CATCTATAAA TAAAGAACAT ACAAAATTTC AGATTCCTCC TTCATGGATA TGGGTATCAC TGGATGATAT TGTAGTCTAT GATGCAGGTT CAAAGCATGA TCCCAACAAT CTTGATCCTG ATAGTTGGTT GTTAGAACTT GAGGATATTG AAAAAAATAC CTCTGTTATT TTAGGACAAT TTCTAGTAAA AGAGCGAAAG CCTAAAAGTA ACAAAGCAAG CTTCCAGAAA AATGATATTC TTTATGGAAA ATTGCGACCT TATTTGAATA AAGTTATTGT TGCTCATACT TCAGGATTTT GTACTACTGA AATAGTGGTG TTACGTCCAA AATTGGAATT GAGTCCCTTC TATATACAAA ATTTCCTCAA AAGCCCCTTT TTCGTTAGCT ACGTAAACCA ACATTCATAT GGAACAAAGA TGCCTCGACT AGGAACACTA GATGGCAAAA AGGCATCTAT ACCCCTACCA CCACTCGCTG AACAACAACG CATCGTCGCC AAAGTTGCGC AATTGATGGC GTTGTGCGAT CAGCTTGAGC AGCAGCAAAC CAGCCGCGAG GCGCTGCGCC AGCAAGTCCA GCAAAGCGCA ATCAAGCAGC TTTTGAGCGA GCTAGCCCGA CCAGCCGATG CGCAGCAGAT TGCCCAACCA AGCCAACCTG AGCGCCAAAC CAGCCTCTTT ACTCGGCCCA CTCTGGCGAA TCAACCAATC GAGCCGCTTG CTGACGACGG ATTGAGCATA AGCAGTGAGC AACAGTTGTT TTTTGAGCAG TTTGACGATC TGCATACCAC GCCCAAAGCA ATCGGCCAAT TGCGCGAATT GATTTTGCAG CTCGCCGTGC AAGGCCGCTT GGTGGCTCAA AACCCCAGCG ACGAGCCAGC GAGCATTTTA TTGGAACGAA TTCAAGCCGA AAAACAACGC CTGATTGCAG CGGGCCAACT CAAGCCCGAA AAAGCGCTTA CGCCCATCGC CGCCAGCGAG CTACCATTTG GCTTACCCAA GGGCTAG
|
Protein sequence | MNELITALDQ LAETPQAISR IRELILQLAV QGRLVEQDPN DEPAYNMFKP LIKEQQMLNI DVRSSINKEH TKFQIPPSWI WVSLDDIVVY DAGSKHDPNN LDPDSWLLEL EDIEKNTSVI LGQFLVKERK PKSNKASFQK NDILYGKLRP YLNKVIVAHT SGFCTTEIVV LRPKLELSPF YIQNFLKSPF FVSYVNQHSY GTKMPRLGTL DGKKASIPLP PLAEQQRIVA KVAQLMALCD QLEQQQTSRE ALRQQVQQSA IKQLLSELAR PADAQQIAQP SQPERQTSLF TRPTLANQPI EPLADDGLSI SSEQQLFFEQ FDDLHTTPKA IGQLRELILQ LAVQGRLVAQ NPSDEPASIL LERIQAEKQR LIAAGQLKPE KALTPIAASE LPFGLPKG
|
| |