Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5076 |
Symbol | |
ID | 5737034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 93117 |
End bp | 94595 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641282241 |
Product | CRISPR-associated Cst1 family protein |
Protein accession | YP_001547832 |
Protein GI | 159901586 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTAAGG AAACTGGTCA TCCGCTGGTC GATGTAGGGT TTGCCACCAT TGCCGCCCAT GTCAACAAAA CCAACATCAA GGCCGTTACT GCCGCAGATC TCGAAAAGGT GGCGGCATAT CTTGAGGCTG AGTTTGTGGT GAACCCATTG CGTTCATTTC TCACCGTCGC GTTTACCAGC AATGCCTGGT TTATCCAAGA TGCTTACAAT CCTGATAAAC CTCAATTAAC TGATGAGCAG CGCACAACCC GTCGGGCAAC CCGTAAAAAA TTGGCTGATG CCCATCTGCG TCAATGGCAA CAACCAAGTA CCAGCGAGCA ACGTTGTGTT TTTACCGATG AGCCAGCAGT TGGCGAAGTT TTATCAAATA CATTAACGGT TGGGCGGGCA GCACGTGCCC AAATTCCCTT ATTGCAGGGC GACGCTACGA TCAACTTTTT TCCGAATGGC AATGCTGGCC TGCCAGTTTC GGGGATTGCT CTCTTGGCGT TGCAAGCTTT CCCGCTGGGC AGCGCCAAGG CTGGCAATGG CATCTTAGTG GTGCACTCGG CCAACCCACG CTTGACCTTA AATTTTGCCA AAACCTTTTA TCAGCGCAAT AAAAGCAAAA TTGAACGAGC ACGAATTGCA GGCGATGATC GGGTTCCAGG CGAAGTACGC TCACCAAAAA CCTTATTAAT TGAAACGTTG TTTGAAATTA CAGCCGAACG CGAAAATTAC GATGACGACC AAATTACCAG TATTGTGGCC TACAACCTGA ACAATGGTAA AACGCCGAGC TTAGCAATTT ATGAACTGCC CCATGAAATT ATTTTGTTCA TGCTCGAAGC TAAAGGGCAT TTTCCTAAGC AATGGAATCA ATTAGTGCAT GGAGCTTGGG AACAAGCTTC GGCGGCAAGT AAGAAATCAA ATCAACATTT TGAGCCACGG CGCAATTATT TTTATGAAGA TATGTTTGAT CTACCAAATA ATGCCAAACG CTTTGTTCGT ACATATTTCC TCAAAATGCC CTATTTAGGC ACGCTCGATG ATGATAATCA ATCACGCTAT TACTTACACG ATTATTATCA TTTGATTGAT TTTCCAATTG TTGAACTTTT TTTAAGGAGG ATTTTCCTGA TGGACGACCT ACGGATCGCA CGAATTAAGC ATTTTGGCGA TAAATTAGCC AATTATGTCC GTGCTGAAAA TGGCAAACGC TTTTTTCGGG CTTTTGCCAG CGAGTATAAA TATCAAGATT TTCGGCAGCG CTTAATTAAA GCGAGTGAAG ATTATGTACG CTCTAGCCAA GAACCCTTAT TTACCATTGA TGAATATGTC GATGTTTTTG AACGCCAATA TAGCGACCAA GCTCCCGATT GGCGGTTATC CCGCGATCTG GTGCTGATTC GCATGATTGA ACAATTAAAG GATTGGCTGA GTCGCAACCC CGATGCGATG CCTGAGCGGC CTGAGCCAAA GCAGGAACAA TCGAATTAA
|
Protein sequence | MLKETGHPLV DVGFATIAAH VNKTNIKAVT AADLEKVAAY LEAEFVVNPL RSFLTVAFTS NAWFIQDAYN PDKPQLTDEQ RTTRRATRKK LADAHLRQWQ QPSTSEQRCV FTDEPAVGEV LSNTLTVGRA ARAQIPLLQG DATINFFPNG NAGLPVSGIA LLALQAFPLG SAKAGNGILV VHSANPRLTL NFAKTFYQRN KSKIERARIA GDDRVPGEVR SPKTLLIETL FEITAERENY DDDQITSIVA YNLNNGKTPS LAIYELPHEI ILFMLEAKGH FPKQWNQLVH GAWEQASAAS KKSNQHFEPR RNYFYEDMFD LPNNAKRFVR TYFLKMPYLG TLDDDNQSRY YLHDYYHLID FPIVELFLRR IFLMDDLRIA RIKHFGDKLA NYVRAENGKR FFRAFASEYK YQDFRQRLIK ASEDYVRSSQ EPLFTIDEYV DVFERQYSDQ APDWRLSRDL VLIRMIEQLK DWLSRNPDAM PERPEPKQEQ SN
|
| |