Gene Information |
Locus tag | Haur_3095 |
Symbol | |
ID | 5734967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3903033 |
End bp | 3904538 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280239 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001545861 |
Protein GI | 159899614 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | |
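The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3903033, 3904538)
print(length_bp)                  # 1506
print(protein_length(length_bp))  # 501
```

Both results agree with the Gene Length and Protein Length fields of this record.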
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000127417 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACG AAGCGCAAAA TGTCAGCGCG ACTAACGGTG GAGAGGAACA AGAAGAGGGC GTATTGCAAA AGCTGGCCGA CAAAGCCCGC GAAGTAGTCG CCGAAGTGAA GGAAGAATTG GCCGAAGTGA AGGAAGAAAT TGGCGAAGCA TTGGCTGAAG CCCGCGAAAA AGCGAGCGAA GTCTTGGCTG AAGTCAAAGA GCGAGTTGGC TTAGGTGGCG ATGATGAAGC TACCGAAGAA GCTGGCGCAA CCTTTGAACC AAGCGGCGAC GAAGATGGCA CTACGCCACG TCGTTTGGCC GACTTGCACG CTGGGATGGA GCTTGATGGT AAGGTCACCA GCACCGCCTT GTACGGTGTG TTTGTTGACA TTGGCGTTGG CCGCGATGGC TTGGTTCACA TCTCAGAAAT GAGCGACCAA CGCATCGAAT CACCAACCGA TGTTGTGCAA ATTGGTGATA TTGTCAAAGT TCGCGTCAAG AGTGTTGATC CAGATGCTCG CCGTATCAGC TTGACAATGC GCTCGCCTCG TTCAGAAGGC CGCCGCCGCG CTCCCAAGCG CCCTGAAGTC AACAACGACA AATTGGGCGA ATTGAAGCCA GGCGATTTGG TTGATGGTAC GGTTAACGGC ATCGCGCCAT TCGGCGTGTT CGTTGACATC GGTGTTGGCA AAGATGGCTT GGTTCACATT TCTGAGCTTT CAGAAAACCG CGTGGAAAAA GCTGAAGATG CTGTCACCGT TGGCCAAAGC TACACCTTCC GCGTGTTAGA AGTTGACACT GGCGCTCAAC GCATTAGCTT GAGCTTGCGC CGCGCCAAGG AAGATTTCCA AGAACGGCCA AAAGCCCCAC GCCGCCGCGA AGTTAACTTA GATGTGATTG CTCCAGGCAC CGTGCTCGAT GGCAAAGTCA GCGGGATTGC TCCTTTCGGC GCATTTGTTG ACCTTGGCGT TGGCCGCGAT GGTTTGGTCC ACATCTCAGA GCTTTCCGAA GGTCGGGTTG GCAAAGTTGA CGATGTGGTT AAAGTTGGCG ATCCAGTCAA AGTTCGCGTG TTGGAAGTCG ATCCCGATTC AAAACGGATC AGCTTGACCA TGCGGGTTGA AGAAGCTCCA ACCACCCCCA TCTCAACCAG TGGCTCATCA CGCTTGGATC GCGACTGGAC AAACCCAGCA AGCCGTGAAG AACGCCCACG CGAAGAACGC CGCGCTGTTG GTGGTGGTAA CCCTGGTGGC AACCCTGGCG GTGGCCGTCG TAACGACCGC CGCGACCGCC CAGCCCGCGA ACCTGAAATC TATAGCGTTG GTGGAACTGA AGAAGAAGAT TTTGGTGGTA ATGCAACCCT CGACGACTTG TTGTCGAAGT TTGGTTCAGG CCACGATGAT CGCCGTTCAG CTCGCCGTCG CTACGAAAAG CCTGAACAAG AAGAAGAAGA CGGATTTGAA AATCGTGAAT CACGCGCCCG CCGCGATGCA ATTCGCCGCA CCTTGCGCGA TTCACAAGAA GACTAA
|
Protein sequence | MTDEAQNVSA TNGGEEQEEG VLQKLADKAR EVVAEVKEEL AEVKEEIGEA LAEAREKASE VLAEVKERVG LGGDDEATEE AGATFEPSGD EDGTTPRRLA DLHAGMELDG KVTSTALYGV FVDIGVGRDG LVHISEMSDQ RIESPTDVVQ IGDIVKVRVK SVDPDARRIS LTMRSPRSEG RRRAPKRPEV NNDKLGELKP GDLVDGTVNG IAPFGVFVDI GVGKDGLVHI SELSENRVEK AEDAVTVGQS YTFRVLEVDT GAQRISLSLR RAKEDFQERP KAPRRREVNL DVIAPGTVLD GKVSGIAPFG AFVDLGVGRD GLVHISELSE GRVGKVDDVV KVGDPVKVRV LEVDPDSKRI SLTMRVEEAP TTPISTSGSS RLDRDWTNPA SREERPREER RAVGGGNPGG NPGGGRRNDR RDRPAREPEI YSVGGTEEED FGGNATLDDL LSKFGSGHDD RRSARRRYEK PEQEEEDGFE NRESRARRDA IRRTLRDSQE D
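The sequences above are printed in blocks of ten characters, so the spaces must be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and the coding sequence translated. It assumes the gene sequence is already shown in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGACGGACG AAGCGCAAAA TGTCAGCGCG"
print(translate(first_blocks))  # MTDEAQNVSA
```

The printed residues match the first block of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.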