Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0502 |
Symbol | |
ID | 5732416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 583543 |
End bp | 584790 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277628 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001543281 |
Protein GI | 159897034 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAGC AAACACAACC GGGCATGGAT CAATCTGATG TGCTTCCCAG CACAGGATTG AACGCGACCC CACAGGCCGA TTATTCTGGC GATGATGATC GTGCCCTGCT GGAGGAGTAT CTCCGTGATC CAGCCCACGA TTATCGCAAT CTCAAGTATG GTGATTCGGT CGATGGCACG ATTGTTCGCG TTGATCGCGA CGAGGTGTTG GTCGATATTG GCTCCAAGTC GGAAGGCGTG GTGCCAGGCC GCGAGATGAC CAGTCTGTCG TCGGAAGAAC GCGCCGAACT CAAGGTTGGC GATGTCTTGC TTGTTACAGT CGTTCAAACC GAAGACGCTG AAGGGCGTAT CGTGTTGTCG ATAGATAAAG CACGCCAAGA AAAGAGTTGG CGAGCCTTGC AAGTCAACCA TGAGGCTGGC GATGTGATTC ACGCCGCCGT GACCAACTAT AACAAGGGTG GTCTGTTGGT TAATTTAAGT GGGGTGCGTG GCTTTGTGCC ATCATCACAG GTCAGCAGCG TCAGCCGTGG CTCCGATGTC CAAAAACAAT CGGATATGGC AAAACTGGTC GGCCAAACCT TGCCACTGAA AATTATCGAA ATCAATCGTT CGCGCAATCG GCTGATTCTA TCCGAGCGCC AAGCCGTCCA AGAGGTTCGC GATTCGCGCA AGGATCAACT GCTTGAAAAA CTGGAACCAG GCGCAGTTCG CACTGGCCGC GTAACCAGTT TGTGCGATTT CGGCGCGTTT GTCGATATTG GCGGAGCAGA CGGTTTGGTT CACCTTTCCG AGCTTTCTTG GAGCCGCGTC AAACATCCCG AGGAAGTGCT GAAAGTTGGC GATGCAGTCA GCGTCTATAT TTTAAGCGTC GATGAAGATA AAAAACGCAT CGCGCTGAGT ATCAAGCGCA CCCAAGCTGA GCCTTGGACA ACCGTTACCG ACCGCTACCA AATTGGCCAA AGCGTTTCAG GGGTTGTTAC TCAATTGACC GCCTTTGGCG CGTTTGTCCG GCTTGAAGAT GGCATCGAAG GTCTGATCCA CATCTCAGAA ATGAGTGATG AACGGATTCA GCACCCACGC GATGTGATTA ATGAAGGCGA TAGCGTTTCA GCCCGCATTA TTCGGATCGA CCCAACGCGC AAGCGGATTG GCTTGAGTAC CCGCAGTGGC AGCGCTGAAG CAACCGCTGA AGCAACTGCT GAAACAGCAA CCGAAGAACC AAGCGCTGCA GCCGAAGACG AAGAATAA
|
Protein sequence | MDEQTQPGMD QSDVLPSTGL NATPQADYSG DDDRALLEEY LRDPAHDYRN LKYGDSVDGT IVRVDRDEVL VDIGSKSEGV VPGREMTSLS SEERAELKVG DVLLVTVVQT EDAEGRIVLS IDKARQEKSW RALQVNHEAG DVIHAAVTNY NKGGLLVNLS GVRGFVPSSQ VSSVSRGSDV QKQSDMAKLV GQTLPLKIIE INRSRNRLIL SERQAVQEVR DSRKDQLLEK LEPGAVRTGR VTSLCDFGAF VDIGGADGLV HLSELSWSRV KHPEEVLKVG DAVSVYILSV DEDKKRIALS IKRTQAEPWT TVTDRYQIGQ SVSGVVTQLT AFGAFVRLED GIEGLIHISE MSDERIQHPR DVINEGDSVS ARIIRIDPTR KRIGLSTRSG SAEATAEATA ETATEEPSAA AEDEE
|
| |