Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3713 |
Symbol | |
ID | 5735577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4670188 |
End bp | 4671381 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280865 |
Product | RNA-binding S1 domain-containing protein |
Protein accession | YP_001546477 |
Protein GI | 159900230 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | [TIGR00717] ribosomal protein S1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000276005 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGTGG ACACCGAAAG CATGGATGCC GTTCAAGATT GGACGACACT GCTTGCTGAG TACGATTATC AACGCCCAGA ACGCGGTCAA TTGCGTGAAG GTATCGTCAT GCGCGTCGAA GACAGCCAAA TCTTGGTTGA CATTGGCGCA AAACACGAGG GCGTTATCCC TAACCAAGAT CTGCGCCGTT TGCCGCCAGA ATTGGTCAGT GGGATCAAAA ACGGTGACAC ACTGCAAGTG TACGTGATGG AGCCAGAATC AAAAGAAGGC GAACTGGTTC TTTCATTGAA CATGGTGCAG GTCGAGCGCG ATTGGCAAGA AGCCCAAACG ATGCTCGAAA ATGGTCAAAT CATCGAAGCT GGCGTGGTCG GCTACAACAA GGGCGGTTTG TTGGTTCAAG TTGGGCGTGT GCGCGGTTTC GTACCAGCCT CACAAGTGGT CAACTTGCAC AGCCGCACTG GCACTGAAGG CCAACAAAGC GCCATGACCA AGATGGTCGG CCAAAATATT CCTTTGAAAG TTATCGAAGT TGATCGCGAT CGCAATCGTT TGGTGCTTTC AGAGCGTGCC GCTATGCAAC GCTGGCGACA ATCGCAAAAG GAACGCTTGC TCGAAACCCT CGAACCAGGC GCAGTCGTCA CTGGTCGGGT CAACCAACTC ACTCCATTCG GTGCTTTCAT CGATTTGGGC GGTGCTGATG GTTTGGCTCA CATCTCAGAG CTTTCATGGC AGCGCGTCAA CCACCCACGC GAAGTCTTGC AACCAGGCCA AGAAGTCCAA GTATACGTCT TGGAAGTCGA TCGCGATCGC GAACGGATTG GCTTGAGCTT GCGCCGTTTG CAGCCAGATC CATGGGCAAC CATCGATCAA CGCTACGACC TCGGCCAATT GATCGTTGGT GAAGTAACCA ACATCGCTCC TTTCGGCGTG TTTGTACGCG TTGAAGAAGG CGTTGAAGGT TTGATCCACG CTTCAGAATT GACCGAAAAC GGCCAATCGC CCGACTCGTT GCAACAAGGC CAACAAGTGC AAGTGAAGGT GATCAGTCTT GATCGCCAAC GCCAACGCCT TGGCTTGAGC TTGCGCCGCG TCGATGGCGA AGGTGAAGCC GCCGAAGCAC CAGCAGCTCC TGTAGCTGAA GTGGTCGCCG AAGCCGCTAC CGAAGCTACC ACCGAAGAAG AAGTCGGCGC ATAA
|
Protein sequence | MTVDTESMDA VQDWTTLLAE YDYQRPERGQ LREGIVMRVE DSQILVDIGA KHEGVIPNQD LRRLPPELVS GIKNGDTLQV YVMEPESKEG ELVLSLNMVQ VERDWQEAQT MLENGQIIEA GVVGYNKGGL LVQVGRVRGF VPASQVVNLH SRTGTEGQQS AMTKMVGQNI PLKVIEVDRD RNRLVLSERA AMQRWRQSQK ERLLETLEPG AVVTGRVNQL TPFGAFIDLG GADGLAHISE LSWQRVNHPR EVLQPGQEVQ VYVLEVDRDR ERIGLSLRRL QPDPWATIDQ RYDLGQLIVG EVTNIAPFGV FVRVEEGVEG LIHASELTEN GQSPDSLQQG QQVQVKVISL DRQRQRLGLS LRRVDGEGEA AEAPAAPVAE VVAEAATEAT TEEEVGA
|
| |