Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1477 |
Symbol | |
ID | 5733362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1725813 |
End bp | 1727123 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278615 |
Product | hypothetical protein |
Protein accession | YP_001544249 |
Protein GI | 159898002 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1030] Membrane-bound serine protease (ClpP class) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.135375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACAGC TGCGAATCCT AGCTTTAATT GTTTGGCTAA GTTTGGTAAC CCTACCAAGC TTGAGTTATG CTGCGACTGA CGATGCTATG TGTGTGGCTT CGGTCAGTGG CATTATCAAT CCAGCGGTTG CCGATTATCT GCAACGGAGC GTGATCCAAG CTGAACAGCA AGCATGTCAA GGTTTAGTGA TTAAATTAAA CACGCCTGGT GGGCTAACAA CCTCAACTTG GCAAATTGGC GAGACTGTAC TCAATGCCAA ATTGCCAGTG ATTGTCTATG TTACGCCCCA AGGTGCGAAT GCTGGTTCGG CTGGAGTTTT TATTACCTAT GCTGCCCATA TTGCCGCAAT GTCGCCCAAT ACCAACATTG GTGCGGCCCA TCCAGTCGAT GGGAGCGGCG TAGATATGGA AGGTGATCTG CGCGATAAAA TTACCAACGA TGCCGTTGCG CGAATCACCA CTTGGGCCGA ATCGAATGAT CGGGATGCTG AATGGGCTGA GCAAGCAGTG CGCCAAAGTG TTTCGATTGG TAGCAACGAG GCACTTGAAC TCGGTGTGAT TAATCTGATT GCCCAAGATG ATGCTGATTT ATTGCGTCAG CTTGATGGCC GCGAAGTAAC CTTGGCCAAT GGTAATCAGG TGACGCTGCA AACCCAAACC AGTGCTTTCC AACCAATTGA AATGACGTGG CTCGAACGGC TTTTGCACTT CTTGGGCGAC CCCACGATTG CCACAATGTT GATTAGCCTC GGTAGCTTGG GCATTTATTT CGAGGCGGCC AACCCAGGTT TAGGCGTGGG TGGTTTCTTG GGCGTAATCG CGATTTGCTT GGGCTTGTAT GGCCTCAGTG TACTGCCGCT CAACACGGTT GGGGTGGTGC TGCTGATTCT GGCTTTTGTG CTATTTGGGA TTGATATTTT TGCGACCGCA CACGGAGCTT TGACGGTTGG TGGCCTAGCT TCATTTGTGA TTGGTGCCTT GTTGCTGGTT GATCCAGCCG AAGCACCTGG GATTATCGTC TCACGCGCCT TGATTACAGG CCTTGGCCTT GGGCTGGCAA GCGTGATTGG GCTAAGCATT TGGATTATTC GCCGCAGCAA GCGCGTTGGT CGTGGAGCCA GCGGCGATCG TTTGGTTGGC ACAATTGCTA AAGTTCGTTC GTCAGTTGCG CCTGAAGGCA CGGTATTTGT TGAAGGAGCC TTGTGGCAAG CCCGTTCCGA TGATGTATTG ACCGTAGGCG ATCAGGCTGA AATTGTTGGT TTAGATGGTT TAACCTTGAT TGTGCGTCGA GTGGCGCATA GTGGGCGCTA A
|
Protein sequence | MRQLRILALI VWLSLVTLPS LSYAATDDAM CVASVSGIIN PAVADYLQRS VIQAEQQACQ GLVIKLNTPG GLTTSTWQIG ETVLNAKLPV IVYVTPQGAN AGSAGVFITY AAHIAAMSPN TNIGAAHPVD GSGVDMEGDL RDKITNDAVA RITTWAESND RDAEWAEQAV RQSVSIGSNE ALELGVINLI AQDDADLLRQ LDGREVTLAN GNQVTLQTQT SAFQPIEMTW LERLLHFLGD PTIATMLISL GSLGIYFEAA NPGLGVGGFL GVIAICLGLY GLSVLPLNTV GVVLLILAFV LFGIDIFATA HGALTVGGLA SFVIGALLLV DPAEAPGIIV SRALITGLGL GLASVIGLSI WIIRRSKRVG RGASGDRLVG TIAKVRSSVA PEGTVFVEGA LWQARSDDVL TVGDQAEIVG LDGLTLIVRR VAHSGR
|
| |