Gene Information |
Locus tag | Haur_2586 |
Symbol | |
ID | 5734464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3319658 |
End bp | 3320932 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279726 |
Product | hypothetical protein |
Protein accession | YP_001545352 |
Protein GI | 159899105 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
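The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Haur_2586.
length = gene_length(3319658, 3320932)
print(length)                  # 1275
print(protein_length(length))  # 424
```

Both results agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields in the record.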
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000365583 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGG ATGTTGTTCA GATACAGTAT GAGCAAACTG CCCAAGTTGC TCAGCATTTT GGCCGCCTCA AGCAAACGGT TGAACAATTG CAAACGACAA TTCGCAACGT CAGCCAATCA CTGGTTGATG GCGATTGGCA GGGCGATGCC AGCGTTGCCT TTGCGAAAGA ACTCGATGGT GAAATTTATC CAACCTTCAA TCGTTTGATC ACTGCATTCC AAACCGCCCA AGAAGTCACG CTCGAGGTCC AAAAAATCTT TGGTCAAGCC GAAGAAGAGG CCGCTGCCTT ATTCAAAGGC GAGTTATTCG GCAACGCTGA TGGCGGCAGC AAGGGTGGCG GAAGTTGGCT CGATAGCGTT GGTGGCTTCT TCAGCAGTGT TGGCAATGGC ATCAAAGATT TCTTCGTTGG CGCAGGCAAA GAACTCAAGG ATATGGTGGT TGGGGTTTGG AACATGGTCA CCAGCCCAAT CGAAACCGCC AAAGGCATTT GGCATGCGGT AACCCACCCA GGCGAATTCT GGGAAGCCTT CAAAGCGCCG TATGTTGAAG CCTGGGAAAA TGGCCGCCCA TGGGAAGCAA TTGGCCGTGG CACGATGTTT ATTGGCTCGT TGCTAATTGG CACTAAAGGT GCTGATAAAG TTGGTAAAGC GGCTAAGATC AGCAAAGCAG CAACCGTTGC TGATGCAGCA TCAGACATTG CCCGGGTCAG CAACCCACTC CAAGCCGCCA GCGATATTGG CATGCTCGCC AAAGGTGGTG GCGCTGAAAC TGCCATGGCT CGCTATATTG CCAATCAATC AAGCCATGTT GGTGCAGGCA CAGCCTTGAC CGATCGGGTG GTTTTAGGCG CATTCAAGGC AGATCCGGCC TCGGGCTTCC TTGGCTATAT CGGTGAAGCC AACGCTCATG GTGGCCGCTA TTTCAGCACC ACCAGCGATG TGTGGAATAA GCTCAAACCA ATTGCCGAAG GTGGCAATAA TCGGATTTGG CCGGTCAATC GCGAATTCTT GCAAGCCCAG CTTGAAAGTG GCATTAGCCG CATCGATATC AAAGGCAGCT CAATCGACGA TATTCTGACC AAACGGCCTC AATCATATAG CGCCATGGAA GTGCGCTTTA TGCAAAGCCA AGCCTACCAA TATGGCTATC GTCAAGTTGG CAATAGCTGG ATCAAGACTG GCGATTGGCG GGCCAGCACA ACTGGTCGGG TCATTGGCGG CAGCATCGGA CCAGGTGGCG AGATTTTACA AACCAGCCAA TCATTGAACG ATTAA
|
Protein sequence | MSKDVVQIQY EQTAQVAQHF GRLKQTVEQL QTTIRNVSQS LVDGDWQGDA SVAFAKELDG EIYPTFNRLI TAFQTAQEVT LEVQKIFGQA EEEAAALFKG ELFGNADGGS KGGGSWLDSV GGFFSSVGNG IKDFFVGAGK ELKDMVVGVW NMVTSPIETA KGIWHAVTHP GEFWEAFKAP YVEAWENGRP WEAIGRGTMF IGSLLIGTKG ADKVGKAAKI SKAATVADAA SDIARVSNPL QAASDIGMLA KGGGAETAMA RYIANQSSHV GAGTALTDRV VLGAFKADPA SGFLGYIGEA NAHGGRYFST TSDVWNKLKP IAEGGNNRIW PVNREFLQAQ LESGISRIDI KGSSIDDILT KRPQSYSAME VRFMQSQAYQ YGYRQVGNSW IKTGDWRAST TGRVIGGSIG PGGEILQTSQ SLND
|
| |
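The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted with the 10-base spacing shown in the record, and it relies on the fact that table 11 assigns amino acids to codons the same way as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
prefix = "ATGAGCAAGG ATGTTGTTCA GATACAGTAT"
print(translate(prefix))  # MSKDVVQIQY
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.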