Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1076 |
Symbol | |
ID | 5732865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1231139 |
End bp | 1232665 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278214 |
Product | PSP1 domain-containing protein |
Protein accession | YP_001543852 |
Protein GI | 159897605 |
COG category | [S] Function unknown |
COG ID | [COG1774] Uncharacterized homolog of PSP1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTTGG TTGTTGGAGT TAAGTTTAAA GATTCGGGCA AAATTTACCA TTTTGACCCA AATCAACATC ATTTAGAGCT AGGCGATGCC GTAGTAGTTG AAACGGTACG CGGGCTAGAG CTTGGCAAAG TGGCAGCACC GATCGAAGAT CTACCCGATT CTGATTTGAT TGCCGAACTC AAACCAGTGA TTCGCCAGCC CACCACCGAA GATTATGACC GCATGCGAGT GCTAGCTGAG CGACGCGATG ATGTATTGCG GATTTGTGCC GAACGAATTA GCGTGCATCG CTTGCCGATG CGCCTGATTC GCAGCGATTG GAATTTTGAT GGCACCCGTT TGACGATCTA TTTCACCTCG CAGCAGCGGG TTGATTTTCG CCATTTGGTA CGCGAACTAG CGCGAATTTT CCATGCACGG ATCGAATTAC GGCAAATTGG GGCACGCGAT GAAGCCAAAC TGCTGGGCGG ATTAGGGCCA TGTGGCCGAC CATTGTGCTG CTCAACCTTC TTGCCCGATT TTGCCCGCGT TTCGGTCAAA ATGGCCAAAG ATCAAGACTT GCCACTCAAT CCCTCGAAAA TTTCGGGGGT TTGCGGGCGC TTGCTCTGCT GCCTTTCGTA TGAACAAGAG CAATATCTTG AGATTAAGGC CGAGCTGCCA ACCCGTGGCG AGCAGGTCAA AACGCCCGAA GGCATTGGCT ATGTAGTTGC TGTCAATACC ATTCGCGAAA CCGTTACCGT TGATATTGAA AGTAATTACC ACGATTTCAA AGCTGATCAA CTTGAAACCC TGACAGGCGC AGCTGGAGCG ATTGCCCGCG AACGGCTCGA ACAAGGCGAG GCCGCACCAA TTGCCCAAAA ACGTTTCAAC CGCCCAGTTG GCGAAACGAT TAAATCAACG CTGAACAACT GGGATAACGA AGATTGGGGC TTGGATGAAC TACGCTCGTT TGAGGAAGAT CCCGAAAGTT CAGGCGATAA GCCCAAACCA CGCCCCAAGC CAGCCCAACA AGCTCGGCCA AACCCCGAGG CACGCGAACA ACGTACAAAT CCGAACGAGC AACGGCCACG CTTTGACCGA GCAGAACGCC AGAAACGCTT TAATCGCTCG GAGGCGGCCA ACGCTGAGGC TAGCGTAGCC CCAACACCAA GCAACGAGCC AAGTGAACGG AAACCACGTC ATGCCTTCAA GCGCAAAGGC GACACGACCC CAACGCCTGA GCGACCAAAA CTACCGGCGA CAGTGCGTCA GCAATCGCTG GGCGGCGTGC CAGAGCAACT AACTCCGCGC GGTCGGCAGC CCTCGACGAA TGACCCTGAT GAACCGAAGC AACCAAATCG TCGCCGACGG CGCGGCAATG GTGGTAATGG CAATCAGCAA GGGCAACAGC AAGTTCAGCC CAAGCCCAAC CAGCCGCAAG TAACATCAAG CAAGCCATCT ACGGTCGAGA AAACCGAATC AGAATCAGCG AAGCCAGAGG CGAATAACGA GCGTCGTTCG CCAAGGCGAC GCAAACGCCG CAGCTAA
|
Protein sequence | MPLVVGVKFK DSGKIYHFDP NQHHLELGDA VVVETVRGLE LGKVAAPIED LPDSDLIAEL KPVIRQPTTE DYDRMRVLAE RRDDVLRICA ERISVHRLPM RLIRSDWNFD GTRLTIYFTS QQRVDFRHLV RELARIFHAR IELRQIGARD EAKLLGGLGP CGRPLCCSTF LPDFARVSVK MAKDQDLPLN PSKISGVCGR LLCCLSYEQE QYLEIKAELP TRGEQVKTPE GIGYVVAVNT IRETVTVDIE SNYHDFKADQ LETLTGAAGA IARERLEQGE AAPIAQKRFN RPVGETIKST LNNWDNEDWG LDELRSFEED PESSGDKPKP RPKPAQQARP NPEAREQRTN PNEQRPRFDR AERQKRFNRS EAANAEASVA PTPSNEPSER KPRHAFKRKG DTTPTPERPK LPATVRQQSL GGVPEQLTPR GRQPSTNDPD EPKQPNRRRR RGNGGNGNQQ GQQQVQPKPN QPQVTSSKPS TVEKTESESA KPEANNERRS PRRRKRRS
|
| |