Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2812 |
Symbol | |
ID | 5734693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3573037 |
End bp | 3575424 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279955 |
Product | kelch repeat-containing protein |
Protein accession | YP_001545578 |
Protein GI | 159899331 |
COG category | [S] Function unknown |
COG ID | [COG3055] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.743776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTGA TTGCGGTTCG TTGGTCGGTG TTGTTGGTCG TGCTTAGTTT GGTCGCTCTA GCCTGGTTGA AACCAACTCA AGCCCAAATT CCACTTGAAT CCCCCACTCC TGCTATTACT ACAACCACTA ATATGTTTAA TTGGAATACG TTTCAGCCGA TTCCGTTGGC GCGATTTGAG GCTGGCGGGG CAGTTGTTGG TGATAGCTTG TATGTGATCG GCGGTTTTTA TACCAACCAA GTTGAGGCTA CTGATACGGT TTTTGCCTAT AACATCACTA CTAATCAATG GCGAATCTGC GCTAATATTC CCGAAGCGAT GACTCATGCC CCTGTGGTTG CCGATGGCCA TTTGATTTAT GTATTGGGTG GCTATATTGG CAATTCGCCT GGTGGTAGCA CTGATCATGT GTGGGTCTAT AACACGCTGA CCAATGCTTG GAGCCGTGGC CCCGATTTGC CCGAAGATCG CGGCGCTGCT GGGGCAACCA AACTTGGCCG CGAAATTCAC TTTTTTGGTG GAGCGCATCG GCGTAATTTG CACCTCGAAG AATGGGATAG CAACAAACAT TTTGTACTGA ATTTGGATAC TCAAGTTTGG CGCACTGCCG CACCCATGCC CAATGCCCGC AATCACCTTG GAGCGGCGAC GCTCAATGGC TATGTCTATG CAATTGGCGG CCAATATTTA GCTGCCGAAT CAACAGCCGC TCAAGTTGAA GTCAATCGTT ATGACCCCAG CACTGACACT TGGACACGAG TCGCTGATTT GCCCAAAGGT CGCGGGCATA TTACCTCATC GGTGTTTGAA GTCGATGGCC GAATTATGGT CGTGGGCGGC TCGGTTAATG GCGGCGATTA TGGCTTGGCT TCTGCCGATG TGATGCTGTA CGACCCCAAT GATGATGTTT GGATGAAATT AACCTCGATT CCTGGGGTGC GCAAAACTCC AGTTGCCGCA GCCTATGGCA ACAAAATTTT GGTAACGACT GGTGGCTATG TGCCCAATCC CGAAATGTGG ATCGGCCAGC TTGAAAATCA CTGGGAATTA GCGCGAACCT TGCCGATCTC ACTGGGCGAG GCTTCTGGCG GGGTTATCGG CAATAAAATG TATCTGGTTG GTGAGAGCAA TGGTGCAACC GCTGCCTTTG ATCTTGGGGC GAATTCGTGG AATGCTGGCT TGGCGCAACG CCCCTATAAA ACCCATAGCC ACAGTGCCCA AGTCTGGAAC CAGCGGCTCT ATTTGTTTGG CGGGGCAGGC ACAAGCGCAG GCAAAGTGCA AATTTACCAA CCTTCCAGCA ATAGTTGGTC GCAAGGCACG GCCATGCCAT TTGCGACAAT GGCGGCTAGC TCTGCATTCA TTGATGGCAA AATCTATGTT GCAGGCGGCA TTGTCAGCGG CAATACCTCA AATTATCATG CCGCCTATGA TCCAACCGCC AACACTTGGG CCAATTTGCC AAACATGCCA TTGGCGCGTA ATGGTGCTGC TGGTGGCACA GATGGCCATT TTTTCTATCT GTTTGGTGGG CGGGCGAGTG GTACAATCGG TGCTGCCAGC AACGATCTGC AAATCTATGA TCCACTGACC CAAACGTGGA CGAGCAGCGC CAGCGATCCA ACGATTCCGC CATTACCCCA AGCTCGTGCC GATCTGGGCC AAGCAATTTG GTATAAAGGT GAGTTCTATC TGCTGGGCGG GGCCGATCAG GCCGGTGTGA GCAATCGAGT TGATGTGTAT AACCCGTTGA CTAAGAGTTG GCGCAGTGTT GCCCCGATGC CAACTGCCCG TCAGGGTCAT GCTCCAATCT TGGTTGGCGG GCGAATCTAT GTGCCCGCTG GCGGAACTCA AGCCAGTAGC AGCCAATCGC GAATTTTCGA GGTCTATAAT CCAGGCTCGG CTGCGACCCA AATTCCTGCA ACGGCCACGC CAACCGACAT TCCAACAGCG ACAAATACCG TAACTGAAAC GCCGACTGAA ATTCCAACGG CGACGAATAC GGCGACTGAA ATTCCAACCG ACATTCCAAC GGCAACGGCC ACGCCGACTG CAACGCCAAC TGAAACGGCG ACTGCAACGC CAACCGCCAT TCCGACGGCG ACGGATACAG CTACGCCAAC GGCCACGCCA ACTGAAATTC CAACGGTGAC GAATACGGCA ACAGCTACGC CAACTGAAAC GGCGACGAAT ACGGCAACGG CCACGCCAAC TGAAATTCCA ACGGTGACGA ATACGGCAAC GGCTACGCCA ACTGGAATTC CAACGGTGAC GAATACCGCA ACACTAACAG CGACTTATAC CAACACGGTT GTACCAACTG AGACCGCCTT GCCCACGACA ACCTCAATCC ATCAGTTTCG GGTTTATTTG CCGTGGGCAA CTAAATAA
|
Protein sequence | MRLIAVRWSV LLVVLSLVAL AWLKPTQAQI PLESPTPAIT TTTNMFNWNT FQPIPLARFE AGGAVVGDSL YVIGGFYTNQ VEATDTVFAY NITTNQWRIC ANIPEAMTHA PVVADGHLIY VLGGYIGNSP GGSTDHVWVY NTLTNAWSRG PDLPEDRGAA GATKLGREIH FFGGAHRRNL HLEEWDSNKH FVLNLDTQVW RTAAPMPNAR NHLGAATLNG YVYAIGGQYL AAESTAAQVE VNRYDPSTDT WTRVADLPKG RGHITSSVFE VDGRIMVVGG SVNGGDYGLA SADVMLYDPN DDVWMKLTSI PGVRKTPVAA AYGNKILVTT GGYVPNPEMW IGQLENHWEL ARTLPISLGE ASGGVIGNKM YLVGESNGAT AAFDLGANSW NAGLAQRPYK THSHSAQVWN QRLYLFGGAG TSAGKVQIYQ PSSNSWSQGT AMPFATMAAS SAFIDGKIYV AGGIVSGNTS NYHAAYDPTA NTWANLPNMP LARNGAAGGT DGHFFYLFGG RASGTIGAAS NDLQIYDPLT QTWTSSASDP TIPPLPQARA DLGQAIWYKG EFYLLGGADQ AGVSNRVDVY NPLTKSWRSV APMPTARQGH APILVGGRIY VPAGGTQASS SQSRIFEVYN PGSAATQIPA TATPTDIPTA TNTVTETPTE IPTATNTATE IPTDIPTATA TPTATPTETA TATPTAIPTA TDTATPTATP TEIPTVTNTA TATPTETATN TATATPTEIP TVTNTATATP TGIPTVTNTA TLTATYTNTV VPTETALPTT TSIHQFRVYL PWATK
|
| |