Gene Information |
Locus tag | Haur_3158 |
Symbol | |
ID | 5735030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3986763 |
End bp | 3989207 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280301 |
Product | TPR repeat-containing protein |
Protein accession | YP_001545923 |
Protein GI | 159899676 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
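
The coordinate and length fields in this table agree with one another, and a short check can illustrate how. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 3986763
END_BP = 3989207


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming exactly one trailing stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2445, matching "Gene Length"
print(protein_length(length_bp))  # 814, matching "Protein Length"
```

Under these assumptions, both computed values match the listed 2445 bp and 814 aa.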
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.385095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATGC CAGCCCTAAT TGGATTGCTC GTTGATGCTG TGCTTGCTGT GTCGCCTGCC GCTGATCGCC TGACCATTGA AGCGGCGTTT GGTGCGATGT TGGCAGGTAA ACCAACGCTA CTTGATGATG CTCCTTTATC GCTGCTAATT GATCAATCGA GCGCTCAATC CCATGCGACA ATCGTGTATG CAGGCCAGAC GATTGCTGTT GAGCTGGCTG AGCCGCTTGA CCCGCTGGTT GTAGCGCTAG CGGCGGTCGC CTCGTTGCCA CTGACAACTG TGCCTGCCCC GCGTGCCGAT CAACCAGCGG CCTCGCGACT ACCATTTGAA TCAAGTAGCA CGTTTGTTGG GCGTGAGCAA GAATTGTTGG CGCTGGCCGC CGCGCTTAGC CATGCTCAGC CTACGATTGT GCTGCCCGCC GTGGCAACTG GCCTTGGCGG AATTGGCAAA ACCAGCTTGG TCACTGAGTT TGCCTATCGC TATGGCAGCT ATTTCCACGG CGGCGTATTT TGGATCAACT GCGCTGATCC TGAGCAGGTT GAAAATCAAA TTGCTGCATG TGCCGAGGCG CTGGCGATCG ATCCAACAGG TTTGACGCTT GATGCCCAGG TGCAACACGT TTTAGCAGCA TGGCAAGCGC CGCTGCCGCG CCTACTCATT TTCGATAATT GTGAAGATCC AGTGATCCTT GAGCGTTGGA TGCCGACGTT GGGCGGTTGT CGGGTGTTGG TGACCGCCCG CAATCAATTA GCAACGATGA GCGCGATTCG GCTTGGAGTT TTGGCTCCTG CCGAAAGCCG TGCCTTACTA CAACAGCTTT GCCCACGCCT GACAACTGCT GAAGCCGAGG CAATTGCCGC CGATCTTGGG CATTTGCCGT TGGCCTTGCA GCTTGCAGGC AGCTACCTCA ATACTTATGA TCAACAGAGT GTTGCTCAAT ATCGCCAGGA TTTGGCGGTT ACTCATCATT CGCTCAAAGG TGGTGCGGGA TTGCCCTCGC CAACCCGCCA TGAACAAGAT GTTGAAGCGA CGTTTATGCT CAGTTTGCAC CAGTTTGATT CGGCTAATGC ACTGGAGATG TTGGCCTTAG ATATGTTGGA TGGTGCGGCT TGGTGTGCGC CAGGTGTGCC AATCCCCCGT CAGCTTGTGC TCGATTTTGT TCCCGATGAA ACGAATGCTG AAACTGCACT CGCAGCGCTG CAATTGTTGG AGCAACGTGG ATTAATTGAT GGGAGCGAGG CGCTGGTTGT GCATCGTTTG TTGGCGCAAG TTGTTCAGGT TCATCGCGGC TCAGCACAAA TCCGCGAACT GGCCGAATAT CGAATTAACG AGCATGCTGT GCGAATTAGT GCCACGCGTG TGCCAAAGCA GATGCTGCCA CTTGAGCCAC ATTTGCGCCA TGTGACCGTG CGAGCATTGG CGCGTGAGGA TGAACGGGTC GCGCGTTTGT GTAATAGTCT GGGCTATTGG GAGCACTTGC GTGGCGTTTA TGGTGAGGCC GAGCGCTGGT ATGAACGGGG CTTGGCGATT ATGCAAAAGG TCTTAGGGCC AGAGCATCAA AATACTGCCC GCATGATGAA TAATTTGGCA GGTATTCGTT TGGAGCAAAT GCGCTATGCC GAGGCGCAGG CCTTGTATGA GCAGGTTTTG GGGATTTGGA ATGTCACTTT TGGCCCAGAG CATCCTGATA CCGCGCGATG TATGAACAAT CTGGCCTCGG CTTTAGGGCG ACAAGGACAG AATGCCGAGG CCTTGGCGAT GCTTGAACAA GCATTGGTAG TTTGGGAAGC AGCCTTAGGC 
CCAGAGCACC CCGATACCGC GATTAGTATC AACAATCTGG CGGTAGCCTT GGAGCGCGAA GGGCGCTATG CCGAATCGCA GGTGTTACAG GAACGAGCAC TGAAGGTATG GAAAAAAACT TTAGGGCCAG AGCATCCCGA TACTGCGGCA AGTTTGAACA GTTTGGCCCG CTTGTTGGAA CATCAAGGCA AATATTCACA AGCTCTGCCA TTCTATCAAC AGGCGTTGGC AATTCGCGAA ACCGCCTTGG GGCCAGAACA TCCCGACGTT GCTTCTAGCC TGAACGATCT GGCGGGATTG CTGATCGAAC AAAAACGCTA TACTGACGCG CAGGCCTTGT ATGAACGGGC GCTGACGATT CGTGAATTGG TGTTTGGCCC AGAGCATGCC GATACGATCA CTGCTATGGC AAATTTGGCG GTGGCATTGG AGCGGCGCGG CCAATACCGC GAAGCGTTAG AGTTGCATGC CCAGGCATTG ACCATTAGCC GAAAAGTTTT TGGCGATAAT CATCAGACGA GCCAACGGAT TCGTGCTAGC CATGCCCGAA CCGTCCAAGC GATTCAAGAA GCCTTCGACC AATCGGCAGC CAAACGATCC AGTGGCAAAC ATACCAACGA TGATCACGTT AAACAATGGA AGTAA
|
Protein sequence | MEMPALIGLL VDAVLAVSPA ADRLTIEAAF GAMLAGKPTL LDDAPLSLLI DQSSAQSHAT IVYAGQTIAV ELAEPLDPLV VALAAVASLP LTTVPAPRAD QPAASRLPFE SSSTFVGREQ ELLALAAALS HAQPTIVLPA VATGLGGIGK TSLVTEFAYR YGSYFHGGVF WINCADPEQV ENQIAACAEA LAIDPTGLTL DAQVQHVLAA WQAPLPRLLI FDNCEDPVIL ERWMPTLGGC RVLVTARNQL ATMSAIRLGV LAPAESRALL QQLCPRLTTA EAEAIAADLG HLPLALQLAG SYLNTYDQQS VAQYRQDLAV THHSLKGGAG LPSPTRHEQD VEATFMLSLH QFDSANALEM LALDMLDGAA WCAPGVPIPR QLVLDFVPDE TNAETALAAL QLLEQRGLID GSEALVVHRL LAQVVQVHRG SAQIRELAEY RINEHAVRIS ATRVPKQMLP LEPHLRHVTV RALAREDERV ARLCNSLGYW EHLRGVYGEA ERWYERGLAI MQKVLGPEHQ NTARMMNNLA GIRLEQMRYA EAQALYEQVL GIWNVTFGPE HPDTARCMNN LASALGRQGQ NAEALAMLEQ ALVVWEAALG PEHPDTAISI NNLAVALERE GRYAESQVLQ ERALKVWKKT LGPEHPDTAA SLNSLARLLE HQGKYSQALP FYQQALAIRE TALGPEHPDV ASSLNDLAGL LIEQKRYTDA QALYERALTI RELVFGPEHA DTITAMANLA VALERRGQYR EALELHAQAL TISRKVFGDN HQTSQRIRAS HARTVQAIQE AFDQSAAKRS SGKHTNDDHV KQWK
|
| |