Gene Information |
Locus tag | Haur_5149 |
Symbol | |
ID | 5737107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 213995 |
End bp | 217312 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282314 |
Product | TPR repeat-containing protein |
Protein accession | YP_001547905 |
Protein GI | 159901659 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
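The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes, without confirmation from the record itself, that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon. The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 213995
END_BP = 217312
GENE_LENGTH_BP = 3318
PROTEIN_LENGTH_AA = 1105


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 217312 - 213995 + 1 = 3318 bp, and 3318 / 3 - 1 = 1105 aa.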
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000182587 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTTAC TCGATATTCA ACACAAAATC TCAAGGCTCA TGGCCCGGTT TGTCGAAGAA GTAAAAAGTT CCACAGCGAT GGGTCATAGT GATATTAATC GTGTTGCTGA AACGGTGCTG ATTCCGCTCT TAGGCCGTGT CTATGAATGC CCCAGTTTAC AGAATCTCAA TAGTCTCCAT CCCAATTATC CAGCGGTTGA TTTAGGGGAT GTAGCCCGTC GGATCGCCTT TCAGGTCACG ACAACGCCGG ATAGTAAAAA AATCAAAGAT ACCCTCACCA CGTTCATCGC TCATAACCTG CATACCCAGT TTGATACGGT GTATGTATAT ATTTTGACGG AAAAACAACA CTCATATAGT CCTGCTATTT TTAGTACTAT CACCGGCAAT CACCTCCTCT TTGATCCAAA AAGACATATC CTTGATGCGA ATGATCTGCT GAAACAGATT GCTACCTACC ACGTTGACAA GGCCCAGCAA ATTCTCACGA TACTTGAGGC GAATTTTGAT ACGCCCATTG ATCCCCTTGC TCATGCGCTT GCCGTATATG GAACATTGCC GCTGGATTAT GTTCCTATGG CACGGTTGGA TCTGCCCCAA GCCTCACGCA TTCCCTTTGA ATCGAGTGCC TATTTTGTTG GGCGCGAAGC CGAATTGAAA GCTTTAGCCC GCGCGATTAT CCAAACCCAA CCGACCGTCG TTGTGCCTGC GGTTACGACC GGACTGGGGG GGATTGGCAA AACGAGTCTG GTGACGGAAT TTGCCTATCG CTATGGGGTC TATTTTCATG GCGGGATATT TTGGTTGAAC TGTGCTGATG CTAATCAGGT GGCGAGCCAG ATTGCAGCCT GTGCGGTTGG TTTGAAGATT GATACTACTG GGATGGCGCT CGATGAACAG GTGCAGCAGG TTTTGTATGC CTGGCAATCT CCGATGCCCC GCTTGCTGAT TTTCGATAAC TGTGAAGATC CAGCGATTCT TACGCAGTGG AAGCCCACTA TTGGTGGTTG TCGGGTGCTG GTGACGGCGC GGTCAGATCA GTGGCCAACG CTGACGCAGA TTCGTTTAGG GTTGCTCTCA CCTGTCGAAA GTCGCGCGTT ATTGCAGCGA CTCTGCACGC GGCTGACTGA CACCGCAGCT GATGCGATTG CCGAGGATCT AGGGCATTTG CCATTGGCCT TGCACCTAGC AGGCAGTTAT CTTAATACCT ATTCCCATCA CACGGTCGAG CAGTACCGCA CGGAGTTAAC CATTGCCCAC CGCTCGCTCA AGGGGCGAGG GGCGTTTCCA TCCCCAACCC AGCATGAACT GGATGTGGAA GCCACTTTCA TGGTGAGCGT GAATCAGCTT GATCCAAATG ATCCAATCGA TGCGCTCGCC TTGGGCATGC TGGATGGTGC TGCGTGGTGT GCGCCAAGCG TTCCCCTTCC GCGCTATGTG ATACTATCGT TCGTTCCCGA TGGAACGGAT GGTGATGATG CCGTTGATGC GCTGCGGCGT TTGCAAGCAT TGGGCTTATT GGATGGTATC GAGACAGTGA TCTTACATCG CTTGCTCGCC CAAGTCATTC ATGTCCATAT GGGATGGTCT GCGACATTGG CGCTGGTAGA GCAGCGGATG GTCGCTGCGG CGGAACAAGC GCATAAGACG GGGATTCCGA AGCAGATGAA TCCGCTCGAA CCCCATCTGC GGGGTATGAC GCTCCGGGTG TTAGATCGTG ATACAGAACA AACGGCACGA CTTGCAACGA ACCTTGGACT GTTTGCACAA CACCAAGGAT GGTATGCAGA GGCACAGGCG 
CTACATGAAC GGGCGTTTGG TATACGAAGA GTGCTTGTTG GTGAAAACCA TTCTTCTACG GCAATGAGCA TCAATAATCT TGCAGAAGCG TTACATCAGC AAGGGCGGTA TTTGGAGGCG CAGGACTTAT TTGAACGGGC GTTGGCGGTG CGGGAAGTGG TGTTGGGGTT GGATCATCCC GATACGGCAC GGAGTGTGAA CAATCTGGCG TTGGTCTTGG AGAGTCAAGG GCGGTATTCG GAGGCGCAGG ACTTATTTGA ACGGGCGTTG GCGGTGCGGG AAGCGGTGTT GGGGTTGGAT CATCCCGATA CGGCGGTGAG TGTGAACAAT CTGGCATCGG TTTTGGAGAG TCAAGGGCGG TATTCGGAGG CCCGAGGCTT GTATGAACGG GCGTTGGAGG TCACGGAAGC AGTTTTAGGT AGGGAACATC CTGATACTGC GCGAAGTGTG AACAATCTGG CATCGGTTTT GGCGCGGCAA GGGCGGTATT CGGAGGCACA ACCCTTGTAC GAACAGGCGT TGGCGGTGAA TGAAGCAGTT TTAGGTAGGG AACATCCTGA TACTGCGCGA AGTGTGAACA ATCTGGCATC GGTTTTGGAG AGTCAAGGGC GGTATTCGGA GGCACAACCC TTGTACGAAC AGGCGTTGGC AGTGCGCGAA GCGGTGTTAG GCGAGAATCA TCCGGATACG GCCATGAGTA TGAACAATCT GGCAATGGTA CTGTTGAATC AAGGACGGTA TTCGGAGGCG CAGGGCTTGT TAGAACGAAC CTTGACGGTG CATGAAGCGG TGTTGGGGGC GGAGCATCCG GACACGGCCA TGAGTGTAAA CAATCTTGCT GTGGTCTTGG AGAGTCAAGG GCGGTATTCG GAGGCGCAGG GCTTGTTAGA ACGAGCATTG GCGGTGCGGG AAGCGGTGTT GGGGGCGGAA CATCCGGATA CGGCCATGAG TGTGAACAAT CTTGCGGGGG TCTTGGAGAG TCAAGGGCGG TATGGGGATG CGCAGCGGTT GTATGAACGG GCATTGGTGG TTACGGAAGC GGTGTTGGGG GCGGAGCATC CAAATACGGC GCGAAGTATG AACAATCTGG CAATGGTACT GTTGAATCAA AGGCGGTATT CGGAGGCGCA GGGCTTGTTA GAACGGGCAT TGACGGTGCA TGAAGCGGTG TTGGGGGCGG AGCATCCGGA TACAGCCATG AGTGTACACA ATCTGGCGGT GGTTTTGGAG CGGCAAGAGC GGTATAGCGA TGCACAAATG TTATATGAAC GGGCGTTAGC CATCAATAAA GCGGTGTTAG GCCGCGAGCA TCCGGATACC ATGACAACAA TGGGCAGCTT GGCAGGTGTG CTTGAAAGGC AACGGCAGTA TGGGAAAGCC CAATCCCTCT ATGAACACGC ATTCGCCATC AGGAAACGCG TCTTGGGATT AACGCACCCA GATACCCAAT CCCTCCAACG GGATGTAGGA CGAGTCCAAC GCTTGCATCT GACTACCAAA AAGAAAAAGC GGAAATGA
|
Protein sequence | MHLLDIQHKI SRLMARFVEE VKSSTAMGHS DINRVAETVL IPLLGRVYEC PSLQNLNSLH PNYPAVDLGD VARRIAFQVT TTPDSKKIKD TLTTFIAHNL HTQFDTVYVY ILTEKQHSYS PAIFSTITGN HLLFDPKRHI LDANDLLKQI ATYHVDKAQQ ILTILEANFD TPIDPLAHAL AVYGTLPLDY VPMARLDLPQ ASRIPFESSA YFVGREAELK ALARAIIQTQ PTVVVPAVTT GLGGIGKTSL VTEFAYRYGV YFHGGIFWLN CADANQVASQ IAACAVGLKI DTTGMALDEQ VQQVLYAWQS PMPRLLIFDN CEDPAILTQW KPTIGGCRVL VTARSDQWPT LTQIRLGLLS PVESRALLQR LCTRLTDTAA DAIAEDLGHL PLALHLAGSY LNTYSHHTVE QYRTELTIAH RSLKGRGAFP SPTQHELDVE ATFMVSVNQL DPNDPIDALA LGMLDGAAWC APSVPLPRYV ILSFVPDGTD GDDAVDALRR LQALGLLDGI ETVILHRLLA QVIHVHMGWS ATLALVEQRM VAAAEQAHKT GIPKQMNPLE PHLRGMTLRV LDRDTEQTAR LATNLGLFAQ HQGWYAEAQA LHERAFGIRR VLVGENHSST AMSINNLAEA LHQQGRYLEA QDLFERALAV REVVLGLDHP DTARSVNNLA LVLESQGRYS EAQDLFERAL AVREAVLGLD HPDTAVSVNN LASVLESQGR YSEARGLYER ALEVTEAVLG REHPDTARSV NNLASVLARQ GRYSEAQPLY EQALAVNEAV LGREHPDTAR SVNNLASVLE SQGRYSEAQP LYEQALAVRE AVLGENHPDT AMSMNNLAMV LLNQGRYSEA QGLLERTLTV HEAVLGAEHP DTAMSVNNLA VVLESQGRYS EAQGLLERAL AVREAVLGAE HPDTAMSVNN LAGVLESQGR YGDAQRLYER ALVVTEAVLG AEHPNTARSM NNLAMVLLNQ RRYSEAQGLL ERALTVHEAV LGAEHPDTAM SVHNLAVVLE RQERYSDAQM LYERALAINK AVLGREHPDT MTTMGSLAGV LERQRQYGKA QSLYEHAFAI RKRVLGLTHP DTQSLQRDVG RVQRLHLTTK KKKRK
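The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip the grouping and compute a GC fraction. It assumes that GC content is defined as (G + C) / total length, which is the usual definition but is not stated in this record. The helper names are hypothetical, and the example input is only the first two groups of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, assuming GC content = (G + C) / length."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base groups of the gene sequence, used as a short illustration.
sample = clean_sequence("ATGCACTTAC TCGATATTCA")
print(sample, len(sample), gc_fraction(sample))
```

Applied to the full gene sequence, the same two functions would give a length and GC fraction to compare against the Gene Length and GC content fields.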