Gene Information |
Locus tag | Haur_1534 |
Symbol | |
ID | 5733421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1787524 |
End bp | 1790130 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278674 |
Product | TPR repeat-containing protein |
Protein accession | YP_001544306 |
Protein GI | 159898059 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
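The coordinate, gene length, and protein length fields above agree with one another if the start and end positions are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check, with illustrative function names:

```python
# Values copied from the record above.
START_BP = 1787524
END_BP = 1790130


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2607
print(protein_length(length_bp))  # 868
```

Both results match the listed Gene Length (2607 bp) and Protein Length (868 aa).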
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTTAC TTGATGATGC CTTCGAATCA ACGGTTACAC CGGCACTTAC GTCTTACATC TGGCTCCTGC GTTGGTGTGA TTCGGCTCAG TTGGCTCACC TAACGCCTTA TTCACCCGAA CAGATCGAAC GCTTTTGGCA AAGCGCCTTG GTCATTGAAC ACCCCCATCA TGGTTGGTAT CAACTACGCG AAGCACCTAG CTTAAATGAA CGACCTTATC GTGAGCACGA AGTCTTTGCT GCCGCCTTTG AGTATAGCCA GCAACAACTC AATCGTCTAG AGGCTGAAGC TTGGCAATTT GAACTCCAAC GCTGGCTCTA CTACCTTGAG GAATATTTGG AAGTGCTTTC GGCTCGCCGC GATTGGCCAA CCATCGCCGC AGTGCTCGAA AAAGCCACCA GCATTCCGCA AGTCAATCTG CGCCAACAGC AACTGCTGAT GCTCTACAAG GCGATTATCA CCATGCGGCT TGAACGTCAG TATGATACCG CCCAAAGCTT ATTACAACAA TTACGCGATG ATATCCAGCT TGAAGCCGAT TTAGTGCCGA TGGTGATCAA CAGTATGGGC ACATTAGCAT GGTTTCGCGG AGCCTACGAT CAGGCGATTC AACATTATAT TGAGCAACAT CAACACGCTC AGCAGGTGCA AAATTGGCTC TATCAAGGCC ACAGTTTGCT CAATCAGAGC ATTTTATCCA ACCAACTACA ACGCCCAGAA TACGCCTTGG AGCTAAGTTT ACAAGCGCTC GAAGCCTTAC AACGGGCGGG CAATCGCTAT CGCGAGGCGC ATGCGCTCTA CGAAGTTGGC TCGAATTTGC TCTATTTGTG CCGTTGGGAT GAGGCCGATA GCTATTTTTC ACGCTCGGCG GCGCTCTACG AAACCCTCGA TACGATCGGC GATTTGGCCA ATTTGTATTG GCATATTGGC TTTTTGAAGC ATTTAGCAGG CGATTTGCCG GCCAGCGAAC AAGCCTATCA AATTAGTATT GAGGCGGCGC GGGCCAGCGT TGTGCCCAAT GATGATAGTC TGCGCGATTC GTTGGCATTT TTGGGCTTGC TCTATTGCAG TATGCACGAG TATCAAACGG CCTTGGAAAT TTATGCTCAA GCCGAAGCCT TAGCCCGCCA ACACAACAAT CGCCATGAAT TAGCCCTGAT TCTCAATCAG CGCGGCGATG CTGAACGGCG GAGCGGGGCA ATTGATGCAG CCTACGCCGC CTTTGCCGAA TCGATCGCAA TTATTGAAGA TTTACGCACC TCGTTCGGCG ATGAAGATAC CAAATTGGGC TTGATTAGTA CAGCCCAACA GGTTTATGAG CATATGGTTG TGCTATGTAT TGAGCGCGGC GATGCGGCTC AAGCAGTCAA TTATATCGAG CGAGCACGTT CGCGAGCCTT CCTTGATGCG CTGCAAGCTG GCGATGAAAG TACCGCAATT GAGCTTTCGC AGCAATGTGC CGATTTGGCC GAAATTCAAG CCCAACTTGA CCAACGCACA GCGGTGATCG AATACTTTAC GGTTGGCGTA TTAGGTCGAT CATTGCGTTT CTTGGCAGCG TTGGCCGAAC GCAAATCACC CATTTTACAT CATTTCAGCC TTGATCCAGC CTTGTATAGT GTGGCAATTA CGGCCAACAA TGCCACGATT CATCAGCATA GCTTCAATCC ATTGAATCTA ACGCGCGGCC ATGGCGGGCA TCATCGCTTG CTACAACCTC GGATTTTGAA GGCCATTTCG CAAGCATTAA TCGAGCCATA CGAAGCCATG TTGGCGACAG TTGATTTGGT GTATGTTGTG 
CCGCATGGTC CGCTACACGA TGTACCATTT ATGGCCTTGC AAACTAGCGA CGGCAATTGG TTGGTGCGCG AAGAAAACCC AGCGATTGCC TTGGCTCCTA GTGCCACTAT TTTGGTGCGC TATGCCTTAG GTCGCGCCGC CAGCAGCCAA ACTCAGCACT ATTGTTTTGG CTACAACAGC GTCGGAGCCG AAGCCCTGAC CTATGCTGAA CACGAAGCCC AAGAAATTGC CAAATTAGTG GGTGGGCAAG CATGGACAGG CGCATTAGCC ACCGATCAGT TTTTGCGCTA TGCCCACGAT GCGCGAATTA TTCACATCGC CTCGCACTGT GTGTACGATG CTCAACAGCC ATTAAATTCG CACTTGATTC TTGGCCACGA AACGCTAAGC GCCCAAACGA TTATGGATCA GGTTGAAATT GACACAGATT TGGTAGTGTT AAGTGCTTGT GTCAGTGGAC GCAGTTTTGT GGCGGTCAGC GACGATCAAT ATGGCCTACA ACGGGCATTT TTATATGCCG GAACCCGTAG TTTACTCTGC TCGTTGTGGA ACGCCTCGGA TGTAGCGGCG TTGTTCGTGA TGGATCGTTT TTACCGTGAA TTGCAGGCCG GAGTACGCAT TGCTATAGCA CTCAAACATG CCGTCATCGC TGTACGTGAC TTGACGCGGG CTGATATTAT TAAACAGTTC CAGCTTTGGC AGCTACCAGC TAGCGCAATT CCGCTCGAAC CAGACGGCCA GCACAGCGAA AGCCCTTTGG CAGACCCGCG CTTTTGGGCT GGCTTTATGG TGATTGGCAA AGCCTAA
|
Protein sequence | MVLLDDAFES TVTPALTSYI WLLRWCDSAQ LAHLTPYSPE QIERFWQSAL VIEHPHHGWY QLREAPSLNE RPYREHEVFA AAFEYSQQQL NRLEAEAWQF ELQRWLYYLE EYLEVLSARR DWPTIAAVLE KATSIPQVNL RQQQLLMLYK AIITMRLERQ YDTAQSLLQQ LRDDIQLEAD LVPMVINSMG TLAWFRGAYD QAIQHYIEQH QHAQQVQNWL YQGHSLLNQS ILSNQLQRPE YALELSLQAL EALQRAGNRY REAHALYEVG SNLLYLCRWD EADSYFSRSA ALYETLDTIG DLANLYWHIG FLKHLAGDLP ASEQAYQISI EAARASVVPN DDSLRDSLAF LGLLYCSMHE YQTALEIYAQ AEALARQHNN RHELALILNQ RGDAERRSGA IDAAYAAFAE SIAIIEDLRT SFGDEDTKLG LISTAQQVYE HMVVLCIERG DAAQAVNYIE RARSRAFLDA LQAGDESTAI ELSQQCADLA EIQAQLDQRT AVIEYFTVGV LGRSLRFLAA LAERKSPILH HFSLDPALYS VAITANNATI HQHSFNPLNL TRGHGGHHRL LQPRILKAIS QALIEPYEAM LATVDLVYVV PHGPLHDVPF MALQTSDGNW LVREENPAIA LAPSATILVR YALGRAASSQ TQHYCFGYNS VGAEALTYAE HEAQEIAKLV GGQAWTGALA TDQFLRYAHD ARIIHIASHC VYDAQQPLNS HLILGHETLS AQTIMDQVEI DTDLVVLSAC VSGRSFVAVS DDQYGLQRAF LYAGTRSLLC SLWNASDVAA LFVMDRFYRE LQAGVRIAIA LKHAVIAVRD LTRADIIKQF QLWQLPASAI PLEPDGQHSE SPLADPRFWA GFMVIGKA
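The gene sequence above is printed in blocks of ten bases and appears to be given in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand. The following Python sketch, with hypothetical helper names, shows one way to strip the block spacing, compute a GC fraction, and translate codons. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs mainly in which start codons it permits), and it does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last three blocks of the gene sequence in the record.
first_blocks = "ATGGTTTTAC TTGATGATGC CTTCGAATCA"
last_blocks = "GGCTTTATGG TGATTGGCAA AGCCTAA"

print(translate(first_blocks))  # MVLLDDAFES
print(translate(last_blocks))   # GFMVIGKA*
```

The two translated fragments match the start and end of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.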
|