Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1113 |
Symbol | |
ID | 5733005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1274432 |
End bp | 1275868 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641278252 |
Product | TPR repeat-containing protein |
Protein accession | YP_001543889 |
Protein GI | 159897642 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0928766 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTC TTGAACCAAC CATCCAACAG CCCAATCTTG ATTGGCAATT ATATTATCAG CAATGTCATC AGCGATTTTT AAATTACTGG CTTAATTTTG TGCTTGAAAA CCCGCGTGCT CTCGCTGCTT TTGAGCTGGA TGCTGATAAT ATTTATAATG CTTTGGCAAC CGCTTATAGC GAAAAGCATC CACAATTTTT AGCCAGCGCT GTGGCATTTA GCTATTTTAT TGAATATAGC AGCGATATCG AACGTCGGCA TAATTTATCT AATTGGGCAG TTGAAGCCAC CCAAACCAGT AATGATCATC AAAGTGCTGC CTCGGCATTG TTGTACGCTG GCCGTTTAGC GACGTTTTCT AAATCGTATG CGGTGGCAAT GCATTATTTT GAGCAAGGTT TACAACGCGC ACGCCAAGCC CAATCACGCT TAATCGAAGC CAGTCTATTA AATGATTTAG GGGCGAGTGC GATTAACCAA GGCTTTTATG ATCAAGCCTA TGCCTATTTA GAACAGGCCT TAGAAATTGC CCAAGATCAT CATGATAACG AACTAATTGC CGATATTTTG GTCAAACGTG GAGCTGCTTA TTCCAATCGC GGCGCTTACG ATGTAGCATT TGCCGACTTG CAAGCGGCCC TAGCCGCAGC TCGCTTAACT AATTATATTG ATGTAATGGG CAAGGCCTTG GCTTTGTTGG GTGCGGTTGA AACCAATCGC GGCGATTATG CAGCAGCCAA AGCCTATTTT TCCGAAGGCT TGACGATCTG TCGCCAATTG AATATTCCTG AGCGTTTGAG TGAATTGTTG AATAATTTAG GTGTGATTTG TATGCGCCAA GGGCTTGATG ATCAAGCTGA GGCCTATCTC AACGAAGCCT TTGAAATTGC TCGTAGCCAA GGCCAACAAG AGCGGATGAG CTTTTTATTG GTCTATTTGG GCAGTATTAG CAACCATAAA GGTGCATATG ATCAAGCGGC ACAGCAGCTT AATCAGAGCT TAGCGCTAGC CCGTCAGGTT GGCAATCAAT TAATCATCGC CTTCTCGCAA GGCCATTTGA GCGAAACCTT ACGTCAACAA GCCAAGTATA CTGAGGCCTT GGATGTGGCT AACGCAGGCT ATCCGATTGC AACCAGTCTG AATATTCCCT TAATTATTTG TCTTTATTTA CATAATTTTG CTGAGCTTGG GCTGGCACAA GCTGATTATG GCTTAGCTTT GGAGAAGTTT CAGCAAGGCT ACGAGCAAGC CAGCGCGATC GGCCTCAAAG AGTGGCAAGG CTTGATGGGC TATGGTTTGG CTCGTACTTA TTTGGCCCAA GCCCAGCCTG AGCAAGCCTT AGCATTTGGT CGGGCTAGTT TGGCAATATT GCAGGCAATC GGCCATCGCC GCACCAACGA AGTTCAAGCA TGGGTTGCCC AGCTAACGCC GCTATAA
|
Protein sequence | MSILEPTIQQ PNLDWQLYYQ QCHQRFLNYW LNFVLENPRA LAAFELDADN IYNALATAYS EKHPQFLASA VAFSYFIEYS SDIERRHNLS NWAVEATQTS NDHQSAASAL LYAGRLATFS KSYAVAMHYF EQGLQRARQA QSRLIEASLL NDLGASAINQ GFYDQAYAYL EQALEIAQDH HDNELIADIL VKRGAAYSNR GAYDVAFADL QAALAAARLT NYIDVMGKAL ALLGAVETNR GDYAAAKAYF SEGLTICRQL NIPERLSELL NNLGVICMRQ GLDDQAEAYL NEAFEIARSQ GQQERMSFLL VYLGSISNHK GAYDQAAQQL NQSLALARQV GNQLIIAFSQ GHLSETLRQQ AKYTEALDVA NAGYPIATSL NIPLIICLYL HNFAELGLAQ ADYGLALEKF QQGYEQASAI GLKEWQGLMG YGLARTYLAQ AQPEQALAFG RASLAILQAI GHRRTNEVQA WVAQLTPL
|
| |