Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0136 |
Symbol | |
ID | 5732031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 168061 |
End bp | 170391 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641277260 |
Product | TPR repeat-containing protein |
Protein accession | YP_001542916 |
Protein GI | 159896669 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.955072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGGAA ACTCGCACGT CCGGTTCTTA GGGGGCGGGA GTGTAGTAAT ACGCTCCTGC TACCCGACAG ATCATTTGCC TGAACGCAGC CAATTGCCAC CACATTCGGT TATGCCCTAT CAACCACTGA GCGATTTTGT AGGGCGTGAG GCCCAACTAT ATCAATTGGC TCAAGCAATG TTGCGCTCCG ACCCAACCCT GATTACACCA ACTGCTCTTG CCACAGGTAT GGGCGGGATT GGCAAAAGTA GCCTAGCCCT AGAATTTGCC CATCGTTATG GTAGCTATTT TGCGGGCGGG GTATTTTGGC TGTATGCCGC TACCAACGAA ACCCTCCAAG CCAGCCTTGA TCGTTGTTGG GATAGCCTGA AACCAGACGA GTGGCGTTAT GAAGTTAAGC CTGAAACGCG GCTGCGGGTG GTGCGTGAAT TATTTAATCA ACCTATACCA CGCTTGTTGA TTTTCGATAA TTGTGAAGAT CCAGCCTTGC TCACGGCCTA TCGTCCCCAG GCTAGCAGTG GTTGCCAACT GTTGGTTACT AGTCGGCGCA GCCAATGGCA AGGCACAAAT TTAATTACAC TTGATACCTT GCCGCCGCTT GAAAGCCGCC AATTGCTACA ACAACTAGCT GCTCAGCCAA ATATCAACAA CTATCTCAGC GATACCGATG CTGATCAGTT GGCTGAATTG GTGGGGCATT TGCCTTTAGC TTTGCATTTA GTTGGCTCAA GCTTGAAATT TTATTTTCGC AAGCCTGCGG CTGAGTATAT TGCAGCGCTG CAAAACCAAC GGATTGCCAG CTTGCAAGCG ATGGTTAAGC CAACCAGTAA GCTCCATCAA AATACAATTA ATAATTTTTG GAGCGTGCGC GATACAGTTG AAGTGAGTTA TGGGTTGTTG CCTGCCGAAT TAGGCCAAGC CTGTCGGCGT TTATTGTTGA TGATGGCCTA TTGTGCGCCG AATGTGGTTA TTCCATGGGA GTTATTGCAA GCTGCTAGCG GCTACGATGA TGATAGCCTG ACCGAATATC TATGGGAATT AACCCAAGCG GGCTTTTTCA ATGACCCAAC TCAACCGCGT TTACATCCGC TGATGGCTGA TGTAATTATT GATCTTGATG CGGCGAATGA GCCTGAAAAT TATAGATCGC TTGAACAAGC ATTGATAATG CTCAGCAACC GTTATCATGA TCAATGGGCA ATGCGGGAAA TTGAAATGTT GCTGCCCCAT CTTGAATATT CGGATCGCCA AGCGAGCCAA CATCCAGATT ATGCTGGCGA ATTAGGCTAT CAAGCAGGAC TTTGTTTATA TCGTCAAGGT AAGTATGTTG ATGCAGAACA CATTTATCGA GAAGTTTTAT CAACCCAAAA CCAACTATTT GAAACAGAAA ATCCAATCAT TCTTAATACA AAACATGCCC TGGCTGATGT ACTTGGCGAT CAAGGGTTAC TTCAAGAGGC TGAACAACTT TTTAACGAAG TATATAGGTT ACGTAAAAAA GTTTTAGGCC AATATCATCC CCATACCCTT AAAAGTAAAA AAGAATATGC AACTGCCATG TTTTTGCGAG GAAATTATGC TGATGCAGAA CAAATTCTGC GCGAGATATT AACGATTCAA GAACAGAGTC TAGGAAAAGA ACATTGGGAT AGCTTATTGA CAAAACATAA TTTAGCTTCA ATTCGCAGTA AACAGGGATA TTATGCCTTA GCAGAGCGGA TGTATCGTGA AATATTGAAG GATCAAGAGC AAATATTTGG TGTAAATCAT CCTGATACTC TAGCAACCAA GCGTCAAATT GCTAACAACG TAGGTTATCA GGGTCGATAT GCCGAGACCG AACGTATCTA TCGGGAAGTT CTGCCAATCT ATGAGCTTAT TTTAGGCTTG AACCATCCAT ATACGTTAAC AACAAAACAT GGAATCGCCT GGGCATTAAA TGGACAAGGG CTTTATAAAC AGGCTGAATA TATGTACCGT GAGGTATTAC TCATCTGTGA ACAAACGCTT CGGATTAATC ATCCTGAGAT TATTACTACT AAACATAATA TTGCTTGGAT ATTGAGTAAA CAGCAACATT ATCTTGAAGC AGAAGTAATC TATCGCGAAG TGCTAGAAAT TCGTGAGCAA AGCTTAGGAA CTAATCATCC TGACAGTTTA TCAACAAAAT ATAACTTGGC AGCTACACTG TATCATCAAT CTTGCTATCG TGAAGCAGAA TTATTATTTG ATCAAGTATT GAGCATCCGT GAAAAGGTTT TGGGAACTCA ACATGCAAGT ACACAGGCAA CGCAAGAGTG GCTTGAACTG GTGCGCAGTA AATTGTGTTG A
|
Protein sequence | MRGNSHVRFL GGGSVVIRSC YPTDHLPERS QLPPHSVMPY QPLSDFVGRE AQLYQLAQAM LRSDPTLITP TALATGMGGI GKSSLALEFA HRYGSYFAGG VFWLYAATNE TLQASLDRCW DSLKPDEWRY EVKPETRLRV VRELFNQPIP RLLIFDNCED PALLTAYRPQ ASSGCQLLVT SRRSQWQGTN LITLDTLPPL ESRQLLQQLA AQPNINNYLS DTDADQLAEL VGHLPLALHL VGSSLKFYFR KPAAEYIAAL QNQRIASLQA MVKPTSKLHQ NTINNFWSVR DTVEVSYGLL PAELGQACRR LLLMMAYCAP NVVIPWELLQ AASGYDDDSL TEYLWELTQA GFFNDPTQPR LHPLMADVII DLDAANEPEN YRSLEQALIM LSNRYHDQWA MREIEMLLPH LEYSDRQASQ HPDYAGELGY QAGLCLYRQG KYVDAEHIYR EVLSTQNQLF ETENPIILNT KHALADVLGD QGLLQEAEQL FNEVYRLRKK VLGQYHPHTL KSKKEYATAM FLRGNYADAE QILREILTIQ EQSLGKEHWD SLLTKHNLAS IRSKQGYYAL AERMYREILK DQEQIFGVNH PDTLATKRQI ANNVGYQGRY AETERIYREV LPIYELILGL NHPYTLTTKH GIAWALNGQG LYKQAEYMYR EVLLICEQTL RINHPEIITT KHNIAWILSK QQHYLEAEVI YREVLEIREQ SLGTNHPDSL STKYNLAATL YHQSCYREAE LLFDQVLSIR EKVLGTQHAS TQATQEWLEL VRSKLC
|
| |