Gene Information |
Locus tag | Haur_0090 |
Symbol | |
ID | 5731983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 114888 |
End bp | 118160 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277212 |
Product | TPR repeat-containing protein |
Protein accession | YP_001542870 |
Protein GI | 159896623 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
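The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 114888
end_bp = 118160
reported_gene_length_bp = 3273
reported_protein_length_aa = 1090


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Under these assumptions the record is internally consistent: the span covers 3273 bp, which is 1091 codons, or 1090 residues once the stop codon is excluded.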
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.763002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGGGA ACCAAGCAAT TTTTGACCGT GCGTTAGAAC AATATCAGCT TGCCTCACGT GCTGGCGATT GGAACAAAGC TTTAACCGAA GCTGCTCGGG CCATGACCGA ATTTCCGACT CACGAGCAGG CTCGGATTGC TGTCGCCACC GCGTTGTTTC ATACCAATAA GCTTCCACAA GCGCTGCAAT TGTGGCAGGA GTTGCTCAAG CGCAATCCTG ATAATCCGAT TTTTACCGAA TATATTGCCA AGATTTATCG CGCCCAAGGC GATACCGACC AAGCGATCGA GTTGTTTATG CAATTGGCTG AACGCTTGAT CGCCCAAAAA TTGCCTTTGA AGGCAGTGCG AGCTTATCGC GATATTTTGG ATATTCACCC CAGCCACGAA GAAGCCCGCG TGCGCTTGGC GATGGTACTG GCTGAATCGG GCGATAAGCC TGGCGCAATC CGTGAATATT TGCATTTGGG TCAGCAATTT TATGATGCTG AACAATACGA TGCTGCTGAT GAAGCCGCCG TCGAAGCCCT CAATGTTGAC CCTGCTAGCC TCTCGGCCCA AGAATTCAAG AAGAAAGTTG ATTTGGCACG GCAGGCCTTA ATCACTGCTG CCACAGGCGA TCAATTGCCC GATCAAGGTC GTTTGCAAAC CAAAGAACAA ATCGAACGCG CGGTCAATGA GGCGATTGAG TATCAAAATA ATGGGTTGCT GGAGCAAGCC GCCCAAATGT ATGAACGGGT GGTTGAAGTG CTGCCCGAAC GCGCCGACAT GCAGTATAGC TTGGGCTTGA TTTACGATGA GCTTGAGCGG CATAAAGATG CCTTGCGGGT GCTCGAAATT GCCAGCCATA ACGATGAATA CGCCCTCTCT GCCCACTTTG CAATTGGTAC GATCTACCAA AAGCTGGGCC AAATGGAACG TGCTGCCCAA GAGTTTGAGC AAGCGATTCG CTATGTCGAT TTGCAATCGA TCGGCAAAGA AGAAGCCAAC GATTTGATCG ATATGTACGA AGCGACCTCG GCGATTTATC TTGAACTAGG CGATGTGGCG CGGGCAGCCT CGTTGTATTC AACCTTGGCA GGCTTTTTGC AGGGCAAACG TTGGGGCGCG GAACAAGCGG CGCAATACAA TGCCAAGGCT AAAGAGCTGA CCGACCGCAG CATGATGTCG AAATTGCGCA TGCTTGGCAC GGGTATTTTG AATGCGCCGC CACCTGAAGA GGCTGTGCCT GAACCACAGA CCGAAACTTG GGGCAAGATG CCCTCGATCA CCGATTTCCT CAGTCCCAAA AGTCGCGTCG AAGCCAGCCC AGTCGATGAT ATGGCCAGCC TTGAGGCCTT GCTTAGCGCT AATACCGGCC CAGCACCCCA AACGATGCTT ACCGATATTG ATCCGCTTGA TGTGCTGGCC GCCAATTTGC CACCTGCTGG CGAAGTACAT TTTGCGCCGC TCACGCCAAT TGATACCGAA GGCCGTAGCG AGCGGATTCA GCGTTTGGTT GAGGCCAGCG AATCGTTTGC CGAGCAAAAT TTCTTGTATG CGGCGCTTGA TGCCTGTCAT GAAATTATTC GCTACGATCT GGAGTTTTAT GCTATTCACC TGCGGATGGC TGAAATTCTC GAACGTTTGG GGCGTATGGA TCAAGCGCTC GCCAAACTCA ATTTGTTGAT CGAAACCTAC AAAGCGCGGG GCGAAACCCA CAAGGCGATT GGGGTCTATC ACAAACTGAT CGATCTTTCG GCTGATAGCA CGATCATTCG GGCTGAATTG GCTGAGGTGC TACGCAAACA AGGCCGTAGC 
GATGAGGCCG CTGAGCAGTT GGCCTATGTT GCCAACCAAC AATTCCGCCA AGGCCAAACC GTCAAAGCCT TGGAGCAATT TCGCAAGTTG TTGGAATGGG CACCTGATAG CGTCAATTTA CGTGCTCAAT ATGGCCAAGT GTTGCTCAAA CTCGAACGTT GGGAAGCAGC CTTGGAAGAA TTCCGCCGAG TTATCGTGCA AAAACCCGAT GATTTGGTGG TGATGGCCCA AGCCAATATT GCCCTGGCCA TGATGGGCGA ATTCCCCGAT GCGATTTGGG ATTCAGTCGC AGCCTTGATT CAAAAGCTGG CTAGCAATCC GCAACAGCTT AACGATGTGC AAGCCGAATA TCGCGCCGTC ACCCTGATTA CTGATCGGGC GATTATTCAG TTTCTCTTGG GCTTGATTCA GCAATCAACC AAGCAACATC CGGCGGCGAT GCACTCATTC AATCAAGCGC TCGAATTGCT CGAAATTGAT GCTGATCCCT TGGTTCCGCC AGTGCTGGTC TATCAAGCGA TTGCCGATAG CTACATTGCC GAGGGTAATG CTGCTGGCTC AATCGAACAG TTGCGCAAAA TCGAGGCGAT TTTGGTGACT GGCACTGTGC CAGCCATGAG CACCAAGCAT GCCTTTGCCC AACCCTTCAA CGAGGGTGAG TTGCAACGGC GTTTGGCCGA AGCCTATGCC GCTAATGAGA ATTTCGAAGG CGCAATTCAG GCCTTGCAAC GGGTCAAGCA ATTGTTGCCC TACGACCGCC AAGCTTACAC CAAATTGGCT GATATCTACT TCCGCCAAGG CCGCCTAAAC GAAGCCTTGA CCCAACTTGA TGAGTTGGCA ACTTACTACG AAAGCCAAAG CCAGCTTGAC CGTGCCTTGG AGATTTTGGC CACGGGCTTG CAATTAGCAC CCAAGAATAT CCCAATCAAA TCGCGCCATG CCCAGCTGAT GATGCGGCGC GGCTATCTTG ATGAGGGCTT GGTAGGCTTG GATGAATTGG CTGAGTTGCA GCGCAAACAA GGCTTGGTCA AGGATGCAGT GGCGAGCATT CAGCAAGTCG CTGATGTGTA TTGGACGTTG GGCAAGGTTG ATAAAGCTGC CGAGATGTAC AACCGAATTG TGCAGATCGC GCCCAACGAC ACCGAAGCTC GTCAACACTT GGTCAGCTTT AACATTCTTT CGTTGCGCAC CAAGGATGCA ATTATCCAGT TGCGTGAAAT TGCACGGCTC TCAATTCAGC AACGCAACTA CGAAGAAGCG ATTGCTTCCT ATCACCAAGT GATTGCGCTC GATCAAAAAG ATGCCGATGC CTACGAGCAA TTGGCCGATG TGCTGATGCG AGCACAAGAG TATGGTCAGG CCGTGCGTAC CTATAAACAA TTGGCGAAAT TGTTGCCCGA TGATGATCGG GTCGAAGCTT TGCAAAGCGC CGCCCAACGC ATGCTTGATC AGCAACAGGT GGCCAAAGGC TAG
|
Protein sequence | MAGNQAIFDR ALEQYQLASR AGDWNKALTE AARAMTEFPT HEQARIAVAT ALFHTNKLPQ ALQLWQELLK RNPDNPIFTE YIAKIYRAQG DTDQAIELFM QLAERLIAQK LPLKAVRAYR DILDIHPSHE EARVRLAMVL AESGDKPGAI REYLHLGQQF YDAEQYDAAD EAAVEALNVD PASLSAQEFK KKVDLARQAL ITAATGDQLP DQGRLQTKEQ IERAVNEAIE YQNNGLLEQA AQMYERVVEV LPERADMQYS LGLIYDELER HKDALRVLEI ASHNDEYALS AHFAIGTIYQ KLGQMERAAQ EFEQAIRYVD LQSIGKEEAN DLIDMYEATS AIYLELGDVA RAASLYSTLA GFLQGKRWGA EQAAQYNAKA KELTDRSMMS KLRMLGTGIL NAPPPEEAVP EPQTETWGKM PSITDFLSPK SRVEASPVDD MASLEALLSA NTGPAPQTML TDIDPLDVLA ANLPPAGEVH FAPLTPIDTE GRSERIQRLV EASESFAEQN FLYAALDACH EIIRYDLEFY AIHLRMAEIL ERLGRMDQAL AKLNLLIETY KARGETHKAI GVYHKLIDLS ADSTIIRAEL AEVLRKQGRS DEAAEQLAYV ANQQFRQGQT VKALEQFRKL LEWAPDSVNL RAQYGQVLLK LERWEAALEE FRRVIVQKPD DLVVMAQANI ALAMMGEFPD AIWDSVAALI QKLASNPQQL NDVQAEYRAV TLITDRAIIQ FLLGLIQQST KQHPAAMHSF NQALELLEID ADPLVPPVLV YQAIADSYIA EGNAAGSIEQ LRKIEAILVT GTVPAMSTKH AFAQPFNEGE LQRRLAEAYA ANENFEGAIQ ALQRVKQLLP YDRQAYTKLA DIYFRQGRLN EALTQLDELA TYYESQSQLD RALEILATGL QLAPKNIPIK SRHAQLMMRR GYLDEGLVGL DELAELQRKQ GLVKDAVASI QQVADVYWTL GKVDKAAEMY NRIVQIAPND TEARQHLVSF NILSLRTKDA IIQLREIARL SIQQRNYEEA IASYHQVIAL DQKDADAYEQ LADVLMRAQE YGQAVRTYKQ LAKLLPDDDR VEALQSAAQR MLDQQQVAKG