Gene Information |
Locus tag | Haur_3469 |
Symbol | |
ID | 5735330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4368087 |
End bp | 4371572 |
Gene Length | 3486 bp |
Protein Length | 1161 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280616 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546233 |
Protein GI | 159899986 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
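
The coordinate and length fields above are consistent with one another. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Hypothetical helpers for illustration only; not part of any database API.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (Haur_3469).
length = gene_length_bp(4368087, 4371572)
print(length)                     # 3486
print(protein_length_aa(length))  # 1161
```

Both results match the Gene Length (3486 bp) and Protein Length (1161 aa) rows of the record.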
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGGAT GGATTTTTGT CATTTTGTTG TGTGGGCTAT TTGGCTATAA CTTGTATCGC ACTGGCCATG TGCAGCGCTT GATTGCAATT CGCCAGCGCC GCCCATCTGC CCGCCCGACT GGCGCAATTG CTCCAACCCT ACGTGCCCGC GATTTGCGCA CCCAGCGCTC GTGGGCAGTC CGCATTCGTG ATGAAGTTCA ATCGCGCGGC AATATGGGCG TGGTTATTTT CTTCCTGATT ATCTTAATTG CTTTTTTTCT GAGTTTGAAC TTACTGGGCT TTTGGGGTAG TCAACCGACT GAAACTGTTC GGGCACGGGT AATTCTCTCA CCATTTACCA GCAGCGCAGG CCAACCAGCC CAAGAAGGTT TTGCCGCAGC CACCACCATG GCCCAGGCTT GGCAAGCTCG TGCTAATGAT TTGCAATTTG CTGTGCTCAA AACCCCAATT GCCGATGCCC AAAGTGCTTG GAATTTAGCC GATCGGCCCA AATATGATCT GGTGATTTGG GGCACAATTG TCGCTGGCGG TGATGCCAAT AGCGCTAGCA TCGAGCCGCA ATTGCTCTGG ACTCCACGCC AACCCTTGCC GCATGATCGC TCGTTGGGGT TACGCGAACG CTTGAGTTTA CCACTCGTAT ATCGTTTGGC CGACCAGCCA TTTAATGCCC AAGCCGTGCT CGGTGAAATT TTGGTCGTGA TCGATCTCTA TCAAATGGGC GAGTACGATC AAGCGATCCA AGCGATCAAT AGTTTGCTTG ATCGCTATGC CGTGGATGGG CCGTTACGCC CCGATTTGCT CTTGGGGATT CGTGGCTCGA TTGCCGCCCT GCAAGAGCAA TGGAGCTTAG CCGAGGGTGA TTTTCGGCGA GCCTTAGAAA CTGCCGATAA ACCTGAATAT TGGAATAACC TTGGGGTTGT GTTGCTAGAA CAAGGCCGTT TTTCTGAGGC AACTCAAGCC TTCAACACTG CTCAAGAACG CCTCAAAGGC ACAAATAGCG ATCTTACGGC GCTGCATTTG AATCGTGGTT TGTTGGCGTT ACGTGGGGTT GACCCAGCGG TAGCGGTTGG TGAGTTGGCC TTAGCGGTCA AGCTCAATCC TAATGGCATC AGCAACAAAT TAGCCTTGGT CGAGGCTCAA TTACAAGCGA ATCAATTGGC GCAGGCGATC GAAACCATCA ATAATCTGAA TGGCTATGCG AACGCTGATC CGTTGGTTCA ATTGATGACA GCGCGGGTTC AAGTAGCCCA GTTGGTCAAC AATGGCGATC AGCCGCTGTG GGAATTGGAG ATTGTGCCGC CATTGCCTCG CGAAACCCTG ACGATTGTGC GCCAACGGCT TGATAGTGCG GTTGTTACAC TTGAAACGAT TTCGATCGAG CAACGCCAAA TTGCCGCCCG TGATGATGCA GCAGGCTTGC CAGAATCAGG GCGGGTGCAC GAAGCTCATT CGCGCCAAGC CTTTGAATTG CTCAATCAGG TGCGCTATTG GCAAGCGGTC GCGATGACCG AAGAGGGTAT TTCAGCCTCG CTTGAGCAAA AAGGTCGGCT GCGCAGCATG TGGGATGGCA TGTTTGGCGA TGATTCTCCG CTGGAATTAG CCCAAAATGT GATCGATCCA CTCGCCAAAG CCCAAGACCA GAATTATCCA ATTTTAATTC AGGCAGGGCG GGTGCATCGA ATTTTGGGTG CGCCAAAAAC AGCTATGGAT TACTACACCA AAGCCAGCGA ACTCAATGCC AATCGGCCTG AGGGCTGGTA TGGCTTGGCA ATCACGCGCT TTAATACCGA AGGCCCAAGC 
CCAGAACGCA ACCAAGCAGT GCGTGATTAT TTGCAAAAAA CCATTGCTGC CGCCCCAAAT TTTACGCCTG GCTATATTTT GGCTGCTCGC TTAGAGGTTA GCGATAAACA ATGGGCCGCC GCCTTGCCCT ATCTACAATG GATTGTGGCT AATCGCCCTG ATAACCTTGG GGCACGGATT ACCCTGGGCA CAGCTCAACG CGAGCTTGGG GCTTTGGCCG ATGCTGAAGT GACGCTTTTG CCTTTGGCGA ATGCGAATAA CAGCCAAGCC TTAGTTGAAT TAGGCAAAGT CTATGCCCAA GCAGGCCAAG ATCAATCGGC TGAAGATATC TTCTTGCGGG CATTGAATGT TGATAGCAGC AATGCTAGCG CTGCTTATGA GATTGGCCGC TTGCGCCAAA ACCGTGGCGA TTATGCAGGA GCCGAAAAAG CCTATCAAGT TGCAACTGAA ATTAATACCA GCTACGTTGA GGCGCATCTC GCATTAGGCC AATTGTATGC GCGGTATCTC GACCAGCCAG ATAATGCCGT TGCGGCCTAT CAACGCGCAA TCGACGCTGG CGGCGAAGAT CCACGCAATT TCGAGGATAT GGGCCGCGAA TTTTTGGAGA TTGGGCGTTA TAGCGAGGCC GCCGATGCCT TGGAACAATC GGTGCGGCTC AATCCGAATG TGCCTGAATC ACGCCATTAT TTGGCCCAAG CCTACCTTGA GCAAGGTCGT TTTGAAGCAG CTCGCGAGCA AGAACGCGCA GCGATTGCCC GTGATGCTGA TGGCGTGTAC ATCGAAGCGC AGCTTGGCAT AGCCGAGAGT TTTCGGCGCG AACGGCGGTT TGATGAGGCA ATTACAGCCT ACAACGAAAT TTTGGATAAC GATTCGACGA TCATTCCTGC TTACATTGGT TTAGGCCGTA CTGCCGCCGA TCGGGGCGAG TGGCAAGTGG CGATTGGCTA TTATAATCAA GCCCTCGCCC GCGAACCCAA TAGCACCAAC GCCCATTTTT GGCTGGGCCA GGCCTTGGTC GAGCAAGGTT TCTATGAACG GGCACTTGAT GAATTTAATT TGGTGCTAGC GACCGATCCA AATAATGCCG AAGCCTTGTT TGGGGCTGGT CGGGCTTATT GCAACATGGC GATTAACAGC TATGCCTATG ACCCAGCACA AGCTGCCGAA TACGATGCTG AGGCGCGGCG TTTGCTTGAT CGAGCTTTGA ATTATCGGCC AAACGATGCG CGGGCCTTGT TTGAGCGCGG CAAACTCAAC GAGCGCCAAA ACCAAATGGC CGCCGCGATT AGCGATTATG GTCGGGCCGC CCAATTGGAT GCGCAAAATA GCGAAGCATT ATATTTGCAA GGCAAGCTCT ATCTCAGCCA AAACAACCTA CAAGCAGCCG AAGAAGCGCT TGATCAATCG GTGCGGCGCA ACCAAAATGA CCCAACCGCC TTATATTGGC TTGGTCGAAC CTATCGCGCC CAAAACCGCA CCAATGATGC AATTAAATCG TTTGAACGAG CCTTGAGTTT GAATGGCAAT TTTCATGAAG CCCGCTACTA CCAAGGCCTA ACTGCCGAAG AAGCCAACCA AATCGATCTA GCGCGTGAAG CCTATCACTT AATCAGCCAA CAAGCCAGCC CCGATGATCG TTGGCGCTTA CAAGCCGAAG AACGTCTGCG CGATTTAGGT CAGTAG
Protein sequence | MIGWIFVILL CGLFGYNLYR TGHVQRLIAI RQRRPSARPT GAIAPTLRAR DLRTQRSWAV RIRDEVQSRG NMGVVIFFLI ILIAFFLSLN LLGFWGSQPT ETVRARVILS PFTSSAGQPA QEGFAAATTM AQAWQARAND LQFAVLKTPI ADAQSAWNLA DRPKYDLVIW GTIVAGGDAN SASIEPQLLW TPRQPLPHDR SLGLRERLSL PLVYRLADQP FNAQAVLGEI LVVIDLYQMG EYDQAIQAIN SLLDRYAVDG PLRPDLLLGI RGSIAALQEQ WSLAEGDFRR ALETADKPEY WNNLGVVLLE QGRFSEATQA FNTAQERLKG TNSDLTALHL NRGLLALRGV DPAVAVGELA LAVKLNPNGI SNKLALVEAQ LQANQLAQAI ETINNLNGYA NADPLVQLMT ARVQVAQLVN NGDQPLWELE IVPPLPRETL TIVRQRLDSA VVTLETISIE QRQIAARDDA AGLPESGRVH EAHSRQAFEL LNQVRYWQAV AMTEEGISAS LEQKGRLRSM WDGMFGDDSP LELAQNVIDP LAKAQDQNYP ILIQAGRVHR ILGAPKTAMD YYTKASELNA NRPEGWYGLA ITRFNTEGPS PERNQAVRDY LQKTIAAAPN FTPGYILAAR LEVSDKQWAA ALPYLQWIVA NRPDNLGARI TLGTAQRELG ALADAEVTLL PLANANNSQA LVELGKVYAQ AGQDQSAEDI FLRALNVDSS NASAAYEIGR LRQNRGDYAG AEKAYQVATE INTSYVEAHL ALGQLYARYL DQPDNAVAAY QRAIDAGGED PRNFEDMGRE FLEIGRYSEA ADALEQSVRL NPNVPESRHY LAQAYLEQGR FEAAREQERA AIARDADGVY IEAQLGIAES FRRERRFDEA ITAYNEILDN DSTIIPAYIG LGRTAADRGE WQVAIGYYNQ ALAREPNSTN AHFWLGQALV EQGFYERALD EFNLVLATDP NNAEALFGAG RAYCNMAINS YAYDPAQAAE YDAEARRLLD RALNYRPNDA RALFERGKLN ERQNQMAAAI SDYGRAAQLD AQNSEALYLQ GKLYLSQNNL QAAEEALDQS VRRNQNDPTA LYWLGRTYRA QNRTNDAIKS FERALSLNGN FHEARYYQGL TAEEANQIDL AREAYHLISQ QASPDDRWRL QAEERLRDLG Q
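
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical.

```python
# Hypothetical helpers for illustration only.
# Codon assignments are built in the conventional TCAG order; translation
# table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGATCGGAT GGATTTTTGT CATTTTGTTG"
print(translate(prefix))  # MIGWIFVILL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would allow the GC content row to be checked in the same way.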