Gene Information |
Locus tag | Haur_5166 |
Symbol | |
ID | 5737124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 241525 |
End bp | 246009 |
Gene Length | 4485 bp |
Protein Length | 1494 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641282331 |
Product | peptidase C14 caspase catalytic subunit p20 |
Protein accession | YP_001547922 |
Protein GI | 159901676 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
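The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon (the listed gene sequence ends in TAG). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 241525
END_BP = 246009


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 4485
print(protein_length(length_bp))  # 1494
```

Both results agree with the Gene Length (4485 bp) and Protein Length (1494 aa) rows.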
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCTAC CTCCACGAAG TCCACTCCCC GGTCTACGCC ATGCGATTGT GATTGGCGCA AATGGATCAA TCGCATCGGG GCTTGCGCCG TTAATCGCCC CCGAAGCCCA TGATGCGCCA CGTATGGCAG CCGTCCTTGC CAGCCCTGCC TGTGGTTTCC ACGTGCAACC GTTTCTTGGT GACCAGGCGT TGATGCAGCC GATTCGTGAT GCCATCGAAA CACAGATGCG GGCTGGTACG ATTGGTGATT CACTCATCAT TTATTTCAGC GGCCATGGAA CGATAACCCA AACGGCTGAG GGTCCTGATG TCGTGTTGGT CACAACGGAT ACAACGCTTG AAACTATTCA GGATGATCCG CTCATCTGCC TCCGCGTCGG GTGGTTGAAA ACAGTCCTGC GCGGGAATAC GCAGATGTAC CCAATTGGAC ATGTGGTGGT GATTCTTGAC TGCTGTTCCA GCGGGATAAT TGACCAACTT GGCAATCAGC CCCAGATGGG AGATATGCCA GCCTTCCGCC GCATGCTGGT ATCAACCTAT ACCAGCCATC CAGCCTATGA ATGGAATCGA ACCAGCCTCT ATACCTACCA TTTGCTCAAC GCCTTGGAAG GGCAGGCCGT GGATAGTTTT GGTGCGGTCA CGATGGAGCG AGTCCATCAG TACTGTAGCG AAAAACTTGC CGGAACAGAG CAGCGCTGTG GGCTTGGTGG ATGGGATTAC ACGGGACGCT GGCAGTTTGC GTGGTATGAC CTGCAACATC CGAAACTCAT CCATGCCGCA CACCATGTTG AAAGTGATCG CCTGCTCAAT TTGCGCCTTG AGGGCTTTGT CGGTCGGGTC ACCGAACTGG CGGATATCCA CGAGCACATT GCAGCCATAC GCCCAACGGG GGGCTATGTC GTGATTAAGG CGCTGGCCGG AGAAGGAAAA AGTAGCATTA TCGCCAAGCT CATTCAGGAT GCGGGGATTG CGCAGACACC CCATCACTTC ATTGCCCTGA CCACGGGCCG CACGTACCAA TTGGAGTTGT TACATACGGT CGTGGCGCAA TTGATTCTCA AACACAATCT CTCAAGAGCA TTCGGTCTTG GAGATCATGT CCAGTTATTA AAAGGGGAGT TTTTTCGCCT GCTTGACTAT CTCTCAAAAC AAGGAATCGA GGAAACGATC TATCTTGATG GCCTCGATCA ACTCCAGCCT GACATTGATG GTTCGCGCGA CTTGTCGTTT CTGCCGCCGC AGCCACCACC AGGCATCGTG ATCGTGCTTG GATCGCGGCC CGATGAGACG TTGATACCAT TTAAAGAGAA GGGGCTGTAT TGTATTGAGT ATGATCTGCG ACCACTTGGT AAGTCGGATG CGTTGGCCTT GTGGCGATCA GTCCAGCCTG GCGTAGCAGA TGACCTCTTC CATGACCTGT ATGATGCCTT GAAGGGGAAT GCGTTGTTCG TCCGCCTGGC AGCGGATACG ATGCGCGATC AATCCATGGC TGATACGGCC AGTCTGATTA AACAGATTAA GCGTAATCCC AATGATCTCT TTGGGATTAC GCTTGGGCGG ATTAAAGGTC GATCCTTGGC CGACTGGCGG GCAATATGGA AACCTATGCT TGCGTTTTTA CTGGTCGCGC AAGAACCACT TCATCTGGAT GTGCTTGGCG ATCTCCTGGG ACATGACCAC GACACCATGC AGGATGCGAC ATTGGTCTTG GGCGGGTTGG TCAGCCAGGG GATTGATCAA CGGGTTGCAT TGCATCACCT GCTGTTTCGT GACTATTTGG CAGCATCGGT GTTCAATGAC 
CGCGAGGTGA AACGCTGGCA GCAACGGCTG GCTGATTGGT GTGCCGTTGA TCTGAATACG ATTTGGGCTG ATGATCGTGA TCCCATTGAG CAGGCACGGC GGGTCTATGC GCGACACCAC TATGTAACGC ATCTTTCATT GGCGGAAAAC TGGCCAGCAC TCTGGCAGGT CTTAGATTCG GACGATTATG GCGAACAGAA AACCCGATTT GATCCGAGTA CGCGGCTGTA TGCGCTGGAT TTGGATCGGG GACGTGAGAG TGCCATCAAC GCGGGGAAAT CCATCGACGA GCATATCCAG AATTTACCAC GATTGTGGAA GTATAGCCTG TTACGGACAA GTTTAACCAG CCGCGTTGAT CAGTGGCCAG ATGAGGCGTT TGAGGTTTTG GCGATGCTTG GCCGCACGCA CGAGGCATTG GAACGAATTG AGTTGATTTC TGATGTAATG CGACAGATCA GGCTCTGGGG GAAGGTGATT CAATGGTGTG ATGAACAGCA ACAAATAATC ACACTGCGAA TGAGGCAATG TTTAAGGGTC ATTGGGGGGA ATCACGCCGC GCTTATTGAG ATTATCAGAA TTATCGCTAC TATTGGAGAT ATCGGCCAAG CATTAACTAT TGCCCATACG GTTCAAGATA ATCAACAACA GGCCGAAGCC TATACTACCA TCGCGAGCAT TATTAATACT ACTGAGGATT CTATTCCTCT CCTTGAGCAA GCAACCTTTC TTGGACAAGC TATCCATAAT CCTTTATTGC GTGCTCATAC CCTTGCTACC GTTATAAACG CTTTCGCTAC CGTAAATGCA ATCGATCATG CTCTAGCACT TGCCTATTCT ATCGATATCG ATCGGATACG AACTAAAGTA TTGGCGGGTA TTGCCCAAAC CGTTGCTGCT CGTGGTGATC AAAATAGTGC CGAACTTTTG ATAGGACAAG CTATTAACAT TAGTGATACG CTAAAGAAAG ATAGCTCTCG ATCTACCGTT CTCATAGCTA TCGCCAAAGC AATTGTGGCT ACTGGCAACG TTGACTATGC GCTTACGATT ATCAATAATA TTGATACGAA TGGGAGCCAA GAATTAGCAC GAGAACATAT CGCGCTAACA GCTTCCGAGT ATGAATTTTT TGATCACGCC ATTCGTCTTA CAGAATCTAT CAGCGATAAT TGGAGACGGA ATAAAACCTT TAGTAGTATT GCCCTAATTG CCGCAAAAAA AGGATTTTTT GATCAGGCTA TCGCCATCAC ACAATATACT AGTTCTTATA AACAGGTAGA AACCCTCGCA TATATTGCAG AAGCAGCAGC CAGATTAAAT AATTATGACT ATGCTAAAAC GCTATCAGAA AAGGTTTTTT CAGTCGTGCA AACCATGAGT GATGAGAAAC ACCGTGCTGA AGCCCTGATT ATCCTTGCCC GAACCGCCGC TATCTTGAAG GATCACAACC GTGCCATAAT GATCTTAAAA CAGACTCTAT TGAATAAATA TAATCAGGAT GATCGGTATC GTGATGAGTC CTTGCGTGCC ATAGCGGATG TTTACATAAC AGTAAATGAC CTTAATCAGG CCATAACAAT AGCCCAGACT CTTACACGTA ATGTTGATCG AGACTCGCTA TTGAGTGTTA TTGCGAAAGC ATTCGCGCTA GATGGGAATA TCAATCAGGC ACACATCATT GCTGGGTCGA TAAAGAGTGT TTGGAGCCGA TCACATACCG TCGCCTCAAT TGCCAAGGCT ACAGCTATGA TGGGAAATAT TGAAGATGCT GTTACTCAGG CGCAATCCAT CCAAGAAGTT GATGCACAAG 
TAACCGCTTT AAGCGCTATC TCCCAAATAT TAAGTAGTAA AGGTAACCAA CAGAAGGCAA GATCCTTACG AAAACAAGCG ATACAAACCA GTCGATTAAT TACCGAATCA GATAGTCACG ATAAGGCTAT CATGGCTATC GCCCAACTTT ACGCTGTCTC GCATTATTTC GATCATGCCA TCTCCATTAT CAAGGGCATC AATAATATTT GGACGTGTAC AGAGACCTTA ATTTCCATCG CCCAAATCGC TCTCACATCA GATAAAGTTC AGGAAGCTCT GCTTATTTTC AATCAGGCAT GCCTGATTGT TAAATCGAGA ACAAGTGATA TCGTCCTAGC CGAAGTATTA AGTGTGATTG GGCAGGCAAC TGCTAGGGCG GGGTTTATTG AACAGGCTAT TCTCATAACT CAAGACATAG AGGATCACGG CAAAAAATCC AAAACCCTTA GTATTATTGC CCAAGCAATG GCTGCTGCGG GAGATTATAA ACAAGCGCAA AGCCTTGTAA AAGAGGCCAT CTCACTGACG CTCTTCATAA CGGAACTTGA TGATCGTCTT GAGGCGCTAG TTGCTATTAT CGTGACACTT GCAGTATCTG GTGATCTCGG CACAGCCATT CATCTAGCAC AATTGATTCC TGATCTTTGG AATCGGACAA AGGCACTTAC CTCTATCCTA GAAGCACATT CATATAACAA TGAGGTTGCG ATCGATGTTA TTCAATGGGA ATGGTTAATG GGTGTCACAA ACAGCGATAT GTGGAATCTC TTATTATTAA GTACTCCATT GTTGAAAACG AATCTCTGTA TGGGGATAGT CTTGCTTGAA GGAGAACAAT GGGTACAAGA ACAGTTAATC CGTATCGCTA GATAG
|
Protein sequence | MILPPRSPLP GLRHAIVIGA NGSIASGLAP LIAPEAHDAP RMAAVLASPA CGFHVQPFLG DQALMQPIRD AIETQMRAGT IGDSLIIYFS GHGTITQTAE GPDVVLVTTD TTLETIQDDP LICLRVGWLK TVLRGNTQMY PIGHVVVILD CCSSGIIDQL GNQPQMGDMP AFRRMLVSTY TSHPAYEWNR TSLYTYHLLN ALEGQAVDSF GAVTMERVHQ YCSEKLAGTE QRCGLGGWDY TGRWQFAWYD LQHPKLIHAA HHVESDRLLN LRLEGFVGRV TELADIHEHI AAIRPTGGYV VIKALAGEGK SSIIAKLIQD AGIAQTPHHF IALTTGRTYQ LELLHTVVAQ LILKHNLSRA FGLGDHVQLL KGEFFRLLDY LSKQGIEETI YLDGLDQLQP DIDGSRDLSF LPPQPPPGIV IVLGSRPDET LIPFKEKGLY CIEYDLRPLG KSDALALWRS VQPGVADDLF HDLYDALKGN ALFVRLAADT MRDQSMADTA SLIKQIKRNP NDLFGITLGR IKGRSLADWR AIWKPMLAFL LVAQEPLHLD VLGDLLGHDH DTMQDATLVL GGLVSQGIDQ RVALHHLLFR DYLAASVFND REVKRWQQRL ADWCAVDLNT IWADDRDPIE QARRVYARHH YVTHLSLAEN WPALWQVLDS DDYGEQKTRF DPSTRLYALD LDRGRESAIN AGKSIDEHIQ NLPRLWKYSL LRTSLTSRVD QWPDEAFEVL AMLGRTHEAL ERIELISDVM RQIRLWGKVI QWCDEQQQII TLRMRQCLRV IGGNHAALIE IIRIIATIGD IGQALTIAHT VQDNQQQAEA YTTIASIINT TEDSIPLLEQ ATFLGQAIHN PLLRAHTLAT VINAFATVNA IDHALALAYS IDIDRIRTKV LAGIAQTVAA RGDQNSAELL IGQAINISDT LKKDSSRSTV LIAIAKAIVA TGNVDYALTI INNIDTNGSQ ELAREHIALT ASEYEFFDHA IRLTESISDN WRRNKTFSSI ALIAAKKGFF DQAIAITQYT SSYKQVETLA YIAEAAARLN NYDYAKTLSE KVFSVVQTMS DEKHRAEALI ILARTAAILK DHNRAIMILK QTLLNKYNQD DRYRDESLRA IADVYITVND LNQAITIAQT LTRNVDRDSL LSVIAKAFAL DGNINQAHII AGSIKSVWSR SHTVASIAKA TAMMGNIEDA VTQAQSIQEV DAQVTALSAI SQILSSKGNQ QKARSLRKQA IQTSRLITES DSHDKAIMAI AQLYAVSHYF DHAISIIKGI NNIWTCTETL ISIAQIALTS DKVQEALLIF NQACLIVKSR TSDIVLAEVL SVIGQATARA GFIEQAILIT QDIEDHGKKS KTLSIIAQAM AAAGDYKQAQ SLVKEAISLT LFITELDDRL EALVAIIVTL AVSGDLGTAI HLAQLIPDLW NRTKALTSIL EAHSYNNEVA IDVIQWEWLM GVTNSDMWNL LLLSTPLLKT NLCMGIVLLE GEQWVQEQLI RIAR
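As a quick consistency check between the two sequences, the ends of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits), and it assumes the listed gene sequence is already in coding orientation, which its ATG start and TAG end suggest. The `translate` helper is hypothetical and written only for this check.

```python
from itertools import product

# Standard codon assignments, with bases in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases of the gene sequence, copied from the record.
first_30 = "ATGATTCTAC CTCCACGAAG TCCACTCCCC"
last_15 = "CGTATCGCTA GATAG"

print(translate(first_30))  # MILPPRSPLP
print(translate(last_15))   # RIAR*
```

The first ten residues (MILPPRSPLP) and the last four (RIAR, followed by the stop codon) match the start and end of the protein sequence above.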