Gene Haur_5166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5166 
Symbol 
ID5737124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp241525 
End bp246009 
Gene Length4485 bp 
Protein Length1494 aa 
Translation table11 
GC content48% 
IMG OID641282331 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_001547922 
Protein GI159901676 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCTAC CTCCACGAAG TCCACTCCCC GGTCTACGCC ATGCGATTGT GATTGGCGCA 
AATGGATCAA TCGCATCGGG GCTTGCGCCG TTAATCGCCC CCGAAGCCCA TGATGCGCCA
CGTATGGCAG CCGTCCTTGC CAGCCCTGCC TGTGGTTTCC ACGTGCAACC GTTTCTTGGT
GACCAGGCGT TGATGCAGCC GATTCGTGAT GCCATCGAAA CACAGATGCG GGCTGGTACG
ATTGGTGATT CACTCATCAT TTATTTCAGC GGCCATGGAA CGATAACCCA AACGGCTGAG
GGTCCTGATG TCGTGTTGGT CACAACGGAT ACAACGCTTG AAACTATTCA GGATGATCCG
CTCATCTGCC TCCGCGTCGG GTGGTTGAAA ACAGTCCTGC GCGGGAATAC GCAGATGTAC
CCAATTGGAC ATGTGGTGGT GATTCTTGAC TGCTGTTCCA GCGGGATAAT TGACCAACTT
GGCAATCAGC CCCAGATGGG AGATATGCCA GCCTTCCGCC GCATGCTGGT ATCAACCTAT
ACCAGCCATC CAGCCTATGA ATGGAATCGA ACCAGCCTCT ATACCTACCA TTTGCTCAAC
GCCTTGGAAG GGCAGGCCGT GGATAGTTTT GGTGCGGTCA CGATGGAGCG AGTCCATCAG
TACTGTAGCG AAAAACTTGC CGGAACAGAG CAGCGCTGTG GGCTTGGTGG ATGGGATTAC
ACGGGACGCT GGCAGTTTGC GTGGTATGAC CTGCAACATC CGAAACTCAT CCATGCCGCA
CACCATGTTG AAAGTGATCG CCTGCTCAAT TTGCGCCTTG AGGGCTTTGT CGGTCGGGTC
ACCGAACTGG CGGATATCCA CGAGCACATT GCAGCCATAC GCCCAACGGG GGGCTATGTC
GTGATTAAGG CGCTGGCCGG AGAAGGAAAA AGTAGCATTA TCGCCAAGCT CATTCAGGAT
GCGGGGATTG CGCAGACACC CCATCACTTC ATTGCCCTGA CCACGGGCCG CACGTACCAA
TTGGAGTTGT TACATACGGT CGTGGCGCAA TTGATTCTCA AACACAATCT CTCAAGAGCA
TTCGGTCTTG GAGATCATGT CCAGTTATTA AAAGGGGAGT TTTTTCGCCT GCTTGACTAT
CTCTCAAAAC AAGGAATCGA GGAAACGATC TATCTTGATG GCCTCGATCA ACTCCAGCCT
GACATTGATG GTTCGCGCGA CTTGTCGTTT CTGCCGCCGC AGCCACCACC AGGCATCGTG
ATCGTGCTTG GATCGCGGCC CGATGAGACG TTGATACCAT TTAAAGAGAA GGGGCTGTAT
TGTATTGAGT ATGATCTGCG ACCACTTGGT AAGTCGGATG CGTTGGCCTT GTGGCGATCA
GTCCAGCCTG GCGTAGCAGA TGACCTCTTC CATGACCTGT ATGATGCCTT GAAGGGGAAT
GCGTTGTTCG TCCGCCTGGC AGCGGATACG ATGCGCGATC AATCCATGGC TGATACGGCC
AGTCTGATTA AACAGATTAA GCGTAATCCC AATGATCTCT TTGGGATTAC GCTTGGGCGG
ATTAAAGGTC GATCCTTGGC CGACTGGCGG GCAATATGGA AACCTATGCT TGCGTTTTTA
CTGGTCGCGC AAGAACCACT TCATCTGGAT GTGCTTGGCG ATCTCCTGGG ACATGACCAC
GACACCATGC AGGATGCGAC ATTGGTCTTG GGCGGGTTGG TCAGCCAGGG GATTGATCAA
CGGGTTGCAT TGCATCACCT GCTGTTTCGT GACTATTTGG CAGCATCGGT GTTCAATGAC
CGCGAGGTGA AACGCTGGCA GCAACGGCTG GCTGATTGGT GTGCCGTTGA TCTGAATACG
ATTTGGGCTG ATGATCGTGA TCCCATTGAG CAGGCACGGC GGGTCTATGC GCGACACCAC
TATGTAACGC ATCTTTCATT GGCGGAAAAC TGGCCAGCAC TCTGGCAGGT CTTAGATTCG
GACGATTATG GCGAACAGAA AACCCGATTT GATCCGAGTA CGCGGCTGTA TGCGCTGGAT
TTGGATCGGG GACGTGAGAG TGCCATCAAC GCGGGGAAAT CCATCGACGA GCATATCCAG
AATTTACCAC GATTGTGGAA GTATAGCCTG TTACGGACAA GTTTAACCAG CCGCGTTGAT
CAGTGGCCAG ATGAGGCGTT TGAGGTTTTG GCGATGCTTG GCCGCACGCA CGAGGCATTG
GAACGAATTG AGTTGATTTC TGATGTAATG CGACAGATCA GGCTCTGGGG GAAGGTGATT
CAATGGTGTG ATGAACAGCA ACAAATAATC ACACTGCGAA TGAGGCAATG TTTAAGGGTC
ATTGGGGGGA ATCACGCCGC GCTTATTGAG ATTATCAGAA TTATCGCTAC TATTGGAGAT
ATCGGCCAAG CATTAACTAT TGCCCATACG GTTCAAGATA ATCAACAACA GGCCGAAGCC
TATACTACCA TCGCGAGCAT TATTAATACT ACTGAGGATT CTATTCCTCT CCTTGAGCAA
GCAACCTTTC TTGGACAAGC TATCCATAAT CCTTTATTGC GTGCTCATAC CCTTGCTACC
GTTATAAACG CTTTCGCTAC CGTAAATGCA ATCGATCATG CTCTAGCACT TGCCTATTCT
ATCGATATCG ATCGGATACG AACTAAAGTA TTGGCGGGTA TTGCCCAAAC CGTTGCTGCT
CGTGGTGATC AAAATAGTGC CGAACTTTTG ATAGGACAAG CTATTAACAT TAGTGATACG
CTAAAGAAAG ATAGCTCTCG ATCTACCGTT CTCATAGCTA TCGCCAAAGC AATTGTGGCT
ACTGGCAACG TTGACTATGC GCTTACGATT ATCAATAATA TTGATACGAA TGGGAGCCAA
GAATTAGCAC GAGAACATAT CGCGCTAACA GCTTCCGAGT ATGAATTTTT TGATCACGCC
ATTCGTCTTA CAGAATCTAT CAGCGATAAT TGGAGACGGA ATAAAACCTT TAGTAGTATT
GCCCTAATTG CCGCAAAAAA AGGATTTTTT GATCAGGCTA TCGCCATCAC ACAATATACT
AGTTCTTATA AACAGGTAGA AACCCTCGCA TATATTGCAG AAGCAGCAGC CAGATTAAAT
AATTATGACT ATGCTAAAAC GCTATCAGAA AAGGTTTTTT CAGTCGTGCA AACCATGAGT
GATGAGAAAC ACCGTGCTGA AGCCCTGATT ATCCTTGCCC GAACCGCCGC TATCTTGAAG
GATCACAACC GTGCCATAAT GATCTTAAAA CAGACTCTAT TGAATAAATA TAATCAGGAT
GATCGGTATC GTGATGAGTC CTTGCGTGCC ATAGCGGATG TTTACATAAC AGTAAATGAC
CTTAATCAGG CCATAACAAT AGCCCAGACT CTTACACGTA ATGTTGATCG AGACTCGCTA
TTGAGTGTTA TTGCGAAAGC ATTCGCGCTA GATGGGAATA TCAATCAGGC ACACATCATT
GCTGGGTCGA TAAAGAGTGT TTGGAGCCGA TCACATACCG TCGCCTCAAT TGCCAAGGCT
ACAGCTATGA TGGGAAATAT TGAAGATGCT GTTACTCAGG CGCAATCCAT CCAAGAAGTT
GATGCACAAG TAACCGCTTT AAGCGCTATC TCCCAAATAT TAAGTAGTAA AGGTAACCAA
CAGAAGGCAA GATCCTTACG AAAACAAGCG ATACAAACCA GTCGATTAAT TACCGAATCA
GATAGTCACG ATAAGGCTAT CATGGCTATC GCCCAACTTT ACGCTGTCTC GCATTATTTC
GATCATGCCA TCTCCATTAT CAAGGGCATC AATAATATTT GGACGTGTAC AGAGACCTTA
ATTTCCATCG CCCAAATCGC TCTCACATCA GATAAAGTTC AGGAAGCTCT GCTTATTTTC
AATCAGGCAT GCCTGATTGT TAAATCGAGA ACAAGTGATA TCGTCCTAGC CGAAGTATTA
AGTGTGATTG GGCAGGCAAC TGCTAGGGCG GGGTTTATTG AACAGGCTAT TCTCATAACT
CAAGACATAG AGGATCACGG CAAAAAATCC AAAACCCTTA GTATTATTGC CCAAGCAATG
GCTGCTGCGG GAGATTATAA ACAAGCGCAA AGCCTTGTAA AAGAGGCCAT CTCACTGACG
CTCTTCATAA CGGAACTTGA TGATCGTCTT GAGGCGCTAG TTGCTATTAT CGTGACACTT
GCAGTATCTG GTGATCTCGG CACAGCCATT CATCTAGCAC AATTGATTCC TGATCTTTGG
AATCGGACAA AGGCACTTAC CTCTATCCTA GAAGCACATT CATATAACAA TGAGGTTGCG
ATCGATGTTA TTCAATGGGA ATGGTTAATG GGTGTCACAA ACAGCGATAT GTGGAATCTC
TTATTATTAA GTACTCCATT GTTGAAAACG AATCTCTGTA TGGGGATAGT CTTGCTTGAA
GGAGAACAAT GGGTACAAGA ACAGTTAATC CGTATCGCTA GATAG
 
Protein sequence
MILPPRSPLP GLRHAIVIGA NGSIASGLAP LIAPEAHDAP RMAAVLASPA CGFHVQPFLG 
DQALMQPIRD AIETQMRAGT IGDSLIIYFS GHGTITQTAE GPDVVLVTTD TTLETIQDDP
LICLRVGWLK TVLRGNTQMY PIGHVVVILD CCSSGIIDQL GNQPQMGDMP AFRRMLVSTY
TSHPAYEWNR TSLYTYHLLN ALEGQAVDSF GAVTMERVHQ YCSEKLAGTE QRCGLGGWDY
TGRWQFAWYD LQHPKLIHAA HHVESDRLLN LRLEGFVGRV TELADIHEHI AAIRPTGGYV
VIKALAGEGK SSIIAKLIQD AGIAQTPHHF IALTTGRTYQ LELLHTVVAQ LILKHNLSRA
FGLGDHVQLL KGEFFRLLDY LSKQGIEETI YLDGLDQLQP DIDGSRDLSF LPPQPPPGIV
IVLGSRPDET LIPFKEKGLY CIEYDLRPLG KSDALALWRS VQPGVADDLF HDLYDALKGN
ALFVRLAADT MRDQSMADTA SLIKQIKRNP NDLFGITLGR IKGRSLADWR AIWKPMLAFL
LVAQEPLHLD VLGDLLGHDH DTMQDATLVL GGLVSQGIDQ RVALHHLLFR DYLAASVFND
REVKRWQQRL ADWCAVDLNT IWADDRDPIE QARRVYARHH YVTHLSLAEN WPALWQVLDS
DDYGEQKTRF DPSTRLYALD LDRGRESAIN AGKSIDEHIQ NLPRLWKYSL LRTSLTSRVD
QWPDEAFEVL AMLGRTHEAL ERIELISDVM RQIRLWGKVI QWCDEQQQII TLRMRQCLRV
IGGNHAALIE IIRIIATIGD IGQALTIAHT VQDNQQQAEA YTTIASIINT TEDSIPLLEQ
ATFLGQAIHN PLLRAHTLAT VINAFATVNA IDHALALAYS IDIDRIRTKV LAGIAQTVAA
RGDQNSAELL IGQAINISDT LKKDSSRSTV LIAIAKAIVA TGNVDYALTI INNIDTNGSQ
ELAREHIALT ASEYEFFDHA IRLTESISDN WRRNKTFSSI ALIAAKKGFF DQAIAITQYT
SSYKQVETLA YIAEAAARLN NYDYAKTLSE KVFSVVQTMS DEKHRAEALI ILARTAAILK
DHNRAIMILK QTLLNKYNQD DRYRDESLRA IADVYITVND LNQAITIAQT LTRNVDRDSL
LSVIAKAFAL DGNINQAHII AGSIKSVWSR SHTVASIAKA TAMMGNIEDA VTQAQSIQEV
DAQVTALSAI SQILSSKGNQ QKARSLRKQA IQTSRLITES DSHDKAIMAI AQLYAVSHYF
DHAISIIKGI NNIWTCTETL ISIAQIALTS DKVQEALLIF NQACLIVKSR TSDIVLAEVL
SVIGQATARA GFIEQAILIT QDIEDHGKKS KTLSIIAQAM AAAGDYKQAQ SLVKEAISLT
LFITELDDRL EALVAIIVTL AVSGDLGTAI HLAQLIPDLW NRTKALTSIL EAHSYNNEVA
IDVIQWEWLM GVTNSDMWNL LLLSTPLLKT NLCMGIVLLE GEQWVQEQLI RIAR