Gene Haur_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2079 
Symbol 
ID5733967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2588427 
End bp2590640 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content49% 
IMG OID641279220 
Productpeptidase C14, caspase catalytic subunit P20 
Protein accessionYP_001544847 
Protein GI159898600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTGGCGC AGCTGATTCT CAAGCATCAT CTCACCGTGT CCTATTTTCC CGAAGAAAGC 
TATCCGGCGA TGAAGGGGGA ATTTGCGCGG ATTCTTGATG AACTCTCGAA ACAGGGGATT
CACGAGACCA TCTATCTGGA TGGCCTCGAT CAACTCCAGC CAGAAATTGA CGGCTCGCGG
GATTTATCAT TCTTGCCACC ACAGCCGCCC CAAGGCATGG TGATCGTGCT TGGCTCACGG
CCTGATGAGA CCTTGAAACC GCTGGAGATT CTGCATCGGG TCGATTACGA TCTGCCACCA
CTGAGTGAAA CCGATGCACT GTCGCTCTGG CGATCGGTCC AGCCTGGCAT GGCAGATGGC
CTATTCCATG ACCTGTATAC GGCGCTGAAG GGCAATGCAT TATTTGTCCA TTTAGCGGCG
GATACGATGC AGGATGCGTC GGTGGTCGAT GCGACCAGTC TCATCAAACA GATTGAGCAG
AATCCACAAA ATTTGTTTGG GATTACGCTG GAACGGATTA AACGCGTGCC CCTCTCCAAG
TGGGATACGG TGTGGAAGCC CATGCTGGCG CTCTTGCTCG TCGCCCAAGA ACCATTGCGA
CTGGATGTGT TGGGCGATCT GCTGGGACAT GACCACGACA CCATGCAGGA TGCAGTGTTG
GTTTTAGGTG GGTTGGTCAG CCAAGGTATT GATCACCAGG TTGCCTTACA TCACCTGCTG
TTTCGTGACT ATTTGGCGGC ATCGGTGTTC AATGATCGTG ACGTCAAACG CTGGCATCAA
GAGTTGTCCA ACTGGTGTGC GAAGGATGTG GATGCGATTT GGTCCGATGA TCGCGATCCC
ATTGAGCAGG CACGGCGGGT CTATGCGCGA CATCATTACA TCACCCATCT CTCAGGGGCA
GAGAACTGGC CTACACTCTG GCAGGTCTTG GATGCGGGCG ATTATGGGGA ACAGAAAACC
CGATTTGATC CGAGTACCCG GCTCTATGCC TTGGATTTGG ATCGCGGGCG AGAGAGTGTG
ATTAAGGCAG GTCAATCAAC CGAGGAACAT ATCCAAAATT TGCCCCGCCT GTGGAAGTAT
AGTTTGCTGC GGACGAGCTT AACCAGCCGT GTTGATCAGT GGCCAGATAA TGCGTTTGAG
GTTTTGGTGA TCCTTGGGCG TACACAAGAA GCTCTAGAGC GTATTCAATT GCTTTCTGAT
CCAGAACAGC AGATCACTCT TTGGTGTAAA ATGTTGGCTC AGTGTGATAC AAAAAACTAT
AACATAATAT TAAACCGAAT ACAACAATCA GTCAGCCAAC TTTCACGTCA TCATACTGAT
ACCCTCGCAA CAATTGCCGA AACTACCGCT ACCAGTGGGA ATATCGACCG TGCACTCTCC
ATTACTGCCG CCATTGAAAA TAATGATCGC ACAGGAATCC TTAAAAAAAT TGCCAGAACC
GCCGCTACTA GTGGAGATAT CGACCGTGCA CTCTCCATTG CTGCCACCAT TGAAAATGAT
AATTCCCGTA GCTATGCTCT CACAGCAATT GCCGAAGCCA TCGCCACCAG TGGGAATATC
GACCGTGCAC TCTCCATCGC TGCCACCATT GAAAATGATA ATTCCCGTAG CTATGCTCTC
AAAAAAATTG CCGAAACTAT CGCTACCAGC GGGGATATCG ACCGTGCACT CTCCATTGCT
GCCACCATTG AAAATGATAA TTCCCGTAGC TATGCTCTCA CAGCAATTGC CGAAGCCATC
GCCACCAGTG GGAATATCGA CCGTGCACTC TCCATCGCTG CCACCATTGA AAATGATAAT
TCCCGTAGCT ATGCTCTCAA AAAAATTGCC GAAGCCATCG CTACCAGTGG GGATATCGAC
CGTGCGCTCT CCATTGCTGC CACTATTGAA AATAATGATC GCACAGGAAT CCTCGCAACA
ATTGCCGAAA CCGCCGTTGC CAGTGGAGAT ATCGACCGTG CACTCTCCAT TGCCGCCACG
ATCCCTGATG ATCGAATGCG AATCAATACT TTTATAACTA TGGCCCCAGT GATTCAGTCA
GCTGAAAAAA TTCTCTCGAT TATCCATAAG GAATGGTTTC AGAGTAAAAC TCATGATATG
ACTTTGTATT CATTTTGTCT TATAAATCTC TTATCAGAAA ATAATCTTTG GCTAATTACA
ACAATTATAA AAAGTCAGGG GTGGGTCAAT GACCAGTTGA AACGGCTGGG GTAG
 
Protein sequence
MVAQLILKHH LTVSYFPEES YPAMKGEFAR ILDELSKQGI HETIYLDGLD QLQPEIDGSR 
DLSFLPPQPP QGMVIVLGSR PDETLKPLEI LHRVDYDLPP LSETDALSLW RSVQPGMADG
LFHDLYTALK GNALFVHLAA DTMQDASVVD ATSLIKQIEQ NPQNLFGITL ERIKRVPLSK
WDTVWKPMLA LLLVAQEPLR LDVLGDLLGH DHDTMQDAVL VLGGLVSQGI DHQVALHHLL
FRDYLAASVF NDRDVKRWHQ ELSNWCAKDV DAIWSDDRDP IEQARRVYAR HHYITHLSGA
ENWPTLWQVL DAGDYGEQKT RFDPSTRLYA LDLDRGRESV IKAGQSTEEH IQNLPRLWKY
SLLRTSLTSR VDQWPDNAFE VLVILGRTQE ALERIQLLSD PEQQITLWCK MLAQCDTKNY
NIILNRIQQS VSQLSRHHTD TLATIAETTA TSGNIDRALS ITAAIENNDR TGILKKIART
AATSGDIDRA LSIAATIEND NSRSYALTAI AEAIATSGNI DRALSIAATI ENDNSRSYAL
KKIAETIATS GDIDRALSIA ATIENDNSRS YALTAIAEAI ATSGNIDRAL SIAATIENDN
SRSYALKKIA EAIATSGDID RALSIAATIE NNDRTGILAT IAETAVASGD IDRALSIAAT
IPDDRMRINT FITMAPVIQS AEKILSIIHK EWFQSKTHDM TLYSFCLINL LSENNLWLIT
TIIKSQGWVN DQLKRLG