Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2079 |
Symbol | |
ID | 5733967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2588427 |
End bp | 2590640 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279220 |
Product | peptidase C14, caspase catalytic subunit P20 |
Protein accession | YP_001544847 |
Protein GI | 159898600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGGCGC AGCTGATTCT CAAGCATCAT CTCACCGTGT CCTATTTTCC CGAAGAAAGC TATCCGGCGA TGAAGGGGGA ATTTGCGCGG ATTCTTGATG AACTCTCGAA ACAGGGGATT CACGAGACCA TCTATCTGGA TGGCCTCGAT CAACTCCAGC CAGAAATTGA CGGCTCGCGG GATTTATCAT TCTTGCCACC ACAGCCGCCC CAAGGCATGG TGATCGTGCT TGGCTCACGG CCTGATGAGA CCTTGAAACC GCTGGAGATT CTGCATCGGG TCGATTACGA TCTGCCACCA CTGAGTGAAA CCGATGCACT GTCGCTCTGG CGATCGGTCC AGCCTGGCAT GGCAGATGGC CTATTCCATG ACCTGTATAC GGCGCTGAAG GGCAATGCAT TATTTGTCCA TTTAGCGGCG GATACGATGC AGGATGCGTC GGTGGTCGAT GCGACCAGTC TCATCAAACA GATTGAGCAG AATCCACAAA ATTTGTTTGG GATTACGCTG GAACGGATTA AACGCGTGCC CCTCTCCAAG TGGGATACGG TGTGGAAGCC CATGCTGGCG CTCTTGCTCG TCGCCCAAGA ACCATTGCGA CTGGATGTGT TGGGCGATCT GCTGGGACAT GACCACGACA CCATGCAGGA TGCAGTGTTG GTTTTAGGTG GGTTGGTCAG CCAAGGTATT GATCACCAGG TTGCCTTACA TCACCTGCTG TTTCGTGACT ATTTGGCGGC ATCGGTGTTC AATGATCGTG ACGTCAAACG CTGGCATCAA GAGTTGTCCA ACTGGTGTGC GAAGGATGTG GATGCGATTT GGTCCGATGA TCGCGATCCC ATTGAGCAGG CACGGCGGGT CTATGCGCGA CATCATTACA TCACCCATCT CTCAGGGGCA GAGAACTGGC CTACACTCTG GCAGGTCTTG GATGCGGGCG ATTATGGGGA ACAGAAAACC CGATTTGATC CGAGTACCCG GCTCTATGCC TTGGATTTGG ATCGCGGGCG AGAGAGTGTG ATTAAGGCAG GTCAATCAAC CGAGGAACAT ATCCAAAATT TGCCCCGCCT GTGGAAGTAT AGTTTGCTGC GGACGAGCTT AACCAGCCGT GTTGATCAGT GGCCAGATAA TGCGTTTGAG GTTTTGGTGA TCCTTGGGCG TACACAAGAA GCTCTAGAGC GTATTCAATT GCTTTCTGAT CCAGAACAGC AGATCACTCT TTGGTGTAAA ATGTTGGCTC AGTGTGATAC AAAAAACTAT AACATAATAT TAAACCGAAT ACAACAATCA GTCAGCCAAC TTTCACGTCA TCATACTGAT ACCCTCGCAA CAATTGCCGA AACTACCGCT ACCAGTGGGA ATATCGACCG TGCACTCTCC ATTACTGCCG CCATTGAAAA TAATGATCGC ACAGGAATCC TTAAAAAAAT TGCCAGAACC GCCGCTACTA GTGGAGATAT CGACCGTGCA CTCTCCATTG CTGCCACCAT TGAAAATGAT AATTCCCGTA GCTATGCTCT CACAGCAATT GCCGAAGCCA TCGCCACCAG TGGGAATATC GACCGTGCAC TCTCCATCGC TGCCACCATT GAAAATGATA ATTCCCGTAG CTATGCTCTC AAAAAAATTG CCGAAACTAT CGCTACCAGC GGGGATATCG ACCGTGCACT CTCCATTGCT GCCACCATTG AAAATGATAA TTCCCGTAGC TATGCTCTCA CAGCAATTGC CGAAGCCATC GCCACCAGTG GGAATATCGA CCGTGCACTC TCCATCGCTG CCACCATTGA AAATGATAAT TCCCGTAGCT ATGCTCTCAA AAAAATTGCC GAAGCCATCG CTACCAGTGG GGATATCGAC CGTGCGCTCT CCATTGCTGC CACTATTGAA AATAATGATC GCACAGGAAT CCTCGCAACA ATTGCCGAAA CCGCCGTTGC CAGTGGAGAT ATCGACCGTG CACTCTCCAT TGCCGCCACG ATCCCTGATG ATCGAATGCG AATCAATACT TTTATAACTA TGGCCCCAGT GATTCAGTCA GCTGAAAAAA TTCTCTCGAT TATCCATAAG GAATGGTTTC AGAGTAAAAC TCATGATATG ACTTTGTATT CATTTTGTCT TATAAATCTC TTATCAGAAA ATAATCTTTG GCTAATTACA ACAATTATAA AAAGTCAGGG GTGGGTCAAT GACCAGTTGA AACGGCTGGG GTAG
|
Protein sequence | MVAQLILKHH LTVSYFPEES YPAMKGEFAR ILDELSKQGI HETIYLDGLD QLQPEIDGSR DLSFLPPQPP QGMVIVLGSR PDETLKPLEI LHRVDYDLPP LSETDALSLW RSVQPGMADG LFHDLYTALK GNALFVHLAA DTMQDASVVD ATSLIKQIEQ NPQNLFGITL ERIKRVPLSK WDTVWKPMLA LLLVAQEPLR LDVLGDLLGH DHDTMQDAVL VLGGLVSQGI DHQVALHHLL FRDYLAASVF NDRDVKRWHQ ELSNWCAKDV DAIWSDDRDP IEQARRVYAR HHYITHLSGA ENWPTLWQVL DAGDYGEQKT RFDPSTRLYA LDLDRGRESV IKAGQSTEEH IQNLPRLWKY SLLRTSLTSR VDQWPDNAFE VLVILGRTQE ALERIQLLSD PEQQITLWCK MLAQCDTKNY NIILNRIQQS VSQLSRHHTD TLATIAETTA TSGNIDRALS ITAAIENNDR TGILKKIART AATSGDIDRA LSIAATIEND NSRSYALTAI AEAIATSGNI DRALSIAATI ENDNSRSYAL KKIAETIATS GDIDRALSIA ATIENDNSRS YALTAIAEAI ATSGNIDRAL SIAATIENDN SRSYALKKIA EAIATSGDID RALSIAATIE NNDRTGILAT IAETAVASGD IDRALSIAAT IPDDRMRINT FITMAPVIQS AEKILSIIHK EWFQSKTHDM TLYSFCLINL LSENNLWLIT TIIKSQGWVN DQLKRLG
|
| |