Gene Information |
Locus tag | Haur_5125 |
Symbol | |
ID | 5737083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 171840 |
End bp | 173249 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282290 |
Product | peptidase C14 caspase catalytic subunit p20 |
Protein accession | YP_001547881 |
Protein GI | 159901635 |
COG category | [R] General function prediction only |
COG ID | [COG4249] Uncharacterized protein containing caspase domain |
TIGRFAM ID | |
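The coordinate and length fields above are mutually consistent, and a short check makes the relationship explicit. The sketch below assumes (the record does not state this) that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Haur_5125 record.
length_bp = gene_length(171840, 173249)
print(length_bp)                  # 1410
print(protein_length(length_bp))  # 469
```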
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.499707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCT TTGACCACGG CTATGCCCTT CTGATCGGCG TTGGAACTCA CCACCATCCC GCTTACTCGT TACCCGTCAC GGCCAATGAT GCGCACGCGA TCCAACGCAT CCTCAGTGAT CCTCGCTGGT GTGCCTATCC CACAGCGCAT ATTCGACTCC TGTGCAACGA AGCGGCCACA ACCGACGCGG TACTCGATGG CTTCGCGTGG CTTACCGCCT GTGCGCAGCG TGATCCGCAG GCCACGATTG TTGTCTACTA TTCCGGCCAT GGAGGTCGCA ATGAGGAGAC TGATACCTAC TGTTTGCTGC CCCATGATGC GACCACGGCA ACCAGCCGTC TCTCAACCGC TACCCTGAGC GCTGCATTGG ATGCGATCCC GGCCAAACGC CTCCTCGTCT TGCTCGATTG TTGTCACGCG GGAGGCATGA CCAAAGATGG CCCAGACCAG GACAGCCAAT GGACAACCAC GGCTCCCAGC GCCAGCATGC TCGCCGCATT ACAGCAGGGT GCGGGTCGCG TTGTCCTGTC CTCATCGACG GGCAAACAAC GGTCGTACAT TCAGCCTGAT GCAAGCCTCA GTCTCTTTAC CACCCATCTC CTTGCCGCAT TCCAAGGGGC AGGCAATCAA CCAGGGAGTA CCGAGGTGCG CGTGACCCAT CTGATTCAGT ATGTCAGTCG TCAGGTGCAG CACGCAGCAC AGGCGCTCAA TGCCCGGCAG ACTCCGTTCT TCCAGGCGGC AAGTGAAGAC TTTGTGGTCT CCTTGATTCG TGGTGGTGCA GGGCTGGGCA AAGGGGGATG GGAGGCACTC CAGCAGGACG CACACGCCGC AATTGCCGCC GCATCCCCAC CATCGCCACA TCCGACAATA ACCGATAGCA ATGTGGCCAT TGGCAACGTC GTCGGAGGCA ACCTCACGCA AACGAAGCAG TCAGGTGGCG TACACGCCGA GGGAGCCACC ATTGGTCGGA TTGGGACACT GACGGGTGGC GATAGCTATG GCGGGGATCG GATCGAAGGC TCCAAGACCG TGTATGATGG CCGAGCCACC GTCAACGACA GTACGATCAC AGGAACCGTG ACCGGACTCA GTACAGGCAC CATCATCCAC GGTGGAAGCG ACAGAGCAGC GCCTGCCCAT CCCTCGCTCA CCATGGAATG TCCCGAAATG ATGGTTTGTG GTCACGACCA TGCAGTTATC ATCCATGTTC ATGGGACGAA CCCCACGCTG CGCTATCAGC TTGATCTGCG GGCGACAGGT ATCCATGAAA CAACCTATTT TATGGGGAGC GAAGTCACGC TGTATGTTAC GCCCTCGATG CCGGGAGAGC TTCACCTGCG GGCGATCCTG AAGGCGCAGG ACGGAACCCA GATGACCAGC CAGAGCCAGC GTATCATGGT GCACCGATAG
|
Protein sequence | MATFDHGYAL LIGVGTHHHP AYSLPVTAND AHAIQRILSD PRWCAYPTAH IRLLCNEAAT TDAVLDGFAW LTACAQRDPQ ATIVVYYSGH GGRNEETDTY CLLPHDATTA TSRLSTATLS AALDAIPAKR LLVLLDCCHA GGMTKDGPDQ DSQWTTTAPS ASMLAALQQG AGRVVLSSST GKQRSYIQPD ASLSLFTTHL LAAFQGAGNQ PGSTEVRVTH LIQYVSRQVQ HAAQALNARQ TPFFQAASED FVVSLIRGGA GLGKGGWEAL QQDAHAAIAA ASPPSPHPTI TDSNVAIGNV VGGNLTQTKQ SGGVHAEGAT IGRIGTLTGG DSYGGDRIEG SKTVYDGRAT VNDSTITGTV TGLSTGTIIH GGSDRAAPAH PSLTMECPEM MVCGHDHAVI IHVHGTNPTL RYQLDLRATG IHETTYFMGS EVTLYVTPSM PGELHLRAIL KAQDGTQMTS QSQRIMVHR
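The sequences are displayed in space-separated blocks of ten characters, so the blocks need to be joined before any downstream use. A minimal sketch of that cleanup, together with a GC fraction calculation, is shown here on the first two blocks of the gene sequence only. The helper names are hypothetical, and the record does not state how its GC content figure was computed or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence from the record.
prefix = clean_sequence("ATGGCAACCT TTGACCACGG")
print(prefix)               # ATGGCAACCTTTGACCACGG
print(gc_fraction(prefix))  # 0.55
```

Pasting the full Gene sequence field into `clean_sequence` would give a string that can be passed to `gc_fraction` or to a translation tool using table 11.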