Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0516 |
Symbol | |
ID | 5732432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 598973 |
End bp | 600397 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277643 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001543293 |
Protein GI | 159897046 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5640] Secreted trypsin-like serine protease |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGAT CTGGACGCTG GACAATTCGT CGGGTACGGT CTGCGGTCAT TGCAACGGTG TTGAGCACAT CAGTATTGTT AGGCGGCTAC GCTGCCTCAG CCAAAGACAA CAAAAAAGTC GAAGTTTATC CGCTCCCTGT TGTTGACGAA AAGCAACCTG GTTCCGAACA ATTGCCCCCA CCAGATAAAA TTGTCGGTGG CTCGGCGGCT ACTGCTGGTG AATTCCCCTG GCAAGCTCGG ATAGCTCGTA ACGGCAGCCT ACATTGTGGT GGCTCGTTGA TTGCTCCCCA ATGGGTTTTG ACTGCTGCGC ACTGTGTTCA AGGCTTCTCG GTATCATCAC TCAGCGTGGT GATGGGCGAC CATAACTGGA CGACCAACGA AGGCACCGAA CAAAGCCGCA CAATTGCTCA AGCAGTTGTT CACCCAAGCT ACAATTCATC AACCTACGAC AACGACATTG CTTTGTTGAA ACTCAGCAGC GCTGTAACCC TCAACAGCCG CGTTGCCGTG ATTCCGTTCG CCACCAGCGC TGATAGCGCC TTGTACAACG CTGGCGTTGT TTCAACCGTC ACTGGTTGGG GCGCGTTGAC CGAAGGTGGT TCATCACCAA ACGTCTTGTA CAAAGTGCAA GTGCCTGTGG TTTCAACCGC TACCTGTAAC GCCTCAAACG CCTACAACGG CCAAATCACT GGCAACATGG TGTGTGCTGG CTACGCTGCT GGCGGCAAAG ACTCATGCCA AGGCGATAGC GGTGGTCCAT TCGTCGCTCA AAGCAGCGGC TCATGGAAAC TCAGCGGTGT TGTGAGCTGG GGCGATGGTT GTGCCCGCGC CAATAAGTAT GGCGTGTACA CCAAAGTTTC CAACTACACC AGCTGGATCA ACAGCTATGT CGGTACGGTA ACCCCAACCA GCACGCCAGT GCCAGGTACT CCAGTGCCAA CCAGCACGCC AGTACCAGGT ACTCCAGTGC CAACCAGCAC GCCAGTGCCA GGTGGTAGCT TGCAAAATGG TGGCTTCGAA AGCAGCGCTA GCTGGGTTCA ATCACCAAGC AATATCATCT CAACCACTCG CCCACGCAGC GGCTCGTATA GCGCCTTCTT GGGTGGCTAC AACAGCGGCA CCGATAACAT CTATCAAAGC GTGACGGTTC CATCAAATGG TGTGTTGCGC TACTACTGGT ACATGAGCAC CCAAGAAAGT GGCAGCACTG TCTACGACCG CTTGTATGTT CGCCTCTACA ACAGCAGCGG CAGCTTGATC ACCACCTTGC GCACCTGGAG CAACGCGAGC ACCAAGAACA CTTGGACGCT TGACACGATT AGCCTCTCAG CCTACGCTGG CCAAACCGTG CGTGTCCAAT TCGTTGGCAC TACCGATAGC AGCTTGACCA CCTCGTTCTT CGTGGACGAT GTAACTCTGC AATAA
|
Protein sequence | MERSGRWTIR RVRSAVIATV LSTSVLLGGY AASAKDNKKV EVYPLPVVDE KQPGSEQLPP PDKIVGGSAA TAGEFPWQAR IARNGSLHCG GSLIAPQWVL TAAHCVQGFS VSSLSVVMGD HNWTTNEGTE QSRTIAQAVV HPSYNSSTYD NDIALLKLSS AVTLNSRVAV IPFATSADSA LYNAGVVSTV TGWGALTEGG SSPNVLYKVQ VPVVSTATCN ASNAYNGQIT GNMVCAGYAA GGKDSCQGDS GGPFVAQSSG SWKLSGVVSW GDGCARANKY GVYTKVSNYT SWINSYVGTV TPTSTPVPGT PVPTSTPVPG TPVPTSTPVP GGSLQNGGFE SSASWVQSPS NIISTTRPRS GSYSAFLGGY NSGTDNIYQS VTVPSNGVLR YYWYMSTQES GSTVYDRLYV RLYNSSGSLI TTLRTWSNAS TKNTWTLDTI SLSAYAGQTV RVQFVGTTDS SLTTSFFVDD VTLQ
|
| |