Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2203 |
Symbol | |
ID | 5734090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2799137 |
End bp | 2800336 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279344 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001544971 |
Protein GI | 159898724 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0116357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATCT TGGCGGTTAG TGCGTTTGCT GGCTCAAATT CATCATTCGC CCAGTCACGC GATGTAGAAA AACCCGTCGA TATTGGCGAT GCCGTTGGTC TCGACCTCAA TGCCCCTGCT GTTCCTGGTC AGTTTGTCAT CAAATTCAAG AATTCAACCT CAAAAGCTAG CCGCGCCAAT AGCTTGAGTG CCTTGGGCGC AGTGCAAATC GATCGGATCG AAGCGCTTGA CGCTGAAGTC GTCGAATTCG CTAGCTTGAA GAGCAACGAT AGTTTGGCAA TGCGCCAAGC CATGGTTGAA AGCTTGCTCA AAGATGGCAA CATCGAATAT GCCGAACCCA ACTTTATCTA TACTTCAACC TACACTCCCA ACGACCCAGG TCGTAGCTCA CAATGGGCAT GGGGTGTAAC CCAAGCATAC ACTGGTTGGG ATATCACGCG CGGTAGCAGC AGCGTTGTCG TTGCGGTTGT TGACACTGGG ATTCAAAGCA CTCACCCTGA TTTGGATGCC AAAATTGTCG CTGGCTACGA CTACATCGAT AATGACTCAA CGCCAAATGA TGGAAATGGC CACGGTACGC ACGTCGCTGG GACGGTTGCT GCTGAAACCA ACAATAGCAC TGGTGGCGCA GGAACCTGCC CCAACTGTCG CTTGATGGGC GTTCGCGTCT TGAATAACAG CGGTAGCGGT ACCTTGGCTG GTGTGGCCAA TGGCATCACC TACGCTGCTA ACAATGGCGC AAAGGTCATC AACTTAAGCC TTGGTGGCGG TGGTTCAACG GCCTTGCAAA ATGCCGTCAA CTACGCTTGG GGCCGTGGAG TATTCTTGGC TTGTGCCGCT GGTAACAGCA ACACCTCAAG CACCACCAGC GCTTACCCAG CTGCGTATAC CAACTGTTTT GCGGTTGCAT CAACGACTTC AACCGATGCC CGCTCATCAT TCTCAAACTA TGGTACATGG GTCGAAGTGG CTGCCCCTGG TTCGAGCATC TACTCAACCT GGATTAACAG TGGCTACAAC ACGATCAATG GTACCTCAAT GGCTACCCCA CACGTTGCCG GTTTGGCTGG CTTGTTGTCA TCACAAGGCT TGACCAACAG CCAAATCAAG AGCAAAATCT GCTCAAGCTC CGACCAAATT AGCGGGACTG GCACGCGCTG GACTTGCGGT CGGATCAACA TCTACAAAGC TGTTCAATAG
|
Protein sequence | MLILAVSAFA GSNSSFAQSR DVEKPVDIGD AVGLDLNAPA VPGQFVIKFK NSTSKASRAN SLSALGAVQI DRIEALDAEV VEFASLKSND SLAMRQAMVE SLLKDGNIEY AEPNFIYTST YTPNDPGRSS QWAWGVTQAY TGWDITRGSS SVVVAVVDTG IQSTHPDLDA KIVAGYDYID NDSTPNDGNG HGTHVAGTVA AETNNSTGGA GTCPNCRLMG VRVLNNSGSG TLAGVANGIT YAANNGAKVI NLSLGGGGST ALQNAVNYAW GRGVFLACAA GNSNTSSTTS AYPAAYTNCF AVASTTSTDA RSSFSNYGTW VEVAAPGSSI YSTWINSGYN TINGTSMATP HVAGLAGLLS SQGLTNSQIK SKICSSSDQI SGTGTRWTCG RINIYKAVQ
|
| |