Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2156 |
Symbol | |
ID | 5734029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2717011 |
End bp | 2718216 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279297 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001544924 |
Protein GI | 159898677 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGCGTC TATGGCTAAG TACGTTGGTT GTCGGCTGTT TGCTTGGTTT GGCTACTCCT ACAAACACTG CTGGGCAGAC TCGTTTGAAA GAAAAAACAC CTGAACAATT GGCTGAGGCC GAAGCACCCG CAGTTGCCGG CCAATACCTG ATTAAATTTA AGCCTGCGCT GAATACAGCA TCTCGCAGCA CCACCCTTAA AAACCTCGGA GCCGATCATC TCCAACATCT CGCTAGCCTT GATCTTGAGT TAATTGAATT TGCTCCACTC AAGCAGAACG CTACTCCAGA GCAAACTGAA CGAGTGCTGG CCGAGCTAAA AAACCATCCA GCAATTGAAT ATGTTGAGCC AAACTACCTC TATGCCCCAC TGTATACGCC GAACGATCCT GGGTTGGGCC AACAATGGGC TTGGGGAGTC ATCAAAGCCT ATGATGGTTG GAATATTACC CAAGGTAGCT CTAGCGTGAT CATCGCGATT GTTGACACTG GCATCCAAAC CAACCACCCT GATCTTGATG CCAAAATTGT GGCTGGCTAC GATTTTGTTG ATAACGATAC CAACGCGATG GATGGCAATG GTCATGGTAC GCACTTAGCA GGCACAGCCG CCGCTGAAAC CAACAATAGC ACTGGTGGCG CAGGGTTATG CCCCAATTGT CGCTTGATGC CAATCCGTGT TTTCAACAAT AATGGCAGTG GTACCCTGGC TGCTGTGGCC CAAGGCATTA CTTTTGCTGC CAACAACGGA GCCAAAGTGA TCAACTTAGG CTTGGGTGGC AGTGCCTCAA CAACCCTGCA AAATGCAGTT AATTACGCGT GGAACAAAGG TGCCTTCTTG ACCTGTGCGG TTGGTGGCAG TAACTCGGGC ACGCCAACCT ATCCAGCAGC CTACCCCAAC TGCTTCCCCG TGGCGGCTAG CGGCAAAACC GATATTAAAA CACCTTCCTC AGGCTATGGC ACATGGGTCA AAGTGGCGGC TCCCGGGGCA AGCATCTATT CAACCTGGCT CAACGGCGGC TATACCACAA TTAGCGGTAC CTCAACCGCT ACCGCCCATG TTTCAGGTTT AGCAGGCTTG TTGGCCTCGC AAAACCGTAC CAATGCCCAA ATTCGCGATC GCATTTGCGC CACCGCTGAT CCAATTGCGG GCACTGGAAC TTACTGGTCA TGTGGCCGGA TTAATGTTTA TGCCGCTGTG CAATAA
|
Protein sequence | MKRLWLSTLV VGCLLGLATP TNTAGQTRLK EKTPEQLAEA EAPAVAGQYL IKFKPALNTA SRSTTLKNLG ADHLQHLASL DLELIEFAPL KQNATPEQTE RVLAELKNHP AIEYVEPNYL YAPLYTPNDP GLGQQWAWGV IKAYDGWNIT QGSSSVIIAI VDTGIQTNHP DLDAKIVAGY DFVDNDTNAM DGNGHGTHLA GTAAAETNNS TGGAGLCPNC RLMPIRVFNN NGSGTLAAVA QGITFAANNG AKVINLGLGG SASTTLQNAV NYAWNKGAFL TCAVGGSNSG TPTYPAAYPN CFPVAASGKT DIKTPSSGYG TWVKVAAPGA SIYSTWLNGG YTTISGTSTA TAHVSGLAGL LASQNRTNAQ IRDRICATAD PIAGTGTYWS CGRINVYAAV Q
|
| |