Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3542 |
Symbol | |
ID | 5735401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4457652 |
End bp | 4459568 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280689 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001546306 |
Protein GI | 159900059 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0100576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGAA CCGTTCGCAT GGGCTTGCTA CTGTTGTTGG CTTTTGGATT AATGCCCTTG ATTCAACGCC CAGCGCAGGC AGCGACCCTC GATGATCAGA TTATTGCCGA TGAAATTCTG GTAACGTTTA ATGATGGCTA TCGTGTCAGT GCGAATGGGG CATTCTTTGA AGGTTCTAAG GCATTGAAAA ATGGCCTAGG CATTGGGCCA TTACTTTCGA CCCAAATGCT TGATGGGGCT GGTCAAGTGG CAATGTTGAA GCTCCCCGCA GGCGCAAATG GCGATGCTAA AATTGCCCAA TTGCTCAAAA ATCCAGCAGT GCGCTACGCC GAATACAACG CTGTGCGTGA GATTCTGGTC GATCCCAATG ATGAATATTA CTCAGAACAA TGGGGCTTGC CCAAAATTGG CGCTAATGCG GCTTGGGACA TGACCCAAGG CAACGGCTTG GTGATTGCCA TTATTGATAC TGGGGTTTCA CCAACCCACC CTGATTTGGC TGGCCATGTG TTGCCTGGCT ACAATGCCTT GCAAAATAAT AGCAATAGCC AAGATGATCA AGGCCACGGT ACGGCCATGG CTGGGATTGC TGCTGCCCTA ACCAATAATG GTCAAGGTGT AGCTGGGGTT TGTTGGAACT GCCAAATCCT GCCAGTTAAA GTGCTGAACA GTCGTGGTCA AGGCACAGCG GCAGATATTG TCGAAGGCAT GTATTGGGCT GCTGATAATG GCGCACGCAT CATTAGTATG AGTTTGGGTG GCCCACGGGG TACGCAAGCC GAGCAAGATG CAGTTAATTA TATTTATAGC AAAAATATTC CACTCTTTGC CTCATCTGGC AATTCGGGCG ATGAGGGCAA CCCACGCATG TATCCAGCCG CTTTTGATCA TGTAATTGCG GTTGGGGCAA CCACGACCCA AGATCGGGTT GCAAGTTTCT CATCGTATGG CGATTATGTG GATATTGCTG CTCCAGGCGT AAATATTGTG ACGACTGGAT GGGATGGTGG CGATACCTAT GAAATGGGCA GTGGCACATC GCCAGCTTGT CCGTTTGTGG CAGCAACTGC GGCCTTGGCG CTGAGTGTTT GGCCTGAGCT TACGGTCGAT CAAATTGAAA AATTGATCAC TGGCAGTGCT GTTGATATTT TGACTCCTGG TAAAGATGTC TATAGTGGGT TTGGCCGACT GGATACCTAC AAAACGGTGC AAAATGCGGT TTTGCGCACG ATTCCTGGCG AACCACAACC CCAACCACCA GCACCACCAG CGCCGCAACC ACCAACTCCA GAACCACAAC CAGGTAACCC TGCGTTTGTG CCTGTGGGAG CGCCCCCATT GCCAGCACCG GTCGGCGAAG TCTACTTCCC TGAAACTGGC CATAACTTGC GTGGCGAGTT CAAAAACTAC TGGGATCGTA ATGGTGGCTT GGCAGTCTTT GGCTTCCCAA TTAGCGAAGA ATTTACCGAA CAAACTCCTG AAGGTTCGTT TACGGTGCAA TACTTCGAAC GCCAACGCTT TGAATTTCAC CCTGAAAAAG CTGCACCCTA CAACGTGTTG CTGGGTCGCT TAGGCGATGC TGTGCTGCGG GATCGTGGCG ACGATTGGGC CAACTTCCCC AAAACTGGGC CAGAAAATGG CTGTCTCTAT TTCGATCAAA CCCAGCACAA AATTTGTGGC GAGTTCCGCA AATATTGGGA AACCAATGGG CTGAATGATC CTGCTTTGAA CAAATATGAT CGCAGCTTGC AATTGTTTGG CTTGCCATTA TCCGAGCCAA TGACCGAAAC CAATCGGGAT GGGGCAACCG TCACGACCCA ATGGTTTGAG CGCGGCCGCT TTGAGTATCA CGAAGGTCAA GGCGTGCTGT TAGGTTTGTT GGCCAAAGAA TATGCCAACA ATCGCAGTTG GCGCTAA
|
Protein sequence | MQRTVRMGLL LLLAFGLMPL IQRPAQAATL DDQIIADEIL VTFNDGYRVS ANGAFFEGSK ALKNGLGIGP LLSTQMLDGA GQVAMLKLPA GANGDAKIAQ LLKNPAVRYA EYNAVREILV DPNDEYYSEQ WGLPKIGANA AWDMTQGNGL VIAIIDTGVS PTHPDLAGHV LPGYNALQNN SNSQDDQGHG TAMAGIAAAL TNNGQGVAGV CWNCQILPVK VLNSRGQGTA ADIVEGMYWA ADNGARIISM SLGGPRGTQA EQDAVNYIYS KNIPLFASSG NSGDEGNPRM YPAAFDHVIA VGATTTQDRV ASFSSYGDYV DIAAPGVNIV TTGWDGGDTY EMGSGTSPAC PFVAATAALA LSVWPELTVD QIEKLITGSA VDILTPGKDV YSGFGRLDTY KTVQNAVLRT IPGEPQPQPP APPAPQPPTP EPQPGNPAFV PVGAPPLPAP VGEVYFPETG HNLRGEFKNY WDRNGGLAVF GFPISEEFTE QTPEGSFTVQ YFERQRFEFH PEKAAPYNVL LGRLGDAVLR DRGDDWANFP KTGPENGCLY FDQTQHKICG EFRKYWETNG LNDPALNKYD RSLQLFGLPL SEPMTETNRD GATVTTQWFE RGRFEYHEGQ GVLLGLLAKE YANNRSWR
|
| |