Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5185 |
Symbol | |
ID | 5737143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 267448 |
End bp | 269265 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641282349 |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_001547940 |
Protein GI | 159901694 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.193898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATTC GTTTCTTGGG AGCATTGGGC TTGCTCATGA TTGTGTTAGC ACTGCCCCGC AATAGCGTAG CGGAGCACCA GCGTTCATCG GCATCGCACG ATAGTGCGAT CACGATCAGC ACCACCGCGC CAGCAGTTCC TGGCCAGTTT ATTGTCCAAT TTCGCCCGAC TGCTTCACGG AACCAGCGTC AATCGACGCT AGCCGTACTG GGTGCATACG TCGTTCGCCG CATTGAGCCG CTTAACCTCG ATGTGTTGGA AATACCAGCA CTGACGAAGA CTTCAACATC AACGGCTATG GATCGAATCC TAGCGCATGC CCGTGACAAT CACGATATTG TATTTTTAGA GCCGAACTAT ATCTACACCA CAACCTATCT CCCGAACGAC TCCGCCTATC AAAACCAATG GGCTTGGAGT ATGGTGCGTG CACCACAAGC CTGGGATATA ACCAGAGGGA GTCCCAATGT CACTATTGCG GTGCTTGACA CCGGGATCGA TCGGGAACAC CCTGATCTCA AAGCGAAAGT TGTGAGCCAT GGTATCGATT TGGTGACTGC TGATGGGATC GCTCATGATG AGAATGGCCA TGGCACACAT GTCGCAGGCA CGGCGGCAGC GGCAACGAAC AATGCGCTAG GTGGCGCTGG GATGTGTCCG CTTTGCCAGG TACTACCAGT GCGCGTCTTG AATGCGTTTG GTTCAGGTAC ACTCGACGCT GTTGCCGAAG GGATCATCTA TGCTGCCGAC GCGGGTGTCC AAGTTATCAA CCTCAGTCTG GGCGGGCCGG GTTCTTTTGC GCTGCAACAC GCTGTCGATT ATGCGTGGAC TCATGGATCA TTTCTCGCAT GTGCCGCTGG CAATAATGGC ACTGCATCGC TCGATATGGC GTATCCTGCT GCATATGGTA ACTGCTTTGC CGTAGCGGCG ACCACGTCCA TGGATCAAAC GGCTGACTTT TCAAACTATG GCTCTTGGGT GGAAATGGGG GCACCAGGAG TTAATATCTA CTCTGGTTGG CTATATGGTG GCTATCATAC CATCAGTGGC ACGTCGATGG CAGCGCCCCA TGTCGCTGGA GTGGCGGGGT TGCTGGCTTC ATTCGGTTTA AGTAATAGTC AGATCCGCAG CCGACTCTGT ACATCCGCCG ACCGTACCGC AGCCACAGGG AGCGCCTATA CCTGTGGCCG TCTGAATGCG TGGCGGGCAG TTTCTGCCGC AATTCCGACG ACAACACCCA CATTGAGACC AACCACGACA AGGACTCCGA CACCAAGCGC CATTCCACAA CCTGGGGTGA CGGTAATGCC TAGTACGCCG CCTGCCCATG CCCCGACTGT AACAGCAGTG CCAACGGTGG CTGTTCCTGG CAATACGTTG ATTAACAGCG GATTTGAGGA TGGCAATGCC CCGTGGGAGC AGGTCTCGGC TTGGAATTAT GCGTTGATTA TTGATGGTCC TGCTCACACG GGACGACGCG GCGCACTCTT GTCGGGATTT ATTAACGGCG GAGATGCACT CTTCCAAACC GTTACGGTAC CTGCACATGG GAAACTAACC TACTATCTGC GGGTGACCTC CGATGATGCC CCAACCCAAC CTCGCGATTA TCTGCGAGTT AAGTTGTATA CGACCAACGG CATACCGATC GCGACCTTGC GCACGTGGAG CAACGCGACA CTGTGGAATA TCTGGATCGT TGATTCTGTG GACATGAACG CGTATGCGCA TCAGACCCTG CGTATTCAGT TTGAAACGAT CACTGATAGT CAATGGAACA CGTGGTTTGC GATTGATGAT ATCAGTTTGA CCCCTTAA
|
Protein sequence | MNIRFLGALG LLMIVLALPR NSVAEHQRSS ASHDSAITIS TTAPAVPGQF IVQFRPTASR NQRQSTLAVL GAYVVRRIEP LNLDVLEIPA LTKTSTSTAM DRILAHARDN HDIVFLEPNY IYTTTYLPND SAYQNQWAWS MVRAPQAWDI TRGSPNVTIA VLDTGIDREH PDLKAKVVSH GIDLVTADGI AHDENGHGTH VAGTAAAATN NALGGAGMCP LCQVLPVRVL NAFGSGTLDA VAEGIIYAAD AGVQVINLSL GGPGSFALQH AVDYAWTHGS FLACAAGNNG TASLDMAYPA AYGNCFAVAA TTSMDQTADF SNYGSWVEMG APGVNIYSGW LYGGYHTISG TSMAAPHVAG VAGLLASFGL SNSQIRSRLC TSADRTAATG SAYTCGRLNA WRAVSAAIPT TTPTLRPTTT RTPTPSAIPQ PGVTVMPSTP PAHAPTVTAV PTVAVPGNTL INSGFEDGNA PWEQVSAWNY ALIIDGPAHT GRRGALLSGF INGGDALFQT VTVPAHGKLT YYLRVTSDDA PTQPRDYLRV KLYTTNGIPI ATLRTWSNAT LWNIWIVDSV DMNAYAHQTL RIQFETITDS QWNTWFAIDD ISLTP
|
| |