Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4948 |
Symbol | |
ID | 5736784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6275220 |
End bp | 6276344 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282115 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001547706 |
Protein GI | 159901459 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.809867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTC GTGATCGTTT GGGTTGGATG CTGAGTGGTC TATTGCTGGG CATGCTATTG ATGGTTAGTT GCGATGTGGT CAATCAAGCT TCTACGGCTC AGCCAAGTGT GGTTGATGCT GCCGCTGTGC CTTCAACTGG ACCATTGGCC ACCGCTGCTG CCCAAATGCC TGTTAATCAG GCTGGCGTTG ATGCCTATAG CAATGTGATT CGGGCCGTTT ACAATCGCGG TAATCCTTCG GTGGTGCGAA TTGACGTGCA AAGTGAGCAA GGCGAATCGC TCGGAACAGG TTTTGTGATC GATAAACAGG GCCATATTGT CACCAACAAT CACGTTGTTG GCAGCAGCCG CAGCGTGTTA GTCAATTTTA TCGATGGTGA TGCAGGGATC GCCGATGTGA TTGGCGTTGA TAGCGATTCA GATTTGGCAG TAATTAAAAT GCGTAATCCT GATCCTGCTA TTCTGATTCC TGTTGAGTTT GGCGATTCGG CGGCGGTGCA AGTTGGCGAT GTGGTGGTAG CGATTGGGAA TCCCTATGGT GAAAATCGTA CCGCGACTGC GGGGATTATT AGTGCGATTC GTGGAGCCAA GAATGAGGGT GGCGGCAGTA CCTTTTCAAT TCCTGGGGTG TTGCAAACCG ATGCGGCGAT TAACCCAGGC AATTCGGGCG GGCCATTGTT CAACAGCCAA GGCCAAGTAA TTGGGGTCAA TACCTTTATT CTCGACCCAT CGGGGCGGGG CGCGAATATT GGCTTGGGTT TTGCAGTGCC GATTAATTTG GTTAAGCTGG TGGCCCCAGC GATTATTCGC GATGGCAGCT ATACGCATCC ATTCTTTGGC GCGGCGGTAA GTAGCGTTGA TAGCTATTTT GCTGAAGTTA ATAATTTACC AAGCAAAGGC ATTATTATTA CCCAACTCTA CAATGGCCCT GCTGCCGAGG CTGGCTTGCA AGTGGGCGAT GTGATTGTCT CGGTTAATGG TGAGCCAATG CTTGAAGCTG GCGATCTGAT CACGCTCTTA GAATTAACCA CCCAACCAGG TGATCGGATG ACGGTTACGG TGGCCGAGGG CAATGGCCGC ACCCGTGATG TGCAAGTGCT GGTCGGGGCA CGTCCAGGTC GCTAA
|
Protein sequence | MQLRDRLGWM LSGLLLGMLL MVSCDVVNQA STAQPSVVDA AAVPSTGPLA TAAAQMPVNQ AGVDAYSNVI RAVYNRGNPS VVRIDVQSEQ GESLGTGFVI DKQGHIVTNN HVVGSSRSVL VNFIDGDAGI ADVIGVDSDS DLAVIKMRNP DPAILIPVEF GDSAAVQVGD VVVAIGNPYG ENRTATAGII SAIRGAKNEG GGSTFSIPGV LQTDAAINPG NSGGPLFNSQ GQVIGVNTFI LDPSGRGANI GLGFAVPINL VKLVAPAIIR DGSYTHPFFG AAVSSVDSYF AEVNNLPSKG IIITQLYNGP AAEAGLQVGD VIVSVNGEPM LEAGDLITLL ELTTQPGDRM TVTVAEGNGR TRDVQVLVGA RPGR
|
| |