Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3812 |
Symbol | |
ID | 5735676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4784665 |
End bp | 4785999 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280964 |
Product | peptidase S41 |
Protein accession | YP_001546576 |
Protein GI | 159900329 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0205907 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTATC GTTGGTTGCT TTTGCTAGGA ATTGGATTGC TCGCTAGCTG TACTGCACAA TTGCCTTGGG CGGCCACACC AACTGTGCCA CCAACCACGG CTCCCCTCGC CTTGGCAACT CCAACCTTTT TGCCCAGCCC AACCGCGCAG CCAAGCCTTG AGCCTACCCC AACCGTTGAG CTAGCGCAAG CTACGCCAAC CGCGACAACC CCAATGAGTC CCGCCGAACG GCTAGCTTTA TTCAATGATG TTTGGCAAAC GGTCAATGAA CATTATCTGT ATCCTGATTT TAATGGCGTG GATTGGGCCG CCGTGCGTGC TGAAATCGAG CCGCAAGTGC AGGCTGCGCC CGATGATGAA ACGCTCTACA CAATTCTCGA AGGCATGGTC GCCAAACTCG ATGATCAACA TTCGCGCTTT GCCCGACCAG TCGAAGCAGT TTATGAAGAT GCCGTTGCCA GCGGAACCGA TAGCTATGTT GGCATCGGTG TTTTGACAAT TCACGAAGAA AATGCCGCTT TTATTACCTT GGTTTTCCCT GATAGCCCAG CCCAAGCGGC GGGCTTGATG CGCGGCGATC GCATTACCGC CGTTGAAGGC CAGCCGTTTA CCAATGCCGA CCAAATTCGT GGCCCCGAAG GCAGCCAAGT GCGGCTGACC ATTCAAACAC CGCAAGCTGA CCCACGCGAG TTGCTGATAA CGCGGCGGGC AGTAGTGGGC AAAATTACGC CATCAGGCCG CCGCTTGCCC AATGCTCCAA CCGTTGGCTA TTTGCTGATT CCCAGCTTGT GGGCCGACGA TATGCATACC CAAGTTGTCA GCGAACTCAG CAAATTGGTC GCCGATCCTC AGCCGCTTGA TGGTTTGATT TTGGATCTGC GATCCAACGG CGGTGGCTGG CGTAGCGTGC TTGAAGGCAT TTTGGGTCAA TTTGTCAGCG GCGAGGTTGG CAATTTCTAT AGTCAGGAAA AATTGTATCC ATTAACTGTT AAACCTGGCC TGTTGTACGA GCAGCTGAAG CAAGTGCCGC TAGCGGTGCT CATCGACAAA GATAGTGCTT CGTATGCTGA GGTTTTGGCT GGAACGCTGC AATTTAATGG GGCTTTGGTG CTGGGCCAAG CCAGCCAAGG CAATACCGAA ACGATTTTTC AATATAATTT TGAGGATGGC TCACGCTTGT GGGTGGCCCA AGAGGGCTTT AAATTGCCTG ATGGCAGTAA TTTTGAAACC AAAGGTGTGC AGCCAAATAT TGTGGTCGAA GACGATTGGA CCCAATACAC GATTCCGAAT GATCCGGCGG TATTGCAGGC GATTGTTTCA TTTAGCGAGC GCTAG
|
Protein sequence | MRYRWLLLLG IGLLASCTAQ LPWAATPTVP PTTAPLALAT PTFLPSPTAQ PSLEPTPTVE LAQATPTATT PMSPAERLAL FNDVWQTVNE HYLYPDFNGV DWAAVRAEIE PQVQAAPDDE TLYTILEGMV AKLDDQHSRF ARPVEAVYED AVASGTDSYV GIGVLTIHEE NAAFITLVFP DSPAQAAGLM RGDRITAVEG QPFTNADQIR GPEGSQVRLT IQTPQADPRE LLITRRAVVG KITPSGRRLP NAPTVGYLLI PSLWADDMHT QVVSELSKLV ADPQPLDGLI LDLRSNGGGW RSVLEGILGQ FVSGEVGNFY SQEKLYPLTV KPGLLYEQLK QVPLAVLIDK DSASYAEVLA GTLQFNGALV LGQASQGNTE TIFQYNFEDG SRLWVAQEGF KLPDGSNFET KGVQPNIVVE DDWTQYTIPN DPAVLQAIVS FSER
|
| |