Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0613 |
Symbol | |
ID | 5732511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 705343 |
End bp | 706428 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277740 |
Product | peptidase M50 |
Protein accession | YP_001543389 |
Protein GI | 159897142 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.190222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGAGCA TGCGCATCGG TAATGTCAGT GGTATCCCGA TTAAATTGCA TCTTAGCTGG CTGATTTTGG CTGGTCTGAG CATCTATAGC TTTGAGCAGA TGTTTACGCT CAATGGGCAA TCGGGCTTAC CGCTAGCCTT GCTTGGCACA TTACTGTTGT TGTGCTCAAT TGTGCTGCAT GAAGTTGGCC ATGCCCTAAT GGCTCGGCGG TTAGGCTTGC GAGTAACAGC AATCACGCTG TTTTTAACTG GCGGCCTAAC CGAGCTGGCC GATGATGTTG ATTTGCCGCA AAGCGAATTT AAAATTGCCT TGGTTGGCCC ATTAGTCAAT GTTGGCTTGG CAATTGTGGC GTTCGTTGGG GCGTGGTTTT CCCAAGGGGT TTGGGCTAGT TTTTGGGCGA CCTTGGCGAT TGTCAATGGT TTGTTGGCAG TGTTTAATTT GCTGCCCTGT CATCCACTCG ATGGCGGGCG TGTGTTACGC TCGATCTTCT GGTTTTTGAA TGATGATTTG TTGCGGGGCA CGCTCCAAGC CAGCATGGTT GGGCGCTATC TTGGCAACGG CTTGATGATC ATCGGGCTAG TAGCCTTGTT TAGTAATGCC TTGACTGGCA GTTTGCTCTT GCTGATGGGC TGGATGACCA ATCGGGCTGC GGTCAGCAAT TTTGTGCAAA CCACCCTAAA CTACACCCTC AGCCGAGCTT TGGTTGGCGA AGTGATGACC CGTAGTTTTC GGACAGTTTC GCCGCATTTA ACCCTCGATT TATTTGCTGG CCAATATTTG CTGGGCCAAG CCGAGCCTGC TTTTCCGGTC GTACACTCCG AACGTTTGAT CGGCATGATC AGCGTGCAAC ATCTGTATCG CTATGCGATG GGCGAATGGC GTAGCGTCTC GGTTGGCGAT GCCATGACTC CCAAGGCTGA TTTGCCACGG CTCAATGTTG GCGATAGTAT GCAAACTGCC TACTACACGA TGCTTGGTCA GCGGTACGAT AGCCTACCCG TGACCGATGG CGATGCGGTG GTGGGGATTG TGCGCCATCG TGATGTGGTA GCGTTTGTGC AAAGCGCCCT CAAAATCAAC GTTTAA
|
Protein sequence | MWSMRIGNVS GIPIKLHLSW LILAGLSIYS FEQMFTLNGQ SGLPLALLGT LLLLCSIVLH EVGHALMARR LGLRVTAITL FLTGGLTELA DDVDLPQSEF KIALVGPLVN VGLAIVAFVG AWFSQGVWAS FWATLAIVNG LLAVFNLLPC HPLDGGRVLR SIFWFLNDDL LRGTLQASMV GRYLGNGLMI IGLVALFSNA LTGSLLLLMG WMTNRAAVSN FVQTTLNYTL SRALVGEVMT RSFRTVSPHL TLDLFAGQYL LGQAEPAFPV VHSERLIGMI SVQHLYRYAM GEWRSVSVGD AMTPKADLPR LNVGDSMQTA YYTMLGQRYD SLPVTDGDAV VGIVRHRDVV AFVQSALKIN V
|
| |