Gene Information
Locus tag | Haur_5027 |
Symbol | |
ID | 5736986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 36140 |
End bp | 37066 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641282194 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001547785 |
Protein GI | 159901539 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3591] V8-like Glu-specific endopeptidase |
TIGRFAM ID | |
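
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 36140
END_BP = 37066


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (927 bp) and Protein Length (308 aa) rows.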
Plasmid Coverage information
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence
Gene sequence | ATGTCCATCC CTCATTCATT GTTTCACGTA CGTCGTTTGA TTGTTCTCTT GGTTCTGCTG ATGATGATTA TTGGACAAAA ACCTTCGCAT GCGCAGGACA ATGCTCCTGC CCCACATGAT CCGCATACGC CGGTTACCAA TACTGGATCG CTGCCGCCTA AATTCGCCCT AGCCCCGGAT GCCGTCATTT CCCCCTCAAC AGGCAGCGAA CTCCCAAGCA CCGACGAGCC ACAAGAGGAG GTCAGCATCA ACAGCATCAT CGGTCCCGAT TCGCGCAAAC GAATCCTCAA TACGCTCCCT TATCCATATG GGACGATCGT ACACCTCTAC GTATCATACC CTCTTGCTGA GGGAGAATGC AGCGGGGTTC TGATCAGCCC CGACACCGTC CTGACCGCCG GCCATTGCGT CTTTACAAAG GAGCATGGTG GCTGGGCGGA TTTCATAATA GCTTCGCCGG GACGAAATGG TAGTAATACC TTTCCCCATC CTCCTTGTCC CGATAAACAG CTGTTTAGCA CGACTGGCTG GATACACGAC CTTAATCCGA GTTACGACTA TGGTGTGATT AAGTTGACTT GCTCATATAC TGCTACAGGA TGGATGGGCG TGAGGGCGGC TCCTGATGCG GGACTTAGGG GACAAACCAC AATCCTGACT GGATATCCGA GCGATAAACC GTTGGGTACG ATGTGGAACT CACAGGATAG TGTGCGGAAC TACAGTTCCA GTCAGGTATT TTATCAGAAC GATGCCACCC AAGGACAAAG TGGTTCGCCA GTATGGAATA GGAACGACAT CACCTGTAAT CCGTGTGTTT TCGCCATACA TACGGATGGA GTGAGTCCAG GTACTGGGGG CAACAATGGA GGCGTGAGGA TAGATGCCGA AATTATGGAG AACCTTTGGA GTTGGATTGC TCCATAA
Protein sequence | MSIPHSLFHV RRLIVLLVLL MMIIGQKPSH AQDNAPAPHD PHTPVTNTGS LPPKFALAPD AVISPSTGSE LPSTDEPQEE VSINSIIGPD SRKRILNTLP YPYGTIVHLY VSYPLAEGEC SGVLISPDTV LTAGHCVFTK EHGGWADFII ASPGRNGSNT FPHPPCPDKQ LFSTTGWIHD LNPSYDYGVI KLTCSYTATG WMGVRAAPDA GLRGQTTILT GYPSDKPLGT MWNSQDSVRN YSSSQVFYQN DATQGQSGSP VWNRNDITCN PCVFAIHTDG VSPGTGGNNG GVRIDAEIME NLWSWIAP
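
The sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC percentage, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions, not the method used to build this record: the helper names are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
# Standard codon table built in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing partial codon is ignored.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGTCCATCC CTCATTCATT GTTTCACGTA"))
```

Applied to the first three blocks of the gene sequence, `translate` returns `MSIPHSLFHV`, which matches the first block of the protein sequence.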