Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4660 |
Symbol | |
ID | 5736507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5956671 |
End bp | 5958155 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281824 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_001547419 |
Protein GI | 159901172 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.262908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGATG CAGGTAGCGC TGAAAAACGC AGCGATTCGG AAAGCCCCAA GCCACCACAA GAGATTGTGA GCGTGACCCA TGGGCGCGTG AAAATCAAGG GCAACGATGT GCCTTACACC GCCACCGCCG GAACAATCGT GCTCTACGAA GACGATCCGG AGTTTAAGCA AGCGCCCAAG GCCAAAGCGA CGGTATTTTA TGTAGCCTAT ACCCGCAGCG ATGTTGATGA TCAAACCACC CGCCCAATCA CCTTTTCGTT CAACGGCGGG CCAGGTTCGG CCTCAGTTTG GATGCACCTT GGCTTGTTGG GGCCAAAACG GGTTTTGATG GCCGATGAAA CTGGCAATTT GCCAGCACCG CCATTTCGCT TGGTCGAAAA CGAATATTCC TTGCTCGATC AAAGCGATTT AGTTTTTATC GATCCAATTA GCACGGGCTA TAGTCGTGCG GCAACTGGCG AAAATCCCAA CCAATTCCAT CAATTTACCA AAGATATTGA ATCGATCAGC GATTTTATTC GGCTCTACAC CTCTCGGGCC AAGCGTTGGC TCTCGCCCAA ATATATTATT GGCGAAAGCT ATGGCACAAC TCGCGGCTCT GCCATCACCA ACTATCTGCA AAATCGCTAT GGCATGTATC TGAATGGGAT TATGCTGATC TCCTCGATTC TCGATTTTCA AACCGTCGAG ATGGACCCAG GCAACGATAT TGCCTATGTG GTGATTTTGC CAACCTACGC CGCGACTGCG TGGTATCACA ACAAGCTTGA TGCCAAATTA CAATTGAGTT TGAGTGATAC CTTGGCCGAA GTTGAGGCCT TTGCTGCTGG CGAATATGCT ACGGCATTGT TGCAAGGCGA TAGTTTGGCC GAGGGCAAAC GCCGCTCGGT TGTGCGTAAA TTGGCTCGCT ATACCGGTTT GAGCGAACGT TTTATCGATC ACAGTAATCT ACGGATCGAC TTGATGCGGT TTACTAAGGA ATTGCTGCGC GATCAGCAAC GCACGGTTGG CCGCTTGGAT AGCCGCTTCG TGGGCATCGA CCGCGATCCA ACCCGTGAAG CCTTTGAGTA CGACCCAAGT TATGCGGTAA TTCATGGACC ATACAGCGCC ACCTTCAACG ATTATGTGCG GCGCGAACTC AAATTCGAGA GCGACGAACC CTATGAAATT CTAACTTCAA AAGTTCGGCC TTGGAAATAC GATAAGCATG AAAATCAATA TGTGAGCGTC ACCGAAGCGT TGCGCTCGGC CATCTCGCAA AATCCCTATC TCAAGGTGTT TGTGGCCAGC GGCTTCTTCG ATTTCGCCAC ACCCTACTAT GCCACGTTGC ACACCTTCAA CCACCTTGGG CTTGACCAAA CCCTACGCAA TAACATCGTA ATCAAGCATT ACGAGGCTGG GCATATGATG TATACCCATT TGCCGTCGCT GGCTGAGCTA AAAAGCGACC TCGAAGCCTT TATCAGCCAA ACCAAAAACG TCTAA
|
Protein sequence | MADAGSAEKR SDSESPKPPQ EIVSVTHGRV KIKGNDVPYT ATAGTIVLYE DDPEFKQAPK AKATVFYVAY TRSDVDDQTT RPITFSFNGG PGSASVWMHL GLLGPKRVLM ADETGNLPAP PFRLVENEYS LLDQSDLVFI DPISTGYSRA ATGENPNQFH QFTKDIESIS DFIRLYTSRA KRWLSPKYII GESYGTTRGS AITNYLQNRY GMYLNGIMLI SSILDFQTVE MDPGNDIAYV VILPTYAATA WYHNKLDAKL QLSLSDTLAE VEAFAAGEYA TALLQGDSLA EGKRRSVVRK LARYTGLSER FIDHSNLRID LMRFTKELLR DQQRTVGRLD SRFVGIDRDP TREAFEYDPS YAVIHGPYSA TFNDYVRREL KFESDEPYEI LTSKVRPWKY DKHENQYVSV TEALRSAISQ NPYLKVFVAS GFFDFATPYY ATLHTFNHLG LDQTLRNNIV IKHYEAGHMM YTHLPSLAEL KSDLEAFISQ TKNV
|
| |