Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4725 |
Symbol | |
ID | 5736569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6036071 |
End bp | 6037156 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281890 |
Product | peptidase M24 |
Protein accession | YP_001547484 |
Protein GI | 159901237 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.28239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTCAAG AACGGCTCGC AGCGTTGCGG ATGCTTTTTG AAGCAGCCCA AATCGATGGA TTGTTGGTCG CTAATAGCCA AAATCGGCGC TATCTGAGCG GGTTTACTGG CTCGGCGGGC TTGTTGATCA TTGATGCTCA ACGAGCATTA TTAATCAGCG ATGGCCGCTA TACCGTGCAA GCCGCCCAAG AGGCCAGCCA ATTTGAAACG ATCACCCGCA CGCTTGATGA AAGCTTGTAT AGCTGTGTTG GCCGCCATAT TGCGCCGATC AAACGCTTGG GCTTCGAGCC AGCAACCCTC AGCGTTGCCG ATTACAATGC CTTGCGCCAA GCCTTGCCTG CTGATGTAAC CTTGGTTGCC ATCGGGGCAT TGACCGAGCA ACTCCGCGCG ATCAAAAGCG ACGAAGAAGT TGCGGCCTTG CGTCAAGCAA TTAACATCAC CGACCAAGCC TTAGCGGCAG TCAAGCCAAT GTTGCGCCCA AGCATGCTCG AACGCGAAGT CGCTTGGGAA TTGCACAAGG CAATTGTTGA GCATGGCGGC GATGGTTTAG CTTTTGAAAT TATCGTGGGT GCTGGCTTAA ATAGTGCTTT GCCCCATTAT CACGCTGGTA ACGCCCCGCT GGGCCAAGGC CAGCCGATTG TGGTCGATTT TGGGGCGCTC TATGCTGGCT ATCATGGCGA TATGACCCGC ACCTTGGTGC TCGGCCAGCC CGATGCCAAA TTTGATGAAA TTTATGGCAT TGTGCGCCAC GCGCTTGCGG ATGCAACCAA CGGCATCACC GCCAATACCA CTGGCAAAGA AGCCGATGCC TTGGCTCGCG ATGTGATCGA AGCCTCAGGC TATGGCGAAT ATTTTAGCCA TGGCACAGGC CACGGGGTTG GCCTGCAAAT TCATGAAGAG CCACGGCTCA GCCGCGTTCA CAACGATTTG CTGCCAGTTG GCTCAATTTT TAGCATCGAG CCTGGCATTT ATTTGCCCGA TTGGGGCGGC GTGCGGCTCG AAAACTTGGT TTTACTCAAT GCCAATGGTG TTGAAACGCT TACACAATCG CCACTTGACC CGATCATTGT GATCGAGCAA GCCTAA
|
Protein sequence | MSQERLAALR MLFEAAQIDG LLVANSQNRR YLSGFTGSAG LLIIDAQRAL LISDGRYTVQ AAQEASQFET ITRTLDESLY SCVGRHIAPI KRLGFEPATL SVADYNALRQ ALPADVTLVA IGALTEQLRA IKSDEEVAAL RQAINITDQA LAAVKPMLRP SMLEREVAWE LHKAIVEHGG DGLAFEIIVG AGLNSALPHY HAGNAPLGQG QPIVVDFGAL YAGYHGDMTR TLVLGQPDAK FDEIYGIVRH ALADATNGIT ANTTGKEADA LARDVIEASG YGEYFSHGTG HGVGLQIHEE PRLSRVHNDL LPVGSIFSIE PGIYLPDWGG VRLENLVLLN ANGVETLTQS PLDPIIVIEQ A
|
| |