Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4026 |
Symbol | |
ID | 5735887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5139820 |
End bp | 5141439 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281176 |
Product | peptidase M23B |
Protein accession | YP_001546786 |
Protein GI | 159900539 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.435878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAGA AGCGAACTCC GCGGCGACGT GTTCAACGGG CACAAACGCA AAACAACGCC GAAGTTATTT TGCTACCAGC TCCTAACCAC ACCACTACCG CATTTCGCCA AAAATCTGAT ACTGGCCGGA TTGAGTCACT CAATCGTGGT GGCCGCTCTG CTCAACGACG GCGCAAGCAA GAACAACGCG CGGCCTCGAT TGCGCGATCT TCGCTAGCGC AAATCGCTGG CACACCTACT CGTGCCCCAC AAGCAATTTC TGAGCCAAAA CAACCACGGC GCTCATTCAA AGCCGCCGCT CAATCGGTGC GTCAGCGTTT GCAGCTTCAA CGCCGTTTGG TTGCACATAG TGTGATCCTG TTGGCCGCCT GTGTGGTGGC TTGGGGTCGC GGTTTTCCAC TTAATCCAGT TTCTGATAAT GCCACTAATC AAGCAATTCC CGAGGTTTTA GCTGCGGCCC AGTTGGTCGA TGAAGTTGAA GCAGGTAATT TCCAACGGGT TGTGATCCCA ATTTCAGCTA CTCAGCCCCA AAGTGTTGGT GGCGGCGAGC ATGAAGTTGT GGTCGATGTT GGCCCTGCTC TCATCAAACC AGCGCCAGAA CGTGGCCCGG CCTTTGTGGC AACCCACTTG GTTGCTAACG AAGAAACCAT CGCCGATATT GCTGCCAAAT ATACAATTAC TCCAGAATCG TTGATTGCAG CCAATAATTT GACTGGCCCA GTCGCGATCG GCGAAACCTT GCGAATTCCA CGGGTTTCGG GGGTTCCACA CACAGTCAGC GAAGGCGAAA CGATCACCGA TATCGCCGAA CGCTACAGTG TTGGTGCTGA TGTAATTATG ACCTTCCCAC CCAATGGCTT AGATCAGGGC CAAGCCTTGA TTGCTGGACG CGAAATTTTC GTGCCTGGAG CATCGTTGGC GGGAGTGCAA AATGTTTCAG TGCGCGGCGC AATCGATTCA ATCAATCAAA AATCACAAGC CGCCGCAATT GTGCTGGCTG ATCGCACCAA TTTGCGCGAA GGCCCAGGCA CGGCCTACGA GAAAATCGTC AAGGTCAATG CTGGCGAGCG TTTACAATTA ATCGCCAAAC ACGAAGTTTG GGTCAAAATT CGCCAAAGTG ATGGCGAAGT GGCCTGGGTT GCCCGCGAAG TTGTGAGCAT CCCCGAGGAA GTTTGGTCGG CCTTGGAAGA GACCGATAAC TTTCCGCCGC CGCCACCACC ACCACCAGTT TGGGTTTGGC CAACCTATGG CGATTTGACC TCAGGTTTTG GCTATCGCAA CTTTAGCGTT GGGCGGTTCC ATAACGGCAT CGATATTGCC AACCGCAAGG GCACGCCAAT TTCAGCTGCC CGCCGTGGCA CGGTGATCGA GGCTGGTTGG TGTAGCGGCT ATGGCTATTG TGTCAAAATC AGCCATGGCT CTGGGATGGT CACCGAATAC GGCCATATGA TGTCGAATCC AGTGGTTTCC GAAGGCCAAG AGGTCGAGGC AGGCCAATTG ATCGGCTATA TGGGCAGCAC CTATGATCGA GCTGGCGGTG GCTACTCAAC GGGGGTTCAC CTGCACTTCA CCATCAAGGT CGATGGTACC GCAGTTAATC CACTCAAGTA CCTACCCTAA
|
Protein sequence | MTEKRTPRRR VQRAQTQNNA EVILLPAPNH TTTAFRQKSD TGRIESLNRG GRSAQRRRKQ EQRAASIARS SLAQIAGTPT RAPQAISEPK QPRRSFKAAA QSVRQRLQLQ RRLVAHSVIL LAACVVAWGR GFPLNPVSDN ATNQAIPEVL AAAQLVDEVE AGNFQRVVIP ISATQPQSVG GGEHEVVVDV GPALIKPAPE RGPAFVATHL VANEETIADI AAKYTITPES LIAANNLTGP VAIGETLRIP RVSGVPHTVS EGETITDIAE RYSVGADVIM TFPPNGLDQG QALIAGREIF VPGASLAGVQ NVSVRGAIDS INQKSQAAAI VLADRTNLRE GPGTAYEKIV KVNAGERLQL IAKHEVWVKI RQSDGEVAWV AREVVSIPEE VWSALEETDN FPPPPPPPPV WVWPTYGDLT SGFGYRNFSV GRFHNGIDIA NRKGTPISAA RRGTVIEAGW CSGYGYCVKI SHGSGMVTEY GHMMSNPVVS EGQEVEAGQL IGYMGSTYDR AGGGYSTGVH LHFTIKVDGT AVNPLKYLP
|
| |