Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5249 |
Symbol | |
ID | 5737207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 19030 |
End bp | 20289 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641282413 |
Product | peptidase M23B |
Protein accession | YP_001548004 |
Protein GI | 159901759 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.973521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAGC GAGCCATTGG ATCAAGCATC CTTGTGGCAA GCCTGATTGT TTTGTTGGCA TGTGCGCCGA CCTACGGCGA CGACGACCAG CCGCCAACGA TTGTGGTCAA CTGGCAATGC CCCACGCCCT CACCGCTGCC AACGATCCAA TCGGGGGTAT TGCCTACGCC GGAACCGCCG ATCAACGCGA CCTACGTGCC AGGGCAGGAG CCAACTGCCG AAGCGATCTA CACGACCCCG CTGCCCACGG CGACCCCGTA TGTGCGAACC GGCAGTGATT ACTATGTCAA CCAGCGGATT CAAATCAATC GCTATACCTT ACGCGTGACC AGCTACCGGA CGCAGCCTGC GAGCAACGGC AATGCTTACC ATCTGGTGAC GCTGGCCTTA GAAAACCCCA CCCAAACGCA ATGGCCGCTG TATCTGGACT TCTCGCAGCT GCGGGCGATC AAAGGAACGG ACGGACGCAT GATCCAAGCG ACGTGGTATC CGAGCGAGGA GGCCGCGCAG CAGTTGGGCA TTAGCCCCGC CAAAGACCCG CCGCTTGAGG AAGTTGGCGA CACGTTGGTG GGTGGCTATC CCATCGGCAC GAGTAGCCGC ACGCTGGTGT TTGAAGCGCC CGCAGGCGAC GCGCAGGCGT GGGGAATTAC GTTGACCAAC GAGGACACCA GCCGCGATGA CGGCGCGGGC AGTGGCCAAG TCTGGGTATT ACTGCGCAGC GACCCCAACT GTACCACGGG CGTAGGCGGC GGAGCCAGTG AGGGCGGCAC GATTCCGACC GCAGGCACAC CGAGTACCGG ATCAGGCCGT TGGCCTGTGC CGCTGGATAC GCCAATTACC CGTGCTTTCG GCTGCCACGG GTTTTTCACG GGGACGCGCG GCCCCTGTGG CGGCGCGACC CCGTGGTGGC ATGATGGCAT CGACTTCGCC AAGCCCGAAG GCACGCCGCT GTATGCGACC CGTGACATGA CGGTGCTGTT TGCCGGACGC GACACCTCGA CGATTGATTG TTCGGGGATC AGCGGGTCGC GTTTCCCCCA TTTCGGCTTT GGAAATTATG TCAAAACCCA AGATAGCCTC GGTTTTTCGT ACTGGTATGG GCACGTTTCG GCGTGGTCAG TCAGTGCAGG CCAAACCGTG CGGGCAGGCC AGCAGATCGC GTCGATGGGA TCAACGGGCT GTTCCACCGG ATCACACCTG CACTTTCGGG TGCGGCTGAA TGGGTTGGAC AGAAATCCTT TAGACGTTAT TTCGAAATAA
|
Protein sequence | MNERAIGSSI LVASLIVLLA CAPTYGDDDQ PPTIVVNWQC PTPSPLPTIQ SGVLPTPEPP INATYVPGQE PTAEAIYTTP LPTATPYVRT GSDYYVNQRI QINRYTLRVT SYRTQPASNG NAYHLVTLAL ENPTQTQWPL YLDFSQLRAI KGTDGRMIQA TWYPSEEAAQ QLGISPAKDP PLEEVGDTLV GGYPIGTSSR TLVFEAPAGD AQAWGITLTN EDTSRDDGAG SGQVWVLLRS DPNCTTGVGG GASEGGTIPT AGTPSTGSGR WPVPLDTPIT RAFGCHGFFT GTRGPCGGAT PWWHDGIDFA KPEGTPLYAT RDMTVLFAGR DTSTIDCSGI SGSRFPHFGF GNYVKTQDSL GFSYWYGHVS AWSVSAGQTV RAGQQIASMG STGCSTGSHL HFRVRLNGLD RNPLDVISK
|
| |