Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2152 |
Symbol | |
ID | 5734025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2710343 |
End bp | 2711413 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279293 |
Product | N-acetylmuramoyl-L-alanine amidase |
Protein accession | YP_001544920 |
Protein GI | 159898673 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3409] Putative peptidoglycan-binding domain-containing protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000936807 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC TTTCGCGCCG TACATTTGGC AAACTATCGT TGGGTGTGGC AATGAGCGTG GTGGTGGGTG GTACTGAATT ACGCCTCTTG CGGCCTGCCT ATGCCGCTGT TGCAACTCCC GCAATCGATT CAACTACTGC TTGGGGTGCT GCCGCTGCCA AAGAGCCAAT CAATGTGCTC AATCAAAAGC CAATCGGGAT TGTTGTGCAC CACACGACGA ATCCTAACAC CAACGATTTT ACCCGTAATA AGGCTTGGCA AGTGGCACGG CAAATTCAGC AAAGCCATTT CAATCGCGGC TGGATCGATA CAGGCCAACA ATTTACGATC AGCCGTGGTG GCTGGATTAT GGAAGGTCGC CATCAAAGCT TGAGCATTTT GCAGGGTGGC ACGAAGCATG TTCAAGGCGC ACACGTCGAT GGCCATAATG AAACCCATAT TGGCATTGAA TGCGAAGGCT TGTATATGAA TGTTACGCCC AGCCTGCCTT TGTGGAATAA ACTGGTGGCG TTGATTGCCT ATATTTGCCA GCAATATGGC CTAACCGCCA ACGCTATTGT CGGCCATCGC GATTTGGATA GCACCAGTTG CCCAGGCGAT ACGCTCTATA GCTTGCTGCC ACAGTTGCGT ACCGCTGTTG ATACGACGCT CAGCGGTGGT GCAGTTGGCC GGATGTGGCC AATTTTGCGG CGTAACACCC CCGCGACTGG CCTCGCCAAA ACCATGCAAT ATCTTTTGCG GGCACGCGGC GCGACGATTA CCGCCGATGG AGCCTTTGGC CCAGGCACCG AAACTGCCGT CAAGAGTTTC CAAACTGCCA ATGGCTTAAC CTCAGATGGA GTTGTTGGGG CCGCAACGTG GGAAAAATTA ATTATGACCT TGCGCTCAGG CGATACAGGC GAGGCAGTCA AGGCCTTACA AAATCAACTG ACGGTGCAAA GTTACCCAAC GACAATTGAT GGCAGCTTTG GCACAGGCGT AAATACCCTT GTTCGCGCCT TCCAAACCAA TCGCCAACTG ACGGTTGATG GCGTGGTCGG CTTGAACAGC TGGAACAACT TAGCGATGTA G
|
Protein sequence | MKKLSRRTFG KLSLGVAMSV VVGGTELRLL RPAYAAVATP AIDSTTAWGA AAAKEPINVL NQKPIGIVVH HTTNPNTNDF TRNKAWQVAR QIQQSHFNRG WIDTGQQFTI SRGGWIMEGR HQSLSILQGG TKHVQGAHVD GHNETHIGIE CEGLYMNVTP SLPLWNKLVA LIAYICQQYG LTANAIVGHR DLDSTSCPGD TLYSLLPQLR TAVDTTLSGG AVGRMWPILR RNTPATGLAK TMQYLLRARG ATITADGAFG PGTETAVKSF QTANGLTSDG VVGAATWEKL IMTLRSGDTG EAVKALQNQL TVQSYPTTID GSFGTGVNTL VRAFQTNRQL TVDGVVGLNS WNNLAM
|
| |