Gene Haur_2152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2152 
Symbol 
ID5734025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2710343 
End bp2711413 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID641279293 
ProductN-acetylmuramoyl-L-alanine amidase 
Protein accessionYP_001544920 
Protein GI159898673 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3409] Putative peptidoglycan-binding domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000936807 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAC TTTCGCGCCG TACATTTGGC AAACTATCGT TGGGTGTGGC AATGAGCGTG 
GTGGTGGGTG GTACTGAATT ACGCCTCTTG CGGCCTGCCT ATGCCGCTGT TGCAACTCCC
GCAATCGATT CAACTACTGC TTGGGGTGCT GCCGCTGCCA AAGAGCCAAT CAATGTGCTC
AATCAAAAGC CAATCGGGAT TGTTGTGCAC CACACGACGA ATCCTAACAC CAACGATTTT
ACCCGTAATA AGGCTTGGCA AGTGGCACGG CAAATTCAGC AAAGCCATTT CAATCGCGGC
TGGATCGATA CAGGCCAACA ATTTACGATC AGCCGTGGTG GCTGGATTAT GGAAGGTCGC
CATCAAAGCT TGAGCATTTT GCAGGGTGGC ACGAAGCATG TTCAAGGCGC ACACGTCGAT
GGCCATAATG AAACCCATAT TGGCATTGAA TGCGAAGGCT TGTATATGAA TGTTACGCCC
AGCCTGCCTT TGTGGAATAA ACTGGTGGCG TTGATTGCCT ATATTTGCCA GCAATATGGC
CTAACCGCCA ACGCTATTGT CGGCCATCGC GATTTGGATA GCACCAGTTG CCCAGGCGAT
ACGCTCTATA GCTTGCTGCC ACAGTTGCGT ACCGCTGTTG ATACGACGCT CAGCGGTGGT
GCAGTTGGCC GGATGTGGCC AATTTTGCGG CGTAACACCC CCGCGACTGG CCTCGCCAAA
ACCATGCAAT ATCTTTTGCG GGCACGCGGC GCGACGATTA CCGCCGATGG AGCCTTTGGC
CCAGGCACCG AAACTGCCGT CAAGAGTTTC CAAACTGCCA ATGGCTTAAC CTCAGATGGA
GTTGTTGGGG CCGCAACGTG GGAAAAATTA ATTATGACCT TGCGCTCAGG CGATACAGGC
GAGGCAGTCA AGGCCTTACA AAATCAACTG ACGGTGCAAA GTTACCCAAC GACAATTGAT
GGCAGCTTTG GCACAGGCGT AAATACCCTT GTTCGCGCCT TCCAAACCAA TCGCCAACTG
ACGGTTGATG GCGTGGTCGG CTTGAACAGC TGGAACAACT TAGCGATGTA G
 
Protein sequence
MKKLSRRTFG KLSLGVAMSV VVGGTELRLL RPAYAAVATP AIDSTTAWGA AAAKEPINVL 
NQKPIGIVVH HTTNPNTNDF TRNKAWQVAR QIQQSHFNRG WIDTGQQFTI SRGGWIMEGR
HQSLSILQGG TKHVQGAHVD GHNETHIGIE CEGLYMNVTP SLPLWNKLVA LIAYICQQYG
LTANAIVGHR DLDSTSCPGD TLYSLLPQLR TAVDTTLSGG AVGRMWPILR RNTPATGLAK
TMQYLLRARG ATITADGAFG PGTETAVKSF QTANGLTSDG VVGAATWEKL IMTLRSGDTG
EAVKALQNQL TVQSYPTTID GSFGTGVNTL VRAFQTNRQL TVDGVVGLNS WNNLAM