Gene Haur_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4026 
Symbol 
ID5735887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5139820 
End bp5141439 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content54% 
IMG OID641281176 
Productpeptidase M23B 
Protein accessionYP_001546786 
Protein GI159900539 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.435878 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAGA AGCGAACTCC GCGGCGACGT GTTCAACGGG CACAAACGCA AAACAACGCC 
GAAGTTATTT TGCTACCAGC TCCTAACCAC ACCACTACCG CATTTCGCCA AAAATCTGAT
ACTGGCCGGA TTGAGTCACT CAATCGTGGT GGCCGCTCTG CTCAACGACG GCGCAAGCAA
GAACAACGCG CGGCCTCGAT TGCGCGATCT TCGCTAGCGC AAATCGCTGG CACACCTACT
CGTGCCCCAC AAGCAATTTC TGAGCCAAAA CAACCACGGC GCTCATTCAA AGCCGCCGCT
CAATCGGTGC GTCAGCGTTT GCAGCTTCAA CGCCGTTTGG TTGCACATAG TGTGATCCTG
TTGGCCGCCT GTGTGGTGGC TTGGGGTCGC GGTTTTCCAC TTAATCCAGT TTCTGATAAT
GCCACTAATC AAGCAATTCC CGAGGTTTTA GCTGCGGCCC AGTTGGTCGA TGAAGTTGAA
GCAGGTAATT TCCAACGGGT TGTGATCCCA ATTTCAGCTA CTCAGCCCCA AAGTGTTGGT
GGCGGCGAGC ATGAAGTTGT GGTCGATGTT GGCCCTGCTC TCATCAAACC AGCGCCAGAA
CGTGGCCCGG CCTTTGTGGC AACCCACTTG GTTGCTAACG AAGAAACCAT CGCCGATATT
GCTGCCAAAT ATACAATTAC TCCAGAATCG TTGATTGCAG CCAATAATTT GACTGGCCCA
GTCGCGATCG GCGAAACCTT GCGAATTCCA CGGGTTTCGG GGGTTCCACA CACAGTCAGC
GAAGGCGAAA CGATCACCGA TATCGCCGAA CGCTACAGTG TTGGTGCTGA TGTAATTATG
ACCTTCCCAC CCAATGGCTT AGATCAGGGC CAAGCCTTGA TTGCTGGACG CGAAATTTTC
GTGCCTGGAG CATCGTTGGC GGGAGTGCAA AATGTTTCAG TGCGCGGCGC AATCGATTCA
ATCAATCAAA AATCACAAGC CGCCGCAATT GTGCTGGCTG ATCGCACCAA TTTGCGCGAA
GGCCCAGGCA CGGCCTACGA GAAAATCGTC AAGGTCAATG CTGGCGAGCG TTTACAATTA
ATCGCCAAAC ACGAAGTTTG GGTCAAAATT CGCCAAAGTG ATGGCGAAGT GGCCTGGGTT
GCCCGCGAAG TTGTGAGCAT CCCCGAGGAA GTTTGGTCGG CCTTGGAAGA GACCGATAAC
TTTCCGCCGC CGCCACCACC ACCACCAGTT TGGGTTTGGC CAACCTATGG CGATTTGACC
TCAGGTTTTG GCTATCGCAA CTTTAGCGTT GGGCGGTTCC ATAACGGCAT CGATATTGCC
AACCGCAAGG GCACGCCAAT TTCAGCTGCC CGCCGTGGCA CGGTGATCGA GGCTGGTTGG
TGTAGCGGCT ATGGCTATTG TGTCAAAATC AGCCATGGCT CTGGGATGGT CACCGAATAC
GGCCATATGA TGTCGAATCC AGTGGTTTCC GAAGGCCAAG AGGTCGAGGC AGGCCAATTG
ATCGGCTATA TGGGCAGCAC CTATGATCGA GCTGGCGGTG GCTACTCAAC GGGGGTTCAC
CTGCACTTCA CCATCAAGGT CGATGGTACC GCAGTTAATC CACTCAAGTA CCTACCCTAA
 
Protein sequence
MTEKRTPRRR VQRAQTQNNA EVILLPAPNH TTTAFRQKSD TGRIESLNRG GRSAQRRRKQ 
EQRAASIARS SLAQIAGTPT RAPQAISEPK QPRRSFKAAA QSVRQRLQLQ RRLVAHSVIL
LAACVVAWGR GFPLNPVSDN ATNQAIPEVL AAAQLVDEVE AGNFQRVVIP ISATQPQSVG
GGEHEVVVDV GPALIKPAPE RGPAFVATHL VANEETIADI AAKYTITPES LIAANNLTGP
VAIGETLRIP RVSGVPHTVS EGETITDIAE RYSVGADVIM TFPPNGLDQG QALIAGREIF
VPGASLAGVQ NVSVRGAIDS INQKSQAAAI VLADRTNLRE GPGTAYEKIV KVNAGERLQL
IAKHEVWVKI RQSDGEVAWV AREVVSIPEE VWSALEETDN FPPPPPPPPV WVWPTYGDLT
SGFGYRNFSV GRFHNGIDIA NRKGTPISAA RRGTVIEAGW CSGYGYCVKI SHGSGMVTEY
GHMMSNPVVS EGQEVEAGQL IGYMGSTYDR AGGGYSTGVH LHFTIKVDGT AVNPLKYLP