Gene Haur_3076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3076 
Symbol 
ID5734948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3885004 
End bp3887337 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content53% 
IMG OID641280220 
Productlytic transglycosylase catalytic 
Protein accessionYP_001545842 
Protein GI159899595 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.627553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCCA TGAAGCGCGA TGAGATTCTT ATGCTGAAAA AAATTCCACA CGTAGCCTTG 
ATTGTGCTAC TGCTAGCCAG TTGTAGCCGC GATGTCGTCG GTCAACCAAC CGCCACGCCG
CTACCATCCC CTGCCCAATT GTTGCAGCAA GCCGAGCAAC ATCAGCAAGC CGACCAAGTT
GATTTGGCCT TGAGCGATTA TCAACAGGTG TTATTGCAAT ACCCCGATGC GCCCGAAGCG
CGGGTAGCCA AATTTGGGGT AGCCTATAGT GCCTTCTTAC GCCAAGATTG GGCTGCTGCA
TGGTCGCAAT TAACTTCCTT TATCAACGAG CAAACCCACG ATCAATGGCA TTTGCGGGCG
CTCTTTTTGC TGGGACGGGT CGCTGAAATT CAAGGTGACC ATGCGATCGC GATCGAGGCC
TATCAACAAT ATGAAGATCT TAAAGGCTTG TTGAGTGGCT ATGCAGCTCA GCGCCGTGCC
GCCCAACTGC AAGCAACCAA CCAAACCGAG CAAGCGATTG CGGCCTATGC TGCCAGCGGA
CGCTACGATA TGGCTGGGCC GCAGCGGGTC GCCAGCTTAA ACAAAGCCTT AGAATTTTAT
GATCAAACGG GGCAGGCCGA GCAAGCACTT ACTCAACTCG AAGTTATTTT GAGCTTTGCC
CGAACCCCAA GTTTCCGTTC AACGACCTTG CTTGATGCAG CGCGACGTGC TCAGCGCTTG
GGCAAAACTG AACAAGCCCG AATATGGCTG CGCGAAATCA TTAACCAACA CCCAACCTTG
AGCGAAGCTC CAATTGCGAT TGATGAACTG GCAGCTTTGG GCGAATCAAC CCCAGTGCTG
GCCGCCGCTG GGATTGCCTA TAACCATGGT CAATATCTTG ATGCGATCAG TTTGTTCGAC
CAAGTGCTGG CCAATGGCCT GAGTGGCGAA GAAGCCGCCG AAATCGAGCG CAAACGGGCT
TTGGCCTTGC GCCAGCTCGA TGATTATGCT GGTGCACAAG CAGCATTTAA TAGTATTGCC
GAGCGTTTTG CCGAGTTGCC GATTGGCCGC CAAGCACGAC TTGATGCGAT TCAAACCCAA
GGCCAAGCGG GTGATCGTGA AGGCTCACGT TTGGCCTATC TCGATTTTGC CGAGCGTTAT
GCCGATGATC CGTTAGCACC CGAAGCTTTG CGCCGCGTCG TCGAAATTAC CTCGTGGAGC
GGCGATCCGG CAGCCACAGC CAATGCGCAA ATTATGCTTG GCCAGCGCTA CCCTTGGAGC
CACGAAGGCC AGCAAGCCTT GCACGCCGCA GGCCGTTATG CCTGGGATAC GGGGCAGGTT
GAGCAGGCGG CAGCAGTTTG GCAATTATTA GGCGATAGCA ATATTGGGCC GCCTCGCGCC
GAAGGCTATT ATTGGCTGGG GCGCTTGGAA ATTAGCCGTG GCAATCGGGA AAAAGGCGAG
CAATTGCTAC GTTCGGCTCA ATCAGCTGAT CCAAATTCGT ATTATGCGGC ACGGGTTGCC
GATGCCCTCA ACATTAACGA TGGCGATCAA CTGCCGATTG GCTCGCCAAT TAGCCCTGAG
GCCGAACAAG CAGGTTGGCA ATGGATCGCT AGTTGGTCAA CTGCACCAAC CAGCGCCACC
TTAGATACTG AGCCATATAG TTTACGCGCC GAAGAACTAA GCTGGACCGA TTTGCATAGC
GAAGCCCAAG CCGAGTGGAT TGCCGCACGT GATGCTGCCT TGAATAATCC ATTTAGCATT
TATCGGGTTG CTTTGGCCGC TTTACGCAGC GATATGCCCT ATGCTACCGT GACTACAGCT
CAAAAGTTGG TGCAACTTGC GCCAATTGAG GCAGGCGAGC CAAGTGTCGC GATTCGCCAA
TTGCTCTATC CAACACCGTA TCCCAGCGCG GTTGTGACCA AAAGCCAAGA GTTTGGGCTT
GATCCACGGG TGTTGTATGC AGTGATGCGC CAAGAGAGTA TTTTTAATCC AAATGCAACG
TCATGGGTGG GGGCGCGTGG CTTGGCCCAA GTGATGCCCA GCACTGGCGA GGGCATCGCC
CAGAATTTAG GCATTGAGGG CTTTAGCGTC GATGATTTGT ATAATCCGGT GACTTCGATT
CGTTTTGGAG CTTATTATAT CGATGCTCAA ATCGAATATA TGAGTGGCAG CCTACCCGGC
GCGTTTGCCG CCTATAACGG TGGGCCTGGT AATGCCGAGC GTTGGGCTGA TGGTCGCGTG
GTGGCTGATC CCGACCGTTT TATTGAAATT ATTGATTATG CTGAAACTCG CCACTATGTT
GAAGTAGTTT ATGCCAACTA TGGAGCCTAT CGGCGCTTGT ATCAGCAACC ATAA
 
Protein sequence
MAAMKRDEIL MLKKIPHVAL IVLLLASCSR DVVGQPTATP LPSPAQLLQQ AEQHQQADQV 
DLALSDYQQV LLQYPDAPEA RVAKFGVAYS AFLRQDWAAA WSQLTSFINE QTHDQWHLRA
LFLLGRVAEI QGDHAIAIEA YQQYEDLKGL LSGYAAQRRA AQLQATNQTE QAIAAYAASG
RYDMAGPQRV ASLNKALEFY DQTGQAEQAL TQLEVILSFA RTPSFRSTTL LDAARRAQRL
GKTEQARIWL REIINQHPTL SEAPIAIDEL AALGESTPVL AAAGIAYNHG QYLDAISLFD
QVLANGLSGE EAAEIERKRA LALRQLDDYA GAQAAFNSIA ERFAELPIGR QARLDAIQTQ
GQAGDREGSR LAYLDFAERY ADDPLAPEAL RRVVEITSWS GDPAATANAQ IMLGQRYPWS
HEGQQALHAA GRYAWDTGQV EQAAAVWQLL GDSNIGPPRA EGYYWLGRLE ISRGNREKGE
QLLRSAQSAD PNSYYAARVA DALNINDGDQ LPIGSPISPE AEQAGWQWIA SWSTAPTSAT
LDTEPYSLRA EELSWTDLHS EAQAEWIAAR DAALNNPFSI YRVALAALRS DMPYATVTTA
QKLVQLAPIE AGEPSVAIRQ LLYPTPYPSA VVTKSQEFGL DPRVLYAVMR QESIFNPNAT
SWVGARGLAQ VMPSTGEGIA QNLGIEGFSV DDLYNPVTSI RFGAYYIDAQ IEYMSGSLPG
AFAAYNGGPG NAERWADGRV VADPDRFIEI IDYAETRHYV EVVYANYGAY RRLYQQP