Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3076 |
Symbol | |
ID | 5734948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3885004 |
End bp | 3887337 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280220 |
Product | lytic transglycosylase catalytic |
Protein accession | YP_001545842 |
Protein GI | 159899595 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.627553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCCA TGAAGCGCGA TGAGATTCTT ATGCTGAAAA AAATTCCACA CGTAGCCTTG ATTGTGCTAC TGCTAGCCAG TTGTAGCCGC GATGTCGTCG GTCAACCAAC CGCCACGCCG CTACCATCCC CTGCCCAATT GTTGCAGCAA GCCGAGCAAC ATCAGCAAGC CGACCAAGTT GATTTGGCCT TGAGCGATTA TCAACAGGTG TTATTGCAAT ACCCCGATGC GCCCGAAGCG CGGGTAGCCA AATTTGGGGT AGCCTATAGT GCCTTCTTAC GCCAAGATTG GGCTGCTGCA TGGTCGCAAT TAACTTCCTT TATCAACGAG CAAACCCACG ATCAATGGCA TTTGCGGGCG CTCTTTTTGC TGGGACGGGT CGCTGAAATT CAAGGTGACC ATGCGATCGC GATCGAGGCC TATCAACAAT ATGAAGATCT TAAAGGCTTG TTGAGTGGCT ATGCAGCTCA GCGCCGTGCC GCCCAACTGC AAGCAACCAA CCAAACCGAG CAAGCGATTG CGGCCTATGC TGCCAGCGGA CGCTACGATA TGGCTGGGCC GCAGCGGGTC GCCAGCTTAA ACAAAGCCTT AGAATTTTAT GATCAAACGG GGCAGGCCGA GCAAGCACTT ACTCAACTCG AAGTTATTTT GAGCTTTGCC CGAACCCCAA GTTTCCGTTC AACGACCTTG CTTGATGCAG CGCGACGTGC TCAGCGCTTG GGCAAAACTG AACAAGCCCG AATATGGCTG CGCGAAATCA TTAACCAACA CCCAACCTTG AGCGAAGCTC CAATTGCGAT TGATGAACTG GCAGCTTTGG GCGAATCAAC CCCAGTGCTG GCCGCCGCTG GGATTGCCTA TAACCATGGT CAATATCTTG ATGCGATCAG TTTGTTCGAC CAAGTGCTGG CCAATGGCCT GAGTGGCGAA GAAGCCGCCG AAATCGAGCG CAAACGGGCT TTGGCCTTGC GCCAGCTCGA TGATTATGCT GGTGCACAAG CAGCATTTAA TAGTATTGCC GAGCGTTTTG CCGAGTTGCC GATTGGCCGC CAAGCACGAC TTGATGCGAT TCAAACCCAA GGCCAAGCGG GTGATCGTGA AGGCTCACGT TTGGCCTATC TCGATTTTGC CGAGCGTTAT GCCGATGATC CGTTAGCACC CGAAGCTTTG CGCCGCGTCG TCGAAATTAC CTCGTGGAGC GGCGATCCGG CAGCCACAGC CAATGCGCAA ATTATGCTTG GCCAGCGCTA CCCTTGGAGC CACGAAGGCC AGCAAGCCTT GCACGCCGCA GGCCGTTATG CCTGGGATAC GGGGCAGGTT GAGCAGGCGG CAGCAGTTTG GCAATTATTA GGCGATAGCA ATATTGGGCC GCCTCGCGCC GAAGGCTATT ATTGGCTGGG GCGCTTGGAA ATTAGCCGTG GCAATCGGGA AAAAGGCGAG CAATTGCTAC GTTCGGCTCA ATCAGCTGAT CCAAATTCGT ATTATGCGGC ACGGGTTGCC GATGCCCTCA ACATTAACGA TGGCGATCAA CTGCCGATTG GCTCGCCAAT TAGCCCTGAG GCCGAACAAG CAGGTTGGCA ATGGATCGCT AGTTGGTCAA CTGCACCAAC CAGCGCCACC TTAGATACTG AGCCATATAG TTTACGCGCC GAAGAACTAA GCTGGACCGA TTTGCATAGC GAAGCCCAAG CCGAGTGGAT TGCCGCACGT GATGCTGCCT TGAATAATCC ATTTAGCATT TATCGGGTTG CTTTGGCCGC TTTACGCAGC GATATGCCCT ATGCTACCGT GACTACAGCT CAAAAGTTGG TGCAACTTGC GCCAATTGAG GCAGGCGAGC CAAGTGTCGC GATTCGCCAA TTGCTCTATC CAACACCGTA TCCCAGCGCG GTTGTGACCA AAAGCCAAGA GTTTGGGCTT GATCCACGGG TGTTGTATGC AGTGATGCGC CAAGAGAGTA TTTTTAATCC AAATGCAACG TCATGGGTGG GGGCGCGTGG CTTGGCCCAA GTGATGCCCA GCACTGGCGA GGGCATCGCC CAGAATTTAG GCATTGAGGG CTTTAGCGTC GATGATTTGT ATAATCCGGT GACTTCGATT CGTTTTGGAG CTTATTATAT CGATGCTCAA ATCGAATATA TGAGTGGCAG CCTACCCGGC GCGTTTGCCG CCTATAACGG TGGGCCTGGT AATGCCGAGC GTTGGGCTGA TGGTCGCGTG GTGGCTGATC CCGACCGTTT TATTGAAATT ATTGATTATG CTGAAACTCG CCACTATGTT GAAGTAGTTT ATGCCAACTA TGGAGCCTAT CGGCGCTTGT ATCAGCAACC ATAA
|
Protein sequence | MAAMKRDEIL MLKKIPHVAL IVLLLASCSR DVVGQPTATP LPSPAQLLQQ AEQHQQADQV DLALSDYQQV LLQYPDAPEA RVAKFGVAYS AFLRQDWAAA WSQLTSFINE QTHDQWHLRA LFLLGRVAEI QGDHAIAIEA YQQYEDLKGL LSGYAAQRRA AQLQATNQTE QAIAAYAASG RYDMAGPQRV ASLNKALEFY DQTGQAEQAL TQLEVILSFA RTPSFRSTTL LDAARRAQRL GKTEQARIWL REIINQHPTL SEAPIAIDEL AALGESTPVL AAAGIAYNHG QYLDAISLFD QVLANGLSGE EAAEIERKRA LALRQLDDYA GAQAAFNSIA ERFAELPIGR QARLDAIQTQ GQAGDREGSR LAYLDFAERY ADDPLAPEAL RRVVEITSWS GDPAATANAQ IMLGQRYPWS HEGQQALHAA GRYAWDTGQV EQAAAVWQLL GDSNIGPPRA EGYYWLGRLE ISRGNREKGE QLLRSAQSAD PNSYYAARVA DALNINDGDQ LPIGSPISPE AEQAGWQWIA SWSTAPTSAT LDTEPYSLRA EELSWTDLHS EAQAEWIAAR DAALNNPFSI YRVALAALRS DMPYATVTTA QKLVQLAPIE AGEPSVAIRQ LLYPTPYPSA VVTKSQEFGL DPRVLYAVMR QESIFNPNAT SWVGARGLAQ VMPSTGEGIA QNLGIEGFSV DDLYNPVTSI RFGAYYIDAQ IEYMSGSLPG AFAAYNGGPG NAERWADGRV VADPDRFIEI IDYAETRHYV EVVYANYGAY RRLYQQP
|
| |