Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3648 |
Symbol | |
ID | 5735509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4588222 |
End bp | 4589397 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280797 |
Product | NLP/P60 protein |
Protein accession | YP_001546412 |
Protein GI | 159900165 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGAT TGCCAATCCT CGCACGGGAT CGTCGGCGCT TGCAGATCGC TGGGTTATTG CTTGGCGCTG GGTTAACTGT GCCGTTGCTG CTTTGGTGGA CGTTTCCCGC AACCTCTCCC GCGCCGCTTG GGGCAACGGC CACACCGCAA ATTGTGCGCG TCGGCGCAAC CCTCGATGAG CCAATTCCTG CTGATTGGCA AGCCCCCGTT TTAAACCCCG AACTCGAAGC CCAGGCCTTG GCTATCCCTG AAACCTTGCC GTTAACCGCT AGTGCGCTAT TGTATGATTC GATCTCTGCT GATACGGTCT TTACTACAAC AATTGCTGGG TTGGTTGCCA GCGAGGAATT GAATTTGCGC GATGGCCCGA GCGTTGATTA TTTGCCCATG GCGATTTTGC TCAACACCAC GCCGTTGACA GTAGTTGGCC GATTTGAGGG CTGGCTGCAA GTTGTAACCC CGCAACGAGC GCTTGGTTGG GTTGATGATA GTTATGTGGC CTTGGCCAGT TCAGCCCAAA CCCTGCCCCA AGTTAATCTG CATGCCGACC CAAATCCAGT TTTAGTGGCG GGATTAACGG TTGAACGAGC TAATGTTCGC TCGAAGCCGC AAACTGAAGC TGAAATTATC ACGACCTTGA GCGCTGAGCA TGGGCAAGTC AATTTATTGC AACAACGTGA GGGTTGGTTC AATGTGCGCA CCAACGATGG CACTGAGGGC TGGGTTTCCG CCGAACTGTT ACAAGCCGAT GCCTATATTT TGCGGCGTGT GCCAACCTTG AGTGCCTCGC CCAACGCGCT TGAAGCGGTG CGTTTGGCCC GCAAATATGT AGGCTATCCC TATGTTTGGG GCGGCGAAAC TCCGCGCGGT GGCTTCGATT GCTCAGGCTT GGTGCTGTAT GTTTATGGCA AATTAGGCAT CGATATGCCC CATAGCGCCG CCGAACAATG GACTGGTGGT TATGGCGAGA AAGTTGCTAG TCGCCGCGAT TTAGTGCCTG GCGATATTGT TTTTTTCAAA AATACCTATA AAAAAGGCGT GAGCCATGTG GGCATTTATG CTGGCAATGG CAAAGTGATT CAGGCGCTCT CCGAGAGTTT AGGCATTCGC GTTTCCGATT TATCCAATAG CTATTGGAGC AGCCGCTATG TTGGGGCAAT TCGGCCATTT CCCTAG
|
Protein sequence | MPRLPILARD RRRLQIAGLL LGAGLTVPLL LWWTFPATSP APLGATATPQ IVRVGATLDE PIPADWQAPV LNPELEAQAL AIPETLPLTA SALLYDSISA DTVFTTTIAG LVASEELNLR DGPSVDYLPM AILLNTTPLT VVGRFEGWLQ VVTPQRALGW VDDSYVALAS SAQTLPQVNL HADPNPVLVA GLTVERANVR SKPQTEAEII TTLSAEHGQV NLLQQREGWF NVRTNDGTEG WVSAELLQAD AYILRRVPTL SASPNALEAV RLARKYVGYP YVWGGETPRG GFDCSGLVLY VYGKLGIDMP HSAAEQWTGG YGEKVASRRD LVPGDIVFFK NTYKKGVSHV GIYAGNGKVI QALSESLGIR VSDLSNSYWS SRYVGAIRPF P
|
| |