Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3225 |
Symbol | |
ID | 5735093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4080694 |
End bp | 4083552 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280371 |
Product | 1A family penicillin-binding protein |
Protein accession | YP_001545990 |
Protein GI | 159899743 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0744] Membrane carboxypeptidase (penicillin-binding protein) |
TIGRFAM ID | [TIGR02073] penicillin-binding protein 1C [TIGR02074] penicillin-binding protein, 1A family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00720883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATGT TAACCAGCCC CCATTCCCCA GCGACCAGTC CCCAAAAGAC TATGGCACAA TCAAACAGCA ATCAAACTCT ATTAACTCCG CGCCAAGAAC GGCGCTTGGC GCGGGTTGCC CGTCGTCAAG CCCGCCCGTG GTGGCTCAAA GCACTCATCC TGCTGGTTGA ACTGCTGCTC ATTGGCGTAT TTTGCGTGCT TGCGGCAGGC TTGGGCGGTT ATTGGTATTT TAGTCGTAAT TTGCCTTCGA TTGATAATTT GGGTACGCAT CGCGCCTTTG AAACCACCAA ACTCTATGCC CGCGATGGCA CAACCTTACT GTATGAAATT TTCGATCCCA ATGCTGGTCA ACGCACGGTT GTGCCATTCA GCGCATTCTC CGAGTATTTG AAACAGGCCA CAATTGCAGT CGAAGATAGT AATTTTTATA CTAATCCTGG GGTCAATTTA CCTTCGATCG CCCGCGCCGC GCTAGCCAAT TTGACCGATC AAGAATCCGG TCAAGGTGGT GCATCAACCA TTACTCAGCA GTTAGTGCGC AATGTGTTGC TCTCACCCGA GGAGCGTAGC CAACAAACGC CGCAACGTAA AATTCGCGAA GCAATTTTGG CCTATCAAAT TAGCCAACGC TACTCCAAAG ATCAAATTTT AGCCTTGTAT TTGAATGAAA TTCCTTATGG CAATAATGCG TATGGTGCTG AGGCCGCTGC CCAAGCCTAT TTTGGGGTTA GCGTGAGCGA CCTGAGCTTG GCCCAAGCAG CGATGCTGGC TGGCTTGCCG CAATCGCCGT CGCAACTTGA CCCGTTGGTT AATGCCGATG CCGCCAAAAA ACGCCAAGAA ATTGTCTTGG CAGCGATGGT GCGCAATGGG GTGATTACGC CGCAACAAGC TGAGCAAGCT TTTGCTGAGG TGTTGTATGT CAAGCCAGCC CAAGTTAATT TAACCGCGCC GCATTTTGTG TTTTATGTGC GCGAATTGCT TGAAGCTCGC TACGGCCCCG AATTGCTCTA TCGCGGCGGC TTGCGGGTCA CCACGACGCT TGATCCACAT TGGCAAGCGA TTGCCCAACA AGAGGTGCAA CAACGCATTA GCGAAATTGC CCAGCAAAAT GCGACCAATG GCTCGGTTGT GATGCTTGAT CGCAAAACCA ACCAAATGTT GGCGCTGGTT GGCTCAGCCG ATTACAACAA TACGGCGATT GATGGTCAAG TTAATGTGGC GTTGGCCGAA CGGCAACCTG GCTCAGCGCT CAAACCGTTT GTCTATGCGG CGGCCATGCT CCGCGATTGG ACTGCTGCCA GTGTTCTGTG GGATGTACCA ACTGAATATC GTTTTGCGGG TGGCGAAGTT TATGCGCCCA AAAATTATGA TCGCAGTTTC CACGGCCCCG TTTCAATTCG CGTCGCCTTA GCCAATTCAT TCAACGTGCC AGCAGTCAAA ACCCTCGATC ATGTCGGGAT TGATGAGTTT TTGCGCTTGA TGCAGCGGGT TGGCATCAGC ACCTTGGATG ATCGCCCACG CTATGGGCTT TCGTTGGCGC TGGGTGGTGG CGAGGTTAAG CTGCTTGAAT TGACCACAGC CTACAGCGTT TTTGCTAACG AGGGCAATTA TCGCCCAGCC ACAACCATTT TGAAGGTGGT GAATACTCGT GGCGAAGTGC TTGAGGCTTG GAGCGAACCA CCAAAACAAG CGGTGCTCGG CCCCGATGGC AATGGGCTTG CCTTTATTAT TTCGAGCATG CTCAGCGATA ACAAAGCCCG CGAATGGATG TTTGGGCCAG ATAATGCCAT GGAATTACCG AATGATCGTC CGGCGGCGGT CAAAACTGGC ACAACCGACG ACGACCGCGA TAGCTGGACG GTTGGCTATA CGCCGAGTGT AGTGATTGGT GCATGGGTCG GTAACAGCGA TAATAGCCCA ATGCAAGCAG TTCCAGGCTC GTTTGGCGCA GCAGTGATTT GGAATCGGCT AATGACTAAA TATCACGAAG GTTTGCCAAT TGAGCAATTT ACCCCGCCTT CAAATGTTTC CGAACATGAA GTTTGTATTC CAACTGGCAC CAAGCCATCA GCCGCCTGCC CCAACATTCG CACGGAATAT TTTGTTAATG GCACGGAGCC TGTCGAAACC GAAAACGTCT ATCGCACCGT GCGGATTGGG CCAAGTGGCG ATTGTGTAGC GCTGCCCAAT CAGCCAGGCG AAGATCGGGT ATTTGCAATC TATCCTGATG AGGCGGGCAA TTGGGGCGAA ACTGGTGGCT TAGGCACTCC GCCAACCAAG CCTTGCCCGT TGGTCAATTC GACCAGTGCT GGTGGCGCAA ATGTGGCAAT TGCCTTGATC GAGCCAAGTG ATGGAGCGGC CTTAGCCTCG CCAATTCGAA TTCGCGGCAG CGCGGCTGGC GATTACAACG TGGCTTGGGG CACTGGCACC AATCCAAGCA GTTGGACGAC CATTATCGAA GGCTTTGGCG GAATTTCTAA TGGTTTGCTG GCGATGTGGA ACGCCGATGG CTTGCCTGAG GGAGCCTATA GCCTGCGTTT GTTGGTCAAG CAAAGCAACG GCATGAACGA ACAACGGGTA ACAATCTATC TTGATCGAAC TGCGCCAACG ATTTCAATTA ATCTACCAGC CTCGGCCTTG CGCGGCCAAC CACTGCAATT CCAAGCCACT GCCCAAGATG ATCGTCAGCT GGCTAAAGTT GAGTGGACGG TTAATGGCGA GGTGTTTATG CGCGATCAAG CACCGTATAG CCTTGATTTC AGCCCAACTC AAGCTGGAAA TTATCGGGTT GTGGCCACGG CAATTGATCA AGCAGGCAAC CGCGCAACCA GTAGTGTCAG TGTGCTGAGC GTTAAATAG
|
Protein sequence | MLMLTSPHSP ATSPQKTMAQ SNSNQTLLTP RQERRLARVA RRQARPWWLK ALILLVELLL IGVFCVLAAG LGGYWYFSRN LPSIDNLGTH RAFETTKLYA RDGTTLLYEI FDPNAGQRTV VPFSAFSEYL KQATIAVEDS NFYTNPGVNL PSIARAALAN LTDQESGQGG ASTITQQLVR NVLLSPEERS QQTPQRKIRE AILAYQISQR YSKDQILALY LNEIPYGNNA YGAEAAAQAY FGVSVSDLSL AQAAMLAGLP QSPSQLDPLV NADAAKKRQE IVLAAMVRNG VITPQQAEQA FAEVLYVKPA QVNLTAPHFV FYVRELLEAR YGPELLYRGG LRVTTTLDPH WQAIAQQEVQ QRISEIAQQN ATNGSVVMLD RKTNQMLALV GSADYNNTAI DGQVNVALAE RQPGSALKPF VYAAAMLRDW TAASVLWDVP TEYRFAGGEV YAPKNYDRSF HGPVSIRVAL ANSFNVPAVK TLDHVGIDEF LRLMQRVGIS TLDDRPRYGL SLALGGGEVK LLELTTAYSV FANEGNYRPA TTILKVVNTR GEVLEAWSEP PKQAVLGPDG NGLAFIISSM LSDNKAREWM FGPDNAMELP NDRPAAVKTG TTDDDRDSWT VGYTPSVVIG AWVGNSDNSP MQAVPGSFGA AVIWNRLMTK YHEGLPIEQF TPPSNVSEHE VCIPTGTKPS AACPNIRTEY FVNGTEPVET ENVYRTVRIG PSGDCVALPN QPGEDRVFAI YPDEAGNWGE TGGLGTPPTK PCPLVNSTSA GGANVAIALI EPSDGAALAS PIRIRGSAAG DYNVAWGTGT NPSSWTTIIE GFGGISNGLL AMWNADGLPE GAYSLRLLVK QSNGMNEQRV TIYLDRTAPT ISINLPASAL RGQPLQFQAT AQDDRQLAKV EWTVNGEVFM RDQAPYSLDF SPTQAGNYRV VATAIDQAGN RATSSVSVLS VK
|
| |