Gene Haur_3225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3225 
Symbol 
ID5735093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4080694 
End bp4083552 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content52% 
IMG OID641280371 
Product1A family penicillin-binding protein 
Protein accessionYP_001545990 
Protein GI159899743 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0744] Membrane carboxypeptidase (penicillin-binding protein) 
TIGRFAM ID[TIGR02073] penicillin-binding protein 1C
[TIGR02074] penicillin-binding protein, 1A family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00720883 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATGT TAACCAGCCC CCATTCCCCA GCGACCAGTC CCCAAAAGAC TATGGCACAA 
TCAAACAGCA ATCAAACTCT ATTAACTCCG CGCCAAGAAC GGCGCTTGGC GCGGGTTGCC
CGTCGTCAAG CCCGCCCGTG GTGGCTCAAA GCACTCATCC TGCTGGTTGA ACTGCTGCTC
ATTGGCGTAT TTTGCGTGCT TGCGGCAGGC TTGGGCGGTT ATTGGTATTT TAGTCGTAAT
TTGCCTTCGA TTGATAATTT GGGTACGCAT CGCGCCTTTG AAACCACCAA ACTCTATGCC
CGCGATGGCA CAACCTTACT GTATGAAATT TTCGATCCCA ATGCTGGTCA ACGCACGGTT
GTGCCATTCA GCGCATTCTC CGAGTATTTG AAACAGGCCA CAATTGCAGT CGAAGATAGT
AATTTTTATA CTAATCCTGG GGTCAATTTA CCTTCGATCG CCCGCGCCGC GCTAGCCAAT
TTGACCGATC AAGAATCCGG TCAAGGTGGT GCATCAACCA TTACTCAGCA GTTAGTGCGC
AATGTGTTGC TCTCACCCGA GGAGCGTAGC CAACAAACGC CGCAACGTAA AATTCGCGAA
GCAATTTTGG CCTATCAAAT TAGCCAACGC TACTCCAAAG ATCAAATTTT AGCCTTGTAT
TTGAATGAAA TTCCTTATGG CAATAATGCG TATGGTGCTG AGGCCGCTGC CCAAGCCTAT
TTTGGGGTTA GCGTGAGCGA CCTGAGCTTG GCCCAAGCAG CGATGCTGGC TGGCTTGCCG
CAATCGCCGT CGCAACTTGA CCCGTTGGTT AATGCCGATG CCGCCAAAAA ACGCCAAGAA
ATTGTCTTGG CAGCGATGGT GCGCAATGGG GTGATTACGC CGCAACAAGC TGAGCAAGCT
TTTGCTGAGG TGTTGTATGT CAAGCCAGCC CAAGTTAATT TAACCGCGCC GCATTTTGTG
TTTTATGTGC GCGAATTGCT TGAAGCTCGC TACGGCCCCG AATTGCTCTA TCGCGGCGGC
TTGCGGGTCA CCACGACGCT TGATCCACAT TGGCAAGCGA TTGCCCAACA AGAGGTGCAA
CAACGCATTA GCGAAATTGC CCAGCAAAAT GCGACCAATG GCTCGGTTGT GATGCTTGAT
CGCAAAACCA ACCAAATGTT GGCGCTGGTT GGCTCAGCCG ATTACAACAA TACGGCGATT
GATGGTCAAG TTAATGTGGC GTTGGCCGAA CGGCAACCTG GCTCAGCGCT CAAACCGTTT
GTCTATGCGG CGGCCATGCT CCGCGATTGG ACTGCTGCCA GTGTTCTGTG GGATGTACCA
ACTGAATATC GTTTTGCGGG TGGCGAAGTT TATGCGCCCA AAAATTATGA TCGCAGTTTC
CACGGCCCCG TTTCAATTCG CGTCGCCTTA GCCAATTCAT TCAACGTGCC AGCAGTCAAA
ACCCTCGATC ATGTCGGGAT TGATGAGTTT TTGCGCTTGA TGCAGCGGGT TGGCATCAGC
ACCTTGGATG ATCGCCCACG CTATGGGCTT TCGTTGGCGC TGGGTGGTGG CGAGGTTAAG
CTGCTTGAAT TGACCACAGC CTACAGCGTT TTTGCTAACG AGGGCAATTA TCGCCCAGCC
ACAACCATTT TGAAGGTGGT GAATACTCGT GGCGAAGTGC TTGAGGCTTG GAGCGAACCA
CCAAAACAAG CGGTGCTCGG CCCCGATGGC AATGGGCTTG CCTTTATTAT TTCGAGCATG
CTCAGCGATA ACAAAGCCCG CGAATGGATG TTTGGGCCAG ATAATGCCAT GGAATTACCG
AATGATCGTC CGGCGGCGGT CAAAACTGGC ACAACCGACG ACGACCGCGA TAGCTGGACG
GTTGGCTATA CGCCGAGTGT AGTGATTGGT GCATGGGTCG GTAACAGCGA TAATAGCCCA
ATGCAAGCAG TTCCAGGCTC GTTTGGCGCA GCAGTGATTT GGAATCGGCT AATGACTAAA
TATCACGAAG GTTTGCCAAT TGAGCAATTT ACCCCGCCTT CAAATGTTTC CGAACATGAA
GTTTGTATTC CAACTGGCAC CAAGCCATCA GCCGCCTGCC CCAACATTCG CACGGAATAT
TTTGTTAATG GCACGGAGCC TGTCGAAACC GAAAACGTCT ATCGCACCGT GCGGATTGGG
CCAAGTGGCG ATTGTGTAGC GCTGCCCAAT CAGCCAGGCG AAGATCGGGT ATTTGCAATC
TATCCTGATG AGGCGGGCAA TTGGGGCGAA ACTGGTGGCT TAGGCACTCC GCCAACCAAG
CCTTGCCCGT TGGTCAATTC GACCAGTGCT GGTGGCGCAA ATGTGGCAAT TGCCTTGATC
GAGCCAAGTG ATGGAGCGGC CTTAGCCTCG CCAATTCGAA TTCGCGGCAG CGCGGCTGGC
GATTACAACG TGGCTTGGGG CACTGGCACC AATCCAAGCA GTTGGACGAC CATTATCGAA
GGCTTTGGCG GAATTTCTAA TGGTTTGCTG GCGATGTGGA ACGCCGATGG CTTGCCTGAG
GGAGCCTATA GCCTGCGTTT GTTGGTCAAG CAAAGCAACG GCATGAACGA ACAACGGGTA
ACAATCTATC TTGATCGAAC TGCGCCAACG ATTTCAATTA ATCTACCAGC CTCGGCCTTG
CGCGGCCAAC CACTGCAATT CCAAGCCACT GCCCAAGATG ATCGTCAGCT GGCTAAAGTT
GAGTGGACGG TTAATGGCGA GGTGTTTATG CGCGATCAAG CACCGTATAG CCTTGATTTC
AGCCCAACTC AAGCTGGAAA TTATCGGGTT GTGGCCACGG CAATTGATCA AGCAGGCAAC
CGCGCAACCA GTAGTGTCAG TGTGCTGAGC GTTAAATAG
 
Protein sequence
MLMLTSPHSP ATSPQKTMAQ SNSNQTLLTP RQERRLARVA RRQARPWWLK ALILLVELLL 
IGVFCVLAAG LGGYWYFSRN LPSIDNLGTH RAFETTKLYA RDGTTLLYEI FDPNAGQRTV
VPFSAFSEYL KQATIAVEDS NFYTNPGVNL PSIARAALAN LTDQESGQGG ASTITQQLVR
NVLLSPEERS QQTPQRKIRE AILAYQISQR YSKDQILALY LNEIPYGNNA YGAEAAAQAY
FGVSVSDLSL AQAAMLAGLP QSPSQLDPLV NADAAKKRQE IVLAAMVRNG VITPQQAEQA
FAEVLYVKPA QVNLTAPHFV FYVRELLEAR YGPELLYRGG LRVTTTLDPH WQAIAQQEVQ
QRISEIAQQN ATNGSVVMLD RKTNQMLALV GSADYNNTAI DGQVNVALAE RQPGSALKPF
VYAAAMLRDW TAASVLWDVP TEYRFAGGEV YAPKNYDRSF HGPVSIRVAL ANSFNVPAVK
TLDHVGIDEF LRLMQRVGIS TLDDRPRYGL SLALGGGEVK LLELTTAYSV FANEGNYRPA
TTILKVVNTR GEVLEAWSEP PKQAVLGPDG NGLAFIISSM LSDNKAREWM FGPDNAMELP
NDRPAAVKTG TTDDDRDSWT VGYTPSVVIG AWVGNSDNSP MQAVPGSFGA AVIWNRLMTK
YHEGLPIEQF TPPSNVSEHE VCIPTGTKPS AACPNIRTEY FVNGTEPVET ENVYRTVRIG
PSGDCVALPN QPGEDRVFAI YPDEAGNWGE TGGLGTPPTK PCPLVNSTSA GGANVAIALI
EPSDGAALAS PIRIRGSAAG DYNVAWGTGT NPSSWTTIIE GFGGISNGLL AMWNADGLPE
GAYSLRLLVK QSNGMNEQRV TIYLDRTAPT ISINLPASAL RGQPLQFQAT AQDDRQLAKV
EWTVNGEVFM RDQAPYSLDF SPTQAGNYRV VATAIDQAGN RATSSVSVLS VK