Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4995 |
Symbol | |
ID | 5736831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6335597 |
End bp | 6338062 |
Gene Length | 2466 bp |
Protein Length | 821 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282162 |
Product | penicillin amidase |
Protein accession | YP_001547753 |
Protein GI | 159901506 |
COG category | [R] General function prediction only |
COG ID | [COG2366] Protein related to penicillin acylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAACAC CTGCCAAACG CTCATTGCGT GATCGTCCAC CCCTTCAGCG TTGGTTTATT CTGTTTTTGC GCTTCTTGTT AATTGTTGTG CTCTTGCTGC TGTTGGCCAG CGGCGGCGGC TATCTCTATT TGCGCCGCTC GTTGCCAACC ACTGCAGGAA CCCTGACCCT CGCTGGCCTC GCGGCTCCCG TCGATGTGTT GCGCGATCAA TATGGCGTGC CGCATATCTA CGCCCAAAGC GAAACCGATG CCCTGATGGC CTTGGGCTAT GTTCATGCTC AAGATCGGCT GTGGCAAATG GAGTTTCAAC GCCGCATCGC CCGTGGTACG CTCTCCGAAG CTTTGGGCGA AACCACGGTC GAAACCGACC GCTTTTTGCG CACGCTTGGG GTTTATCGCG CTGCTGAAAG TGCTTACGTA GCGCTCGATC CGGCTGCTAA AACGATTGTT GATGCCTATG TTAGTGGTAT CAACGCTTTT CTGGCTGAGC ACAGCGGCGG CGAACTTTCG CCTGAGTTCA CTTTAGTTGG GGTCGAGCCG GAGCCATGGG TCGGCGCTGA TGTGCTGGGC TGGGCCAAAA TGATGTCGTG GGATTTGGGC GGTAATTATT CGACCGAATT GCTGACCATG GAATTGGTTG CCAAATTGGG CAGCGAAAAA ACCAAAGATC TTGTGCCCCA CTATCCGAGC GATGGCCCGC TGATTGTGCC AAATCCCAGC GGTGGTAGCC AAGCTGAGCA ATTGTTGGCA TTAAGTCGGC GAGTTGAGCA AGGCCTAGGC ATCGGTGGTG GCAATATTGC TGGTGTTGGC TCGAACAATT GGGTGATCGG CCCGAGCAAA TCAGCCACAG GCAAACCCTT ATTGGCCGAC GACCCGCACT TGAGCTTTCG CACGCCCTCA ATTTGGTATA TGGCCGAGGT CGAAGGTGGC GATCTGCACT CGGTTGGTGC GACGATTCCC GGGTTGCCTG GGGTGATTGT TGGCCACAAC CAACGGATTG CGTGGGGCGT GACCAATACT GGCCCTGACG TTCAAGATTT GTATCGCGAA ACCCTCGACC CAACTGGCAC TCAAGCCATG TTCAAGGGCA GTTACGAACC ACTGACGATT ATTAGCGATA CGATTCGGGT CAAAGATAAA GGCAATCTGC CGCTGACGAT TCGGGTTAGT CGCCATGGGC CATTGATTTC TGATGCGCTG AATGCCAATA ACGCCGACGA CCCCGATGCG CCGCAACGTG AAGCCTTGGC CTTTCGCTGG ACAGCGCTCG ACCAAACTGA TAACACCATC AATGCCTATT TGGCGATTAA CAAAGCCCAA AATTGGCAAG AATTTCAGGC TGGTTTAGAA AGTTATGTTG CGCCAGTTCA GAATTTTGTC TTTGCTGATG TTGATGGCAA TATTGGCTAT ATGGCACCTG GCCATATTCC AATTCGTGCC AATGGTGATG GCACAATGCC GGCTGATGGG GCTTCGGGCG ACTACGAATG GACGGGCTTT ATTCCATTTG AGCAATTGCC CCAAAGCTAT AACCCGCCTC AAGGCTATAT TGCGACGGCG AATAATAAAG TTGTGGCCGA TAGTTACCCG TATTTTCTCA GCCACGAGTG GGCCACACCA TTTCGCGCCC AACGCATTAC CAAGTTGATT GAGGCCAAAC CAACGTTAAC TATGGACGAT ATGGCCGCGA TTCAGGCCGA TGTGCATTCG ACCTATGCCG AGGAATTATT GCCAGTTCTG CTAAATTTGG TGCAACCAAC CAGCGATCAA CAGCGCCAAG CCATCGCCAT GCTGCAAAAT TGGAACTACA GCACCGCAGG CGATCAACCA GCCGCCAGCA TTTTCGAAGC TTGGACCTAT TATTTGACCG TGCCGATGGT TGGCGACGAA TTGGGCGAAC GTTTGCTTGA AACCTATGGT CAGCGCCGCC AACTGATCGA TTTGGCGATT CCAGCGATGT TGCAAGACCC CAACAACTCT TGGTGTGACG ACGTTACCAC GACTAGCAGC ACTGAAAACT GCAATGCAAT TGTGACCCAA GCGCTTGATG TGGCACTCAA AGATTTAAGC TTCCGCATGC AAGATAAACC GATGGAGCAA TGGCGTTGGG GCACAATTCA CCTTGCCTTG TTCCCGCATA ACCCGCTTGA TGCGATTGGG CCGTTGCGGG GCTTTTTCAG CAAAGCAATT GAAAGTGCTG GCGATGGTAG CACGGTTAAT GTTGGCCATG TTGCCGATGG CGAGCCATTT GATCAAGATC GTGGGCCAAT TTATCGGCAT ATTGTTGATT TAGGCGATTT TGCCAATAGC CGCATGATCA ATGCACCTGG CCAAGCAGGC CATTTTCTAT CGCCGCACTA CGATGATCTG CTCGAACGCT GGCAAAAAGT CGAATACATT CCTATGACCT ATGGCCGCGC CGCAGTCAGC GCTGGCGAGG TGGAGATGTT ACAATTACAA CCCTAG
|
Protein sequence | MATPAKRSLR DRPPLQRWFI LFLRFLLIVV LLLLLASGGG YLYLRRSLPT TAGTLTLAGL AAPVDVLRDQ YGVPHIYAQS ETDALMALGY VHAQDRLWQM EFQRRIARGT LSEALGETTV ETDRFLRTLG VYRAAESAYV ALDPAAKTIV DAYVSGINAF LAEHSGGELS PEFTLVGVEP EPWVGADVLG WAKMMSWDLG GNYSTELLTM ELVAKLGSEK TKDLVPHYPS DGPLIVPNPS GGSQAEQLLA LSRRVEQGLG IGGGNIAGVG SNNWVIGPSK SATGKPLLAD DPHLSFRTPS IWYMAEVEGG DLHSVGATIP GLPGVIVGHN QRIAWGVTNT GPDVQDLYRE TLDPTGTQAM FKGSYEPLTI ISDTIRVKDK GNLPLTIRVS RHGPLISDAL NANNADDPDA PQREALAFRW TALDQTDNTI NAYLAINKAQ NWQEFQAGLE SYVAPVQNFV FADVDGNIGY MAPGHIPIRA NGDGTMPADG ASGDYEWTGF IPFEQLPQSY NPPQGYIATA NNKVVADSYP YFLSHEWATP FRAQRITKLI EAKPTLTMDD MAAIQADVHS TYAEELLPVL LNLVQPTSDQ QRQAIAMLQN WNYSTAGDQP AASIFEAWTY YLTVPMVGDE LGERLLETYG QRRQLIDLAI PAMLQDPNNS WCDDVTTTSS TENCNAIVTQ ALDVALKDLS FRMQDKPMEQ WRWGTIHLAL FPHNPLDAIG PLRGFFSKAI ESAGDGSTVN VGHVADGEPF DQDRGPIYRH IVDLGDFANS RMINAPGQAG HFLSPHYDDL LERWQKVEYI PMTYGRAAVS AGEVEMLQLQ P
|
| |