Gene Haur_4995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4995 
Symbol 
ID5736831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6335597 
End bp6338062 
Gene Length2466 bp 
Protein Length821 aa 
Translation table11 
GC content53% 
IMG OID641282162 
Productpenicillin amidase 
Protein accessionYP_001547753 
Protein GI159901506 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAACAC CTGCCAAACG CTCATTGCGT GATCGTCCAC CCCTTCAGCG TTGGTTTATT 
CTGTTTTTGC GCTTCTTGTT AATTGTTGTG CTCTTGCTGC TGTTGGCCAG CGGCGGCGGC
TATCTCTATT TGCGCCGCTC GTTGCCAACC ACTGCAGGAA CCCTGACCCT CGCTGGCCTC
GCGGCTCCCG TCGATGTGTT GCGCGATCAA TATGGCGTGC CGCATATCTA CGCCCAAAGC
GAAACCGATG CCCTGATGGC CTTGGGCTAT GTTCATGCTC AAGATCGGCT GTGGCAAATG
GAGTTTCAAC GCCGCATCGC CCGTGGTACG CTCTCCGAAG CTTTGGGCGA AACCACGGTC
GAAACCGACC GCTTTTTGCG CACGCTTGGG GTTTATCGCG CTGCTGAAAG TGCTTACGTA
GCGCTCGATC CGGCTGCTAA AACGATTGTT GATGCCTATG TTAGTGGTAT CAACGCTTTT
CTGGCTGAGC ACAGCGGCGG CGAACTTTCG CCTGAGTTCA CTTTAGTTGG GGTCGAGCCG
GAGCCATGGG TCGGCGCTGA TGTGCTGGGC TGGGCCAAAA TGATGTCGTG GGATTTGGGC
GGTAATTATT CGACCGAATT GCTGACCATG GAATTGGTTG CCAAATTGGG CAGCGAAAAA
ACCAAAGATC TTGTGCCCCA CTATCCGAGC GATGGCCCGC TGATTGTGCC AAATCCCAGC
GGTGGTAGCC AAGCTGAGCA ATTGTTGGCA TTAAGTCGGC GAGTTGAGCA AGGCCTAGGC
ATCGGTGGTG GCAATATTGC TGGTGTTGGC TCGAACAATT GGGTGATCGG CCCGAGCAAA
TCAGCCACAG GCAAACCCTT ATTGGCCGAC GACCCGCACT TGAGCTTTCG CACGCCCTCA
ATTTGGTATA TGGCCGAGGT CGAAGGTGGC GATCTGCACT CGGTTGGTGC GACGATTCCC
GGGTTGCCTG GGGTGATTGT TGGCCACAAC CAACGGATTG CGTGGGGCGT GACCAATACT
GGCCCTGACG TTCAAGATTT GTATCGCGAA ACCCTCGACC CAACTGGCAC TCAAGCCATG
TTCAAGGGCA GTTACGAACC ACTGACGATT ATTAGCGATA CGATTCGGGT CAAAGATAAA
GGCAATCTGC CGCTGACGAT TCGGGTTAGT CGCCATGGGC CATTGATTTC TGATGCGCTG
AATGCCAATA ACGCCGACGA CCCCGATGCG CCGCAACGTG AAGCCTTGGC CTTTCGCTGG
ACAGCGCTCG ACCAAACTGA TAACACCATC AATGCCTATT TGGCGATTAA CAAAGCCCAA
AATTGGCAAG AATTTCAGGC TGGTTTAGAA AGTTATGTTG CGCCAGTTCA GAATTTTGTC
TTTGCTGATG TTGATGGCAA TATTGGCTAT ATGGCACCTG GCCATATTCC AATTCGTGCC
AATGGTGATG GCACAATGCC GGCTGATGGG GCTTCGGGCG ACTACGAATG GACGGGCTTT
ATTCCATTTG AGCAATTGCC CCAAAGCTAT AACCCGCCTC AAGGCTATAT TGCGACGGCG
AATAATAAAG TTGTGGCCGA TAGTTACCCG TATTTTCTCA GCCACGAGTG GGCCACACCA
TTTCGCGCCC AACGCATTAC CAAGTTGATT GAGGCCAAAC CAACGTTAAC TATGGACGAT
ATGGCCGCGA TTCAGGCCGA TGTGCATTCG ACCTATGCCG AGGAATTATT GCCAGTTCTG
CTAAATTTGG TGCAACCAAC CAGCGATCAA CAGCGCCAAG CCATCGCCAT GCTGCAAAAT
TGGAACTACA GCACCGCAGG CGATCAACCA GCCGCCAGCA TTTTCGAAGC TTGGACCTAT
TATTTGACCG TGCCGATGGT TGGCGACGAA TTGGGCGAAC GTTTGCTTGA AACCTATGGT
CAGCGCCGCC AACTGATCGA TTTGGCGATT CCAGCGATGT TGCAAGACCC CAACAACTCT
TGGTGTGACG ACGTTACCAC GACTAGCAGC ACTGAAAACT GCAATGCAAT TGTGACCCAA
GCGCTTGATG TGGCACTCAA AGATTTAAGC TTCCGCATGC AAGATAAACC GATGGAGCAA
TGGCGTTGGG GCACAATTCA CCTTGCCTTG TTCCCGCATA ACCCGCTTGA TGCGATTGGG
CCGTTGCGGG GCTTTTTCAG CAAAGCAATT GAAAGTGCTG GCGATGGTAG CACGGTTAAT
GTTGGCCATG TTGCCGATGG CGAGCCATTT GATCAAGATC GTGGGCCAAT TTATCGGCAT
ATTGTTGATT TAGGCGATTT TGCCAATAGC CGCATGATCA ATGCACCTGG CCAAGCAGGC
CATTTTCTAT CGCCGCACTA CGATGATCTG CTCGAACGCT GGCAAAAAGT CGAATACATT
CCTATGACCT ATGGCCGCGC CGCAGTCAGC GCTGGCGAGG TGGAGATGTT ACAATTACAA
CCCTAG
 
Protein sequence
MATPAKRSLR DRPPLQRWFI LFLRFLLIVV LLLLLASGGG YLYLRRSLPT TAGTLTLAGL 
AAPVDVLRDQ YGVPHIYAQS ETDALMALGY VHAQDRLWQM EFQRRIARGT LSEALGETTV
ETDRFLRTLG VYRAAESAYV ALDPAAKTIV DAYVSGINAF LAEHSGGELS PEFTLVGVEP
EPWVGADVLG WAKMMSWDLG GNYSTELLTM ELVAKLGSEK TKDLVPHYPS DGPLIVPNPS
GGSQAEQLLA LSRRVEQGLG IGGGNIAGVG SNNWVIGPSK SATGKPLLAD DPHLSFRTPS
IWYMAEVEGG DLHSVGATIP GLPGVIVGHN QRIAWGVTNT GPDVQDLYRE TLDPTGTQAM
FKGSYEPLTI ISDTIRVKDK GNLPLTIRVS RHGPLISDAL NANNADDPDA PQREALAFRW
TALDQTDNTI NAYLAINKAQ NWQEFQAGLE SYVAPVQNFV FADVDGNIGY MAPGHIPIRA
NGDGTMPADG ASGDYEWTGF IPFEQLPQSY NPPQGYIATA NNKVVADSYP YFLSHEWATP
FRAQRITKLI EAKPTLTMDD MAAIQADVHS TYAEELLPVL LNLVQPTSDQ QRQAIAMLQN
WNYSTAGDQP AASIFEAWTY YLTVPMVGDE LGERLLETYG QRRQLIDLAI PAMLQDPNNS
WCDDVTTTSS TENCNAIVTQ ALDVALKDLS FRMQDKPMEQ WRWGTIHLAL FPHNPLDAIG
PLRGFFSKAI ESAGDGSTVN VGHVADGEPF DQDRGPIYRH IVDLGDFANS RMINAPGQAG
HFLSPHYDDL LERWQKVEYI PMTYGRAAVS AGEVEMLQLQ P