Gene Pars_1588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1588 
Symbol 
ID5055090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1437232 
End bp1439313 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content53% 
IMG OID640469129 
Productpenicillin amidase 
Protein accessionYP_001153794 
Protein GI145591792 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0427638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGGT GGATTAGTCT AGTTATAGTT CTAGTAGCTG TTTTTGCCAT GTCCCTAATT 
CCTCCCTATT TTTTCAGACT TGCCGACCCG TTTCAAGGCC CTATGAAGGC AGTTTCGCGG
CCTCCGCCTC CAGCGCCGCT TGAGAACTAT GTACTTGTTA TCAAGGCAGA TAGCGAAAAG
GAGGCGATGT ATAAGTTGGG GTATGCCCAT GCGTACTACC GCTTCTTTCA AATGGACGTT
ATGAGGAGAG TTGTCCAGGG TCGGCTTTCT GAACTATTAG GCGAGGCGGC TCTTGACACG
GACATCTACT TCAGAACGAG GGGCTTGTAC ATAGCGGCGG AGAAGACGTG GGCATACATC
AAGGAGAGGT ATCCCCAATT CGCTGATCTA CTTGAGGAAT ACACGAGGGG CGTCAACGAC
TTCTTAGCAA GCAACCCGCC CATCGTGGAA TATCTCATAC TTGGAAAAAG ACCTGAGCCT
TGGTCGCCCG TGGATAGCAT TGCCATCGGG AAACTCATCG CGTGGTCCCT AAGCGGCGGA
GAAGACGATC TTTACCTAAA AGAACTGGTT GACGCTCTGG GACCTGAGTT CCTCAAAATA
GCGCTTGTCA GGGCGGATAA CACCCCCATA CTTCCTCAAA AGGCGACTTT CTCGCCTCTG
GGTAGCAACA ACTGGGTGAT ATCGCCTGGC CTCACAGCCA CTGGAAAACC GATATTGGCA
AACGACCCCC ACCTCTCCCT ATCGGCTCCC CCAATTTGGA TATTCCAGAA GGTGGAAACT
CCCGGCTACA CAGTTATGGG AGTTGCCTTC CCGGGCGTTC CTGTTGTTGT TATTGGGACA
AACGGCTACG TTGCTTGGGG TTTTACAAAC ACTGGGGTAG ACGTCATTGA CTACTATTAT
TATGTATGGA GCGGGGACAG GTACTACCAC AACGGCACGT GGCTAGAAGC ACAGCGGCGG
GTTGAGAAGT TCCGAGTCTG CGACCTAAAC AACAACTGCG CGGTTCGATC TATCGACGTC
TTAGAGACAA TGCACGGGCC TGTGGTGGAT TTCAGAGGGC AGAGATACGC TATGAGGTGG
CTAGGGAACA ACGTGACTTT GGAGGCCATA GCGCTGTACA TGATGAACCG GGCTAGGAAC
CTCACAGAGT TCCTAAACGG GCTTAGGTAT TTCGCAACGC CGAGCCAAAA CACCGTCTAC
GCAGATAGAT ACGGCGTTGT CGCCTACTTT GCCAGCGGCT ACTACCCGAT TAGAGATGGG
GGCTACCTCC CCTTCAACGG CTCGAGAGGC GAGGGGGAGT GGAGAAGGCT TGTGTGGCTA
CCTGATGTGC TTAGGTACAT AAACCCGCCG TATTTAGCCA CTGCGAACAA CAAGGTGGCC
GACGCCAATA TTTACCTACA GTGGAGGTGG GCTGACAGGT ACCGCCACGA CAGGATCACA
GAGCTAATAG CGCAGAAAAT AGCCGGGGGG AGGATCTCCG CTAGCGATGT CATGGACATC
CAGCGCGACG TTGTGGATAT CTCGTGCAGA GACGCAAAGG CTTTGCTTAG ACTCTACGGA
GGAGTAGCGG GGAAAAAGCT CCTCGAGGAG CTGGATGGTT GGGACTGCGC CATGGCGCCT
AGCTCTGTAA CCGCGGCTAG GTACGCCATG TTTGTGTACA CCCTGCAAAA ATTGTCATGG
GAGAGGTTCA ATGTCTCCAT TACCTTTGTC CCATTTGAGG TGACGCTTGC CTCGATTGGA
CGCGGCCTTA TAGATAGGGC TATTGTGGAA AAGGCGGCTG AGGAGGCTTT GAAAATTAAC
TCGCCGTGGG GCTCTATACA CAAGTACAAC ATCCAGCACG TGCTCAGCTC CGTCTTCCCG
TCGCTTAATT ATAGGAGCGT CGAGGCTCCA GGCGATTGGT TCACGGTAAA CGTGGCGCCG
GGATTCGCCG TCTCTCATGG ACCGAGCGTG AGGTTCATAG TGGCTTTCGG CGATGGCGTC
TACATGATGC TCCCCGGGGG GCCCGATGGC GATCCCTTAA GCCCACTATA CGACGCCATG
TACATTCCTT GGGTGAGGGG GGAGTATGTC AAGGTGGGCT AA
 
Protein sequence
MIRWISLVIV LVAVFAMSLI PPYFFRLADP FQGPMKAVSR PPPPAPLENY VLVIKADSEK 
EAMYKLGYAH AYYRFFQMDV MRRVVQGRLS ELLGEAALDT DIYFRTRGLY IAAEKTWAYI
KERYPQFADL LEEYTRGVND FLASNPPIVE YLILGKRPEP WSPVDSIAIG KLIAWSLSGG
EDDLYLKELV DALGPEFLKI ALVRADNTPI LPQKATFSPL GSNNWVISPG LTATGKPILA
NDPHLSLSAP PIWIFQKVET PGYTVMGVAF PGVPVVVIGT NGYVAWGFTN TGVDVIDYYY
YVWSGDRYYH NGTWLEAQRR VEKFRVCDLN NNCAVRSIDV LETMHGPVVD FRGQRYAMRW
LGNNVTLEAI ALYMMNRARN LTEFLNGLRY FATPSQNTVY ADRYGVVAYF ASGYYPIRDG
GYLPFNGSRG EGEWRRLVWL PDVLRYINPP YLATANNKVA DANIYLQWRW ADRYRHDRIT
ELIAQKIAGG RISASDVMDI QRDVVDISCR DAKALLRLYG GVAGKKLLEE LDGWDCAMAP
SSVTAARYAM FVYTLQKLSW ERFNVSITFV PFEVTLASIG RGLIDRAIVE KAAEEALKIN
SPWGSIHKYN IQHVLSSVFP SLNYRSVEAP GDWFTVNVAP GFAVSHGPSV RFIVAFGDGV
YMMLPGGPDG DPLSPLYDAM YIPWVRGEYV KVG