Gene Avin_19340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_19340 
Symbol 
ID7760868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1929294 
End bp1931813 
Gene Length2520 bp 
Protein Length839 aa 
Translation table11 
GC content67% 
IMG OID643804836 
ProductPeptidase S45, penicillin amidase family 
Protein accessionYP_002799120 
Protein GI226944047 
COG category[R] General function prediction only 
COG ID[COG2366] Protein related to penicillin acylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTGC TTCGCCATCT GTTTCGACTG AGCCTCGGAG CCGCTTCCCT GCTCGTACTC 
GCCGGCTGCC AGTCGGTGGC CGACAGGCGC CATGCGGACA GCCTGCCACC CACACAGGGC
GTGCTGCGCC TCCAGGGACT GGCGGACAGC GTGGTGGTAC GACGCAACGC CCTCGGCATG
CCGCTGATCG AAACCCGCAA CGAGCACGAT GCCCTGTTCG CCCTCGGCTA CGTGCATGCC
AGCGACCGGC TCAGCCAGAT GGTCGGCCTG CGCCTGATGG CCGAAGGAAA ACTGGCCGAG
ATGCTCGGCC CGGACGTCCT CGATATCGAC CGCTTCATGC GTGCGATGGG CCTCCGTCGT
CATGCCGAGA CGCTGTACCG CAACGCCTCG CCCGGCCTGA GGCAGGACTT CGAAATCTAT
GCCCGCGGGG TCAACGCCTT CCTGTTCCGT CACCGCGACC GGCTGCCCAT GGATCTGGCC
GAGTCAGGTT ACCAGCCATC GTACTGGAAG GCGGAAGATT CGGTGCTGCT GTTCTGCCTG
CTGAACTTCG GTCTGTCGAC CAACCTGCAG GAGGAGATCG CCGCGCTCAG GCTGGCGCAG
AAGGTCGGTA CGGAGCGGCT CGTCTGGCTG ACGCCGATCT ACCCGGACGA ACCCCTGCCG
TTCGCGGAGG CGGACAAGCT CAAGGGTGTC GAGCTGGGCG GGCAGATCCC GGGGCTCGCC
GCCCTTGACC GGGCGGCCGG ACAAGTCGCC GCCCTGAACA TGCTCGGTGT CGCCGCCTCC
AACAATTGGG CGATCGCGCC GCGCGGCAGC CGCCTCGGCA AGAGTCTGCT GGCCAACGAC
ACCCACCTGC CGCCCGCCCT GCCTTCGCTG TGGAATTTCG TCCATGTCCG CACGCCGCAG
CGCCAGAGCG CAGGCGTCTC GGTCGCCGGC ATTCCGGTGG TGATCGCCGG CTTCAACGGC
AAGCTGGCCT GGGGCATGAC CATGGTCATG GCCGATACCC AAGACCTTTT CCTCGAGCGG
ATCAAGCACG AGGACGGACG CCTCCATTAC CTCGCCGACG GCAAATGGCT GCCGGTCAGC
GAGCGCCAGG AAACCTTCCT CGTCAAGGGC GCGCCAGCGA TCCGCGAAAC CCTCTACGAA
ACCCGTCACG GCCCACTGCT GAACAGCGTA CTCGGCGAGC GCAAGCATCC GCTGCAACCC
CTGTCGCTGA GCAGCGGCCA TGGGCTGGCG CTGCAGAGGC TCCCGCTCGA CGATGACCGT
AGCCTGGACG CCTTCCTCCA GCTCTCCCAC GCCCGCTCGG TGGACGAAGC CTTCACGGCT
GCCGGCGAAC TGCGCGCCAT GGCGGTGAAT CTGCTGTTCG CCGATGCCGG ACACATCGGC
TGGCAGGTGA CCGGACGCTA CCCCAATCGC CGCGGCGGCC TCGGCCTACT GCCTTCCCCC
GGCTGGGATG GCGCCTACGA CTGGGACGGC TTCGCCGATC CCATGCTGCA CCCCTACGAT
CAGGACCCGG CGCAGGGCTG GCTGGGCACC GCCAACCAGC GCACGGTACC GCGCGGCTAT
GGCATGCAAC TGTCCAGCAC CTGGTATTAC CCGGAGCGCG CCGAGCGTAT CGCCGAACTG
GCCGGTCGTG GCCGACACGA TGCGCAGAGC ATGATCGCCA TGCAGTACGA CCAGACCACC
CCCTTCGCCG CCAAGCTCAA GGCCATGTTC GAGGCGCCGG GCATGGCCGA GCCGCTGCGC
CAGGCGATCG ACGCCCTGCC GTCGACCGAA CGCGACCGGG CCCGCGAAGC CTTGAGCCGA
TTGCTCGCCT TCGACGGCCG GCTCGCCGCC GGCTCGGCGG ATGCCGCCCT GTATGGCGCC
TTCCTGCAGG AAAGCGCGCG GCAGACATTC CTCGACGAAC TGGGCCCGGA CGACAGCCCC
GCCTGGCAGG CCCTGCTGCA GATGTCCGGC CTGTCCTACT CGGCCCAGGC CGACCACCTG
CTCGGCCGCG AGGACAGTCC GTTCTGGGAC GACATCGCCA CCGCGCGGAC CGAAGACAAG
CCAGCCATCC TCGCACGCAG CCTGGCGGCC GCAGTACGAC ATCTGGAAAG CCGGCTCGGC
GACGATCGAA ACGCCTGGCA ATGGGGCAAG CTGCATCGCT ACCGGTGGAT CAGCGACAGT
ACCCGCCTGG CCCCCTACCT CGATGCCCGT CAGCGTACCG CCATCCAGGC GCTCGACGGC
TACCTGAACC ACGCCCCTGC CCCGGCCGGC GGCGATCATG GCACGTTGAA CGTTTCCGCC
TATCGCTGGG GACAGAACTT CGATGCCCAG TTGATTCCCG CCATGCGCAT CGTCGTCGAT
TTCGCACGCG AAGAACCCAT GCTCGGGTTG AACGGCACCG GCCAGTCCGG CAATCCGGCC
AGCCCGCATT ACAAGGACGG TATCGATGCC TGGCTCGATG GGCGCTACAT GAGCTTTCCT
TTCAAGGCGG AGAACCTGGA CAAGACCTAC GGCAACCAGC GCCTGCTGCT GGTGCCGTAG
 
Protein sequence
MRLLRHLFRL SLGAASLLVL AGCQSVADRR HADSLPPTQG VLRLQGLADS VVVRRNALGM 
PLIETRNEHD ALFALGYVHA SDRLSQMVGL RLMAEGKLAE MLGPDVLDID RFMRAMGLRR
HAETLYRNAS PGLRQDFEIY ARGVNAFLFR HRDRLPMDLA ESGYQPSYWK AEDSVLLFCL
LNFGLSTNLQ EEIAALRLAQ KVGTERLVWL TPIYPDEPLP FAEADKLKGV ELGGQIPGLA
ALDRAAGQVA ALNMLGVAAS NNWAIAPRGS RLGKSLLAND THLPPALPSL WNFVHVRTPQ
RQSAGVSVAG IPVVIAGFNG KLAWGMTMVM ADTQDLFLER IKHEDGRLHY LADGKWLPVS
ERQETFLVKG APAIRETLYE TRHGPLLNSV LGERKHPLQP LSLSSGHGLA LQRLPLDDDR
SLDAFLQLSH ARSVDEAFTA AGELRAMAVN LLFADAGHIG WQVTGRYPNR RGGLGLLPSP
GWDGAYDWDG FADPMLHPYD QDPAQGWLGT ANQRTVPRGY GMQLSSTWYY PERAERIAEL
AGRGRHDAQS MIAMQYDQTT PFAAKLKAMF EAPGMAEPLR QAIDALPSTE RDRAREALSR
LLAFDGRLAA GSADAALYGA FLQESARQTF LDELGPDDSP AWQALLQMSG LSYSAQADHL
LGREDSPFWD DIATARTEDK PAILARSLAA AVRHLESRLG DDRNAWQWGK LHRYRWISDS
TRLAPYLDAR QRTAIQALDG YLNHAPAPAG GDHGTLNVSA YRWGQNFDAQ LIPAMRIVVD
FAREEPMLGL NGTGQSGNPA SPHYKDGIDA WLDGRYMSFP FKAENLDKTY GNQRLLLVP