Gene Avin_01090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_01090 
Symbol 
ID7759076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp109811 
End bp111856 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content69% 
IMG OID643803035 
Productoligopeptidase A 
Protein accessionYP_002797351 
Protein GI226942278 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGCCA GCAATCCCCT CCTGCAAGCT TTCGACCTGC CGCCCTATTC CGCCATCCGT 
CCGGAACACG TCGAGCCGGC CATCGACCGG ATCCTCGCCG ACAACCGCGC CGCCATCGCG
CAGACCCTCG CCGATCAGGC CGACACGCCG AGCTGGGACG GCCTGGTGCT GGCGCTCGAC
GAACTGGGCG AACGCCTCGG CCGGGCCTGG AGTCCGGTCA GCCATCTCAA TGCGGTGTGC
AACAGCCCCG AGCTGCGCGC CGCCTACGAG GCCTGCCTGC CCAAGCTGTC CGCCTACTGG
ACCGAGATGG GCCAGAACCG CGCCCTCTGC GACGCCTACA AGGCCCTGGC CGCCAGCCCC
GCGACCGCCG GCTTCGACGT GGCGCAGAAG ACCGTCCTCG AACACACCCT GCGCGACTTC
CACCTGTCCG GCATCGATCT GCCGGAGGAG CAGCAGAAGC GCTACGGCGA GATCCAGATG
CGTCTGTCCG AGCTGACCAG CCGCTTCTCC AACCAGTTGC TCGACGCCAC CCAGGCCTGG
ACCAAGCACG TCACCGACGA GTCCGCGCTG GCCGGCCTGA CCGACTCGGC CAGGGCGCAG
ATGGCCCAGG CCGCCCAGGC CAAGGGGCTC GACGGCTGGC TGATCAGCCT GGAGTTCCCC
AGCTACTACG CGGTGATGAC CTACGCCGAC GACCGCGCCC TGCGCGAGGA GCTCTACGCC
GCCTACTGCA CCCGCGCCTC CGACCAGGGG CCGAACGCCG GGCAATTCGA CAACGGCCCA
CTGATGGAGC AGATCCTCGA CCTGCGCCGC GAACTGGCCC GGTTGCTCGG CTACCCGAAC
TACGCCGAAC TGTCGCTGGC GACCAAGATG GCCGACTCCG GCGAGCAGGT GCTCGGTTTC
CTGCGCGATC TGGCCGCGCG CAGCCGGCCG TTCGCCGAGA AGGACCTGGC CGAGCTGCGC
GCCTTCGCCG CCGAACAGGG CTGCGGCGAT CTGCAGAGCT GGGACGTGGG CTACTACAGC
GAGAAGCTGC GCCAGGCCCG CTACAGCATT TCCCAGGAGC AGTTGCGCGC CTACTTCCCG
ATCGACAAGG TGCTCGGCGG CCTGTTCGCC ATGGTCCAGC GCCTCTACGG CATCGAGATC
CGCGAGCTTG CGGATTTCGA TAGCTGGCAC CCGGACGTGC GCCTGTTCGA GATTCGCGAG
AACGGCGAGC ACGTCGGGCG CTTCTTCTTC GACCTCTACG CGCGGGCCAA CAAGCGCGGC
GGCGCCTGGA TGGACGGCGC CCGCGACAAG CGCCGCAACA CCGCCGGCGA GCTGGTCAGC
CCGGTGGCCA ACCTGGTGTG CAATTTCACC CCGGCGGTGG GCGGCAAGCC GGCGCTGCTC
ACCCACGACG AAGTCACCAC CCTGTTCCAC GAGTTCGGCC ACGGCCTGCA CCACCTGCTG
ACGCGCATCG AGCACGCCGG CGCCTCCGGC ATCAGCGGGG TGCCCTGGGA CGCGGTCGAG
CTGCCCAGCC AGTTCATGGA GAACTGGTGC TGGGAGCCGG AAGGCCTGGC GCTGATCTCC
GGCCACTACC AGACCGGCGA GCCCCTGCCC CAGGACCTGC TGGAGAAGAT GCTGGCGGCG
AAGAACTTCC AGTCCGGCAT GATGATGGCG CGCCAACTGG AGTTCTCGCT GTTCGACTTC
GAGCTGCACG TCCACCATGG CGACGGGCGC GGCGTGCTCG AGGTGCTCAA GGGCATCCGC
GACGAGGTTT CGGTGATGCA GCCGCCGGCC TACAACCGCT TCCCCAACAG CTTCTCGCAC
ATCTTCGCCG GCGGCTACGC GGCCGGCTAC TACAGCTACA AGTGGGCGGA AGTGCTCTCC
GCCGACGCCT TCTCCCGCTT CGAGGAGGAC GGCGTGTTCA ACCCCGTCAC CGGCCAGGCC
TTCCGCGAGG CGATCCTGGC CCGTGGCGGC TCGCAGGACC CCATGGTGCT GTTCGTCGAC
TTCCGCGGCC GCGAGCCGTC CATCGACGCC CTGCTCCGTC ACCTTGGCCT GAGCGCGGCG
GCCTGA
 
Protein sequence
MTASNPLLQA FDLPPYSAIR PEHVEPAIDR ILADNRAAIA QTLADQADTP SWDGLVLALD 
ELGERLGRAW SPVSHLNAVC NSPELRAAYE ACLPKLSAYW TEMGQNRALC DAYKALAASP
ATAGFDVAQK TVLEHTLRDF HLSGIDLPEE QQKRYGEIQM RLSELTSRFS NQLLDATQAW
TKHVTDESAL AGLTDSARAQ MAQAAQAKGL DGWLISLEFP SYYAVMTYAD DRALREELYA
AYCTRASDQG PNAGQFDNGP LMEQILDLRR ELARLLGYPN YAELSLATKM ADSGEQVLGF
LRDLAARSRP FAEKDLAELR AFAAEQGCGD LQSWDVGYYS EKLRQARYSI SQEQLRAYFP
IDKVLGGLFA MVQRLYGIEI RELADFDSWH PDVRLFEIRE NGEHVGRFFF DLYARANKRG
GAWMDGARDK RRNTAGELVS PVANLVCNFT PAVGGKPALL THDEVTTLFH EFGHGLHHLL
TRIEHAGASG ISGVPWDAVE LPSQFMENWC WEPEGLALIS GHYQTGEPLP QDLLEKMLAA
KNFQSGMMMA RQLEFSLFDF ELHVHHGDGR GVLEVLKGIR DEVSVMQPPA YNRFPNSFSH
IFAGGYAAGY YSYKWAEVLS ADAFSRFEED GVFNPVTGQA FREAILARGG SQDPMVLFVD
FRGREPSIDA LLRHLGLSAA A