Gene Avin_23590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_23590 
SymbolpepS16 
ID7761275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2362163 
End bp2364556 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content55% 
IMG OID643805244 
ProductPeptidase S16, ATP-dependent protease 
Protein accessionYP_002799522 
Protein GI226944449 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La
[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATGA TCGAATTACC TCTTCTGCCG CTGCGTGATG TTGTCGTATA TCCGCACATG 
GTAATCCCGC TGTTCGTGGG GCGCGAAAAG TCCATCGAAG CCCTTGAGTC TGCTATGAGC
GGAGACAAGC AGATCCTCTT GCTGGCCCAG AAGAACCCGG CAGATGACGA TCCGGGAGAG
GCATCTCTCT ATCGAGTCGG TACCGTTGCA ACGGTCCTGC AGCTGCTGAA GCTTCCCGAC
GGTACCGTCA AGGTTCTGGT CGAAGGAGAG CAGCGAGGGA TCATTGAGCG CTTCATCGAT
GCCGAGGGGC ACAGTCGGGC GCAGTTGTCC TTGGTCGAGG AGGCTTCGAT CACCGAGCGA
GAGGGCGAGG TCTTCATTCG TAGCCTGCTC AGTCAGTTCG AGCAGTATGT CCAGCTTGGC
AAGAAAGTGC CTGCCGAAGT CCTGTCGTCA TTGAATAGCA TCGATGAGCC GGGGCGGCTG
GTCGACACCA TGGCTGCACA CATGGCTCTC AAGCTTGAGC AGAAGCAGGA AATCTTGGAG
ATCGCCGATC TTTCGGCCCG TGTCGAGCAC GTTCTGGCGC TTCTGGATGC TGAAATCGAT
CTGCTGCAGG TCGAGAAGCG TATCCGGGGG CGAGTGAAGA AGCAGATGGA GCGCAGTCAG
CGCGAGTACT ATCTGAACGA GCAAATGAAA GCCATTCAGA AGGAGCTCGG CGATATCGAT
GAAGGTCACA ATGAAATCGA TGACTTGAAG CAGCGCATCG AGACTGCCGG TTTGACGAAG
GAGGCGCTGG CCAAAGCCCA GGCCGAGTTG AATAAGCTGA AGCAGATGTC TCCGATGTCG
GCGGAAGCTA CTGTCGTCCG TTCCTATATC GACTGGCTGG TCAATGTGCC CTGGAAAGCC
GAGAGCAAGG TTCGTCTGGA TCTGTCCAAG GCCGAGACCA TTCTCGATAC GGATCATTAT
GGTTTGGAGG AGGTTAAGGA ACGCATCCTC GAATATCTTG CCGTGCAGAA ACGTGTGAAG
AAGCTCAAGG GGCCTGTGCT CTGCTTGGTC GGGCCTCCAG GCGTTGGCAA GACTTCCCTG
GCCGAGTCCA TTGCGCGGGC TACCAATCGC AAATTTGTGC GCATGGCGCT GGGCGGTGTC
CGCGATGAAG CCGAAATCCG CGGTCATCGC CGTACCTATA TTGGCTCGAT GCCTGGCCGG
CTGATCCAGA AAATGACCAA GGTGGGGGTT CGCAATCCAC TGTTTCTCCT CGATGAAATC
GACAAGATGG GCAGCGACAT GCGGGGAGAT CCTGCTTCTG CGCTGCTGGA GGTGCTGGAT
CCGGAGCAGA ACCACAACTT CAACGATCAT TACCTGGAGG TTGACTATGA CCTCTCGGAT
GTGATGTTCG TCTGCACCGC CAACTCCATG AACATTCCGG CGCCGCTCTT GGATCGTATG
GAGGTGATCC GGCTTCCTGG ATATACCGAA GACGAGAAAG TCAATATCGT CGTCAAATAT
TTGGCTCCCA AGCAAACCCA GGCCAATGGC TTGAAAAAAG GGGAACTGGA GTTCGAGGAG
GCAGCAATCC GCGACATGGT CCGCTACTAT ACCCGTGAGG CGGGAGTTCG TAGCCTCGAG
CGGCAAATCG CCAAGGTCTG TCGCAAAGTG GTCAAGGAAA ATGCCAAGGA AAAACATTTC
AAGGTTACGG TAACGGCCGA CTCGCTGGAG CATTTTCTGG GCGTCCGCAA ATATCGTTAT
GGGTTGGCTG AGCAGCAGGA TCAGATCGGT CAAGTCACGG GACTGGCTTG GACCCAGGTC
GGGGGTGAGT TGTTGACCAT CGAAGCGGCT GTCGTACCGG GCAAAGGGCA ACTGATCAAA
ACCGGATCGC TTGGTGATGT CATGGTCGAA TCCATTACGG CGGCATTGAC CGTAGTGCGA
AGCCGGGCGA AAAGTCTCGG GATTCCGCTG GATTTCCACG AGAAGCGGGA TATCCATATC
CACATGCCGG AAGGGGCGAC GCCCAAGGAT GGTCCCAGTG CGGGAGTGGG GATGTGTACT
GCCTTGGTTT CCGCGTTGAC ACAAATTCCC GTGAGGGCTG ATGTGGCAAT GACCGGGGAA
ATAACGCTGC GTGGTCAAGT ACTGGCCATT GGTGGATTAA AGGAAAAACT TCTGGCGGCT
CATCGGGGAG GTATCAAGAT CGTTATCATT CCTGAAGAGA ACGTCCGGGA TTTAAAAGAT
ATCCCAGAGA ACATTAAGCA GGACTTGCAG ATCAAACCGG TCAAATGGAT TGACGAGGTC
CTGCAGATTG CGCTGCAATA CGCCCCAGAG CCCTTGCCTG ATACGGCCCC GGAGATGGTC
GCAAAGGATG AGAAGCGCGA AACTGATCCC AAGGAGAGAA TCAGTACGCA CTAG
 
Protein sequence
MNMIELPLLP LRDVVVYPHM VIPLFVGREK SIEALESAMS GDKQILLLAQ KNPADDDPGE 
ASLYRVGTVA TVLQLLKLPD GTVKVLVEGE QRGIIERFID AEGHSRAQLS LVEEASITER
EGEVFIRSLL SQFEQYVQLG KKVPAEVLSS LNSIDEPGRL VDTMAAHMAL KLEQKQEILE
IADLSARVEH VLALLDAEID LLQVEKRIRG RVKKQMERSQ REYYLNEQMK AIQKELGDID
EGHNEIDDLK QRIETAGLTK EALAKAQAEL NKLKQMSPMS AEATVVRSYI DWLVNVPWKA
ESKVRLDLSK AETILDTDHY GLEEVKERIL EYLAVQKRVK KLKGPVLCLV GPPGVGKTSL
AESIARATNR KFVRMALGGV RDEAEIRGHR RTYIGSMPGR LIQKMTKVGV RNPLFLLDEI
DKMGSDMRGD PASALLEVLD PEQNHNFNDH YLEVDYDLSD VMFVCTANSM NIPAPLLDRM
EVIRLPGYTE DEKVNIVVKY LAPKQTQANG LKKGELEFEE AAIRDMVRYY TREAGVRSLE
RQIAKVCRKV VKENAKEKHF KVTVTADSLE HFLGVRKYRY GLAEQQDQIG QVTGLAWTQV
GGELLTIEAA VVPGKGQLIK TGSLGDVMVE SITAALTVVR SRAKSLGIPL DFHEKRDIHI
HMPEGATPKD GPSAGVGMCT ALVSALTQIP VRADVAMTGE ITLRGQVLAI GGLKEKLLAA
HRGGIKIVII PEENVRDLKD IPENIKQDLQ IKPVKWIDEV LQIALQYAPE PLPDTAPEMV
AKDEKRETDP KERISTH