Gene Avin_13920 details

Gene Information

Locus tag: Avin_13920
Symbol: lon
ID: 7760329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1352518
End bp: 1354920
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table: 11
GC content: 66%
IMG OID: 643804285
Product: ATP-dependent protease La
Protein accession: YP_002798584
Protein GI: 226943511
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
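
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single untranslated stop codon. The function names `cds_span` and `expected_protein_length` are illustrative only and are not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record.
span = cds_span(1352518, 1354920)
print(span)                           # 2403, matching "Gene Length"
print(expected_protein_length(span))  # 800, matching "Protein Length"
```

The two results agree with the Gene Length (2403 bp) and Protein Length (800 aa) fields: 801 codons, of which the last is the stop codon.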


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.235434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACC AGGAAATCGC GCCCGAGATC GTCGACGAAA AAACCGCGGC CAACAGTCTC 
GTCCTGCCCG AGCAGACTCT GCCCGAACAG GTGTACGTCA TTCCGATCCA CAACCGCCCG
TTCTTCCCGG CCCAGGTGCT ACCGGTGGTG GTCAACCCCG ATCCCTGGGC GGAAACCCTC
AAACGCGTGG TCAAGACACC GCAGCACAGC CTGGCGCTGT TCTACATGGA CCCGCCGCCG
GAGGATGCCG AGGACTTCGA CCCGGACAAG CTGCCCGAGC ACGGCACCCT GGTGCGCGTG
CACCACGCTA GCCAGGAAGG TGGCAAGCTG CAGTTCGTCG CCCAGGGGCT GGCGCGGGTG
CGCATCCGTG GCTGGCTGAG CCGCAAGCCG CCCTATCTGG TCGAGGTCGA TTATCCGAAG
AGCGCCCAGG ACCCGCGCGA CGAGGTCAAG GCCTACGGCA TGGCGCTGAT CAACGCGATC
AAGGAACTGC TGCCGCTCAA TCCCCTGTAC AGCGAAGAGC TGAAGAACTA CCTGAACCGC
TTCAGCCCCA ACGAGCCCTC GCCCCTGACC GACTTCGCCG CCGCGCTGAC CACCGCGCCA
AGCACCGAGC TGCAGGAAGT GCTGGATACC GTGCCGGTGC TCAAGCGCAT GGAAAAGGTC
CTGCCGCTGC TGCGCAAGGA AGTCGAGGTC GCGCGCCTGC AGAACGAGCT GTCCGCCGAA
GTCAACCGCC AGATCGGCGA GCGCCAGCGC GAGTTCTTCC TCAAGGAGCA GTTGAAGATC
ATCCAGCGCG AGCTGGGCAT CACCAAGGAC GACAAGAGCG CCGACGCCGA CGAGTTCCGC
GCCCGCCTGG AAGGCAAGGT CGTGCCGGCG GCGGCGCGCA AGCGCATCGA CGAGGAACTC
AACAAACTGT CGATCCTCGA AACCGGCTCG CCGGAATACG CGGTCACCCG CAACTACCTG
GACTGGGCGA CCAGCATCCC CTGGGGCGTC TACGGCAAGG ACCGGCTCGA CCTGGCCCAT
GCCCGCAAGG TGCTCGACAA GCACCACGCC GGAATGGACG ACATCAAGGC GCGGATCACC
GAGTTCCTCG CCGTCGGCGC CTTCAAGGGC GAGATCGCCG GCTCCATCGT GCTGCTGGTC
GGTCCGCCCG GCGTCGGCAA GACCAGCATC GGCAAATCGA TCGCCGAATC CCTCGGCCGG
CCCTTCTATC GCTTCAGCGT CGGCGGCATG CGCGACGAGG CGGAGATCAA GGGCCACCGG
CGCACCTATA TCGGCGCGCT GCCGGGCAAG CTCGTGCAGG CCTTGAAGGA CGTCGAGGTG
ATGAACCCGG TGATCATGCT CGACGAGATC GACAAGCTGG GCGCCAGCCA TCACGGCGAT
CCGGCCTCGG CGCTGCTGGA AACCCTCGAT CCCGAGCAGA ACGCAGCCTT CCTCGACCAC
TATCTGGACC TGCGCCTGGA CCTGTCCAAG GTGCTGTTCG TCTGCACGGC GAACACCCTG
GATTCGATCC CCGGCCCCCT GCTCGATCGC ATGGAGGTGA TCCGTCTGTC CGGCTACATC
GCCGAGGAGA AGTTCGCCAT CGCCAAGCGT CATCTGTGGC CGCGCCAGCT GGAAAAGGCC
GGGGTGCCGA AGAACCGCCT GTCGATCAGC GACAGCGCGC TGAAGGCGGT GATCGAGGGT
TACGCCCGCG AGGCCGGCGT GCGCCAGTTG GAGAAACAAC TGGGCAAAAT CGTGCGCAAG
GCGGTGGTCA GGCTGCTGGA AGCCCCCGAA GCCAGGCTGA AGGTCGGCCC CAGGGATCTC
GAGGACTATT TGGGCATGCC GCCGTTCCGC AAGGAGCGGC GCCTGGAGGG CGTCGGCATC
ATCACCGGCC TGGCCTGGAC CAGCATGGGC GGCGCCACCC TGCCGATCGA GGCGACGCGC
ATTCACACCC TGAACCGTGG CTTCAAGCTG ACCGGCAAGC TCGGCGAGGT GATGAAGGAA
TCGGCGGAAA TCGCCTACAG CTACGTCAGT TCGCACCTCA AGCAGTACAA GGGCGACCCG
ACCTTCTTCG ACCAGGCCTT CGTCCACCTG CACGTACCGG AGGGCGCCAC GCCCAAGGAC
GGTCCCAGCG CCGGCATCAG CATGGCCAGC GCCCTGCTGT CGCTGGCCCG CAACCAGGCG
CCGAAGAAGG ACGTGGCGAT GACCGGCGAA CTGACCCTCA CCGGCCAGGT GCTGGCCATC
GGCGGAGTCC GCGAGAAAGT GATCGCCGCC CGCCGGCAGA AGATCTTCGA ACTGGTCCTG
CCGGAGGCCA ATCGCGGCGA TTTCGAGGAA CTGCCGGCCT ACCTCAGGGA AGGCCTCACC
GTGCACTTCG CCAGGACCTT CTCAGACGTG GCCAGAGTGC TGTTCCCCCA CGACAAGCCC
TGA
 
Protein sequence
MNDQEIAPEI VDEKTAANSL VLPEQTLPEQ VYVIPIHNRP FFPAQVLPVV VNPDPWAETL 
KRVVKTPQHS LALFYMDPPP EDAEDFDPDK LPEHGTLVRV HHASQEGGKL QFVAQGLARV
RIRGWLSRKP PYLVEVDYPK SAQDPRDEVK AYGMALINAI KELLPLNPLY SEELKNYLNR
FSPNEPSPLT DFAAALTTAP STELQEVLDT VPVLKRMEKV LPLLRKEVEV ARLQNELSAE
VNRQIGERQR EFFLKEQLKI IQRELGITKD DKSADADEFR ARLEGKVVPA AARKRIDEEL
NKLSILETGS PEYAVTRNYL DWATSIPWGV YGKDRLDLAH ARKVLDKHHA GMDDIKARIT
EFLAVGAFKG EIAGSIVLLV GPPGVGKTSI GKSIAESLGR PFYRFSVGGM RDEAEIKGHR
RTYIGALPGK LVQALKDVEV MNPVIMLDEI DKLGASHHGD PASALLETLD PEQNAAFLDH
YLDLRLDLSK VLFVCTANTL DSIPGPLLDR MEVIRLSGYI AEEKFAIAKR HLWPRQLEKA
GVPKNRLSIS DSALKAVIEG YAREAGVRQL EKQLGKIVRK AVVRLLEAPE ARLKVGPRDL
EDYLGMPPFR KERRLEGVGI ITGLAWTSMG GATLPIEATR IHTLNRGFKL TGKLGEVMKE
SAEIAYSYVS SHLKQYKGDP TFFDQAFVHL HVPEGATPKD GPSAGISMAS ALLSLARNQA
PKKDVAMTGE LTLTGQVLAI GGVREKVIAA RRQKIFELVL PEANRGDFEE LPAYLREGLT
VHFARTFSDV ARVLFPHDKP
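
The sequences above are printed in blocks of 10 characters, so they need the whitespace stripped before use. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs mainly in which codons may act as start codons, and that is not handled here. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and written only for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same
# assignments for elongation (alternative start codons are not modeled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence in this record.
first_line = clean_sequence(
    "ATGAACGACC AGGAAATCGC GCCCGAGATC GTCGACGAAA AAACCGCGGC CAACAGTCTC"
)
print(translate(first_line))  # MNDQEIAPEIVDEKTAANSL
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.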