Gene Avin_40850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_40850 
Symbol 
ID7762970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4120027 
End bp4122447 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content66% 
IMG OID643806944 
ProductATP-dependent protease 
Protein accessionYP_002801195 
Protein GI226946122 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.887392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGACT CCGTTGCTGC CGGCCTGCGC CTGGCACCCG ATGAGCTGAC CCGTCCCTTC 
GATCCCGCCC AGTTCGCTTT CGCCAGTACC GACGAACTGG AGCCCTTTCG CGGCGTGCTC
GGTCAGGAGC GCGCGGTGGA AGCGCTGCAG TTCGGCGTGG CGATGCCACG CCCGGGTTAC
AACGTCTATG TCATGGGCGA GCCGGGTACC GGCCGCTTTT CCTACGTCAA GCGCTATCTA
CAGGCCGAGG CCAAGCGCCT GGAAACTCCG TCGGACTGGG TCTATGTCAA CCACTTCGAC
GAGCCTCGCG AGCCGCGCTC GCTGCAACTC GCGGCGGGCA GCGCCAGCGA GTTCATCGCC
GACATCAATC TGTTGATCGA CAACCTGCTG GCGACCTTCC CGGCGGTGTT CGAGACGCCG
ACCTATCAGC AGAAGAAGAG CGTCATCGAC CGTGGTTTCA ACCAGCGCTA CGACCGCGCC
CTCGATTGCA TCGAACGGCA GGCGCTGGAC AAGGGGATCG CCCTCTATCG CGACAGCGCC
AACATCGCCT TCACCCCGAT GAAGGATGGC AAGACGCTGG ACGAGGGCGA GTTCGCCCTG
TTGCCGGAGG CCGAGCGCGA GCGTTTCCAC GGGGACATCG GCTACCTCGA GGAGCGCCTC
AACGAGGAGT TGTCCAGCCT GCCGCAGTGG AAACGCGAGT CGAGCAACCA GTTGCGCCAG
CTCAACGAGG AGACCATCAC CCTGGCCCTG CAGCCGCTGC TCGCGCCGCT CTCGGAGAAA
TACGCGGAAA ACGCCGGGGT CTGTGCCTAT CTGCAGGCGA TGCAGGTCAA CCTGCTGCGT
ACCGTGGTCG AGCTGCTTAG CGACGAGAAC GCGACGGACG CCCAGCGCCG CGCGCTGCTG
GTCGAGCAGT ACTGCCCGAG CCTGGTGGTC GGCCACCACA GCCAGGGTGG CGCGCCGGTG
GTGTTCGAGC CGCATCCCAT CTACGACAAC CTGTTCGGTC GCATCGAATA CGGCACCGAT
CAGGGGGCGC TCTACACCAG TTATCGCCAG TTGCGTCCGG GCGCGCTGCA CCGTGCCAAC
GGCGGCTTTC TGATCCTCGA GGCGGAACGG CTGCTCGGCG AGCCCTTCGT CTGGGACGCC
CTCAAGCGCG CCCTGCATTC GCGCCAATTG AAGATGGAGT CGCCGCTGGC CGAGATGGGC
CGTCTGGCCA CCGTGACCCT CACCCCGCAG GTCATTCCGC TGCAGCTCAA GGTGGTGATC
GTCGGCTCGC GGCAACTCTA TTACGCGCTG CAGGAGCTGG ATCCCGACTT CCAGGAGATG
TTCCGCGTAT TGGTGGATTT CGACGAGGAG ATCCCTCTTG GCGACGACAG CCTCGAGCAG
TTCGCCCAGT TGCTCAAGAC CCGCACCTCG GAGGAGGGCA TGGCGCCCCT GACCGGTCCG
GCGGTGGCGC GCCTGGCGAC CTACAGCGCG CGTCTGGCCG AGCACCAGGG GCGGCTGTCG
GCGCGCATCG GCGACCTGTT CCAACTGGTC AGCGAGGCCG ATTTCGTTCG CCAGCTGGCC
GGTGAAGCGC TGACCGACGT CGGTCACATC GAGCGCGCCC TGAAGGCCAA GGAAACCCGT
ACCGGACGGG TTTCGGCACG CGTTCTCGAC GACATGCTCG CCGGCATCAT CCTGATCGAC
ACAGAAGGCG CGGCGATCGG CAAGTGCAAC GGGCTGACCG TGCTGGAGAT CGGCGACTCC
GCCTTCGGTG TGCCGGCGCG GATCTCCGCT ACCGTCTATC CGGGCGGTTC GGGGATCGTC
GACATCGAGC GCGAGGTCAA CCTCGGCCAG CCGATCCATT CCAAGGGGGT GATGATCCTC
ACCGGCTATC TCGGCAGCCG CTATGCCCAG GAGTTTCCGC TGGAGATATC GGCGAGCATC
GCCCTGGAGC AGTCCTACGG GTACGTCGAT GGCGACAGCG CCTCCCTCGG CGAGGTCTGC
ACGCTGATCT CGGCGCTGTC GCGTACGCCG CTCAGGCAAT GCTTCGCCAT TACCGGTTCG
ATCAACCAGT TCGGCGAGGT GCAGGCGGTC GGCGGGGTCA ACGAGAAGAT CGAGGGCTTC
TTCCGCCTCT GCGAGGCGCG CGGACTGACC GGCGAGCAGG GGGTGATCAT CCCGCGCGCC
AACGTCACCA ACCTGATGCT CGACGAACGT GTGCTGCAGG CGGTGCGCGC CGGGAACTTC
CATGTCTACG CGGTGCGCCA GGTGGACGAG GCGCTCAGCC TGCTGCTCGG CGAGCCGGCC
GGTACCCCGG ACGGGCAGGG CCGCTTCCCG GCCGGCAGCG TCAATGCGCG GGTGGTCGAG
CGACTGCGGG AAATCGCCGA AATGGGCATG GAAGAGGATT CGGGCAAGCC GAAGGAGCCG
CAGGTCGGCC TGTCGGAGTG A
 
Protein sequence
MPDSVAAGLR LAPDELTRPF DPAQFAFAST DELEPFRGVL GQERAVEALQ FGVAMPRPGY 
NVYVMGEPGT GRFSYVKRYL QAEAKRLETP SDWVYVNHFD EPREPRSLQL AAGSASEFIA
DINLLIDNLL ATFPAVFETP TYQQKKSVID RGFNQRYDRA LDCIERQALD KGIALYRDSA
NIAFTPMKDG KTLDEGEFAL LPEAERERFH GDIGYLEERL NEELSSLPQW KRESSNQLRQ
LNEETITLAL QPLLAPLSEK YAENAGVCAY LQAMQVNLLR TVVELLSDEN ATDAQRRALL
VEQYCPSLVV GHHSQGGAPV VFEPHPIYDN LFGRIEYGTD QGALYTSYRQ LRPGALHRAN
GGFLILEAER LLGEPFVWDA LKRALHSRQL KMESPLAEMG RLATVTLTPQ VIPLQLKVVI
VGSRQLYYAL QELDPDFQEM FRVLVDFDEE IPLGDDSLEQ FAQLLKTRTS EEGMAPLTGP
AVARLATYSA RLAEHQGRLS ARIGDLFQLV SEADFVRQLA GEALTDVGHI ERALKAKETR
TGRVSARVLD DMLAGIILID TEGAAIGKCN GLTVLEIGDS AFGVPARISA TVYPGGSGIV
DIEREVNLGQ PIHSKGVMIL TGYLGSRYAQ EFPLEISASI ALEQSYGYVD GDSASLGEVC
TLISALSRTP LRQCFAITGS INQFGEVQAV GGVNEKIEGF FRLCEARGLT GEQGVIIPRA
NVTNLMLDER VLQAVRAGNF HVYAVRQVDE ALSLLLGEPA GTPDGQGRFP AGSVNARVVE
RLREIAEMGM EEDSGKPKEP QVGLSE