Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_40850 |
Symbol | |
ID | 7762970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4120027 |
End bp | 4122447 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806944 |
Product | ATP-dependent protease |
Protein accession | YP_002801195 |
Protein GI | 226946122 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.887392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGACT CCGTTGCTGC CGGCCTGCGC CTGGCACCCG ATGAGCTGAC CCGTCCCTTC GATCCCGCCC AGTTCGCTTT CGCCAGTACC GACGAACTGG AGCCCTTTCG CGGCGTGCTC GGTCAGGAGC GCGCGGTGGA AGCGCTGCAG TTCGGCGTGG CGATGCCACG CCCGGGTTAC AACGTCTATG TCATGGGCGA GCCGGGTACC GGCCGCTTTT CCTACGTCAA GCGCTATCTA CAGGCCGAGG CCAAGCGCCT GGAAACTCCG TCGGACTGGG TCTATGTCAA CCACTTCGAC GAGCCTCGCG AGCCGCGCTC GCTGCAACTC GCGGCGGGCA GCGCCAGCGA GTTCATCGCC GACATCAATC TGTTGATCGA CAACCTGCTG GCGACCTTCC CGGCGGTGTT CGAGACGCCG ACCTATCAGC AGAAGAAGAG CGTCATCGAC CGTGGTTTCA ACCAGCGCTA CGACCGCGCC CTCGATTGCA TCGAACGGCA GGCGCTGGAC AAGGGGATCG CCCTCTATCG CGACAGCGCC AACATCGCCT TCACCCCGAT GAAGGATGGC AAGACGCTGG ACGAGGGCGA GTTCGCCCTG TTGCCGGAGG CCGAGCGCGA GCGTTTCCAC GGGGACATCG GCTACCTCGA GGAGCGCCTC AACGAGGAGT TGTCCAGCCT GCCGCAGTGG AAACGCGAGT CGAGCAACCA GTTGCGCCAG CTCAACGAGG AGACCATCAC CCTGGCCCTG CAGCCGCTGC TCGCGCCGCT CTCGGAGAAA TACGCGGAAA ACGCCGGGGT CTGTGCCTAT CTGCAGGCGA TGCAGGTCAA CCTGCTGCGT ACCGTGGTCG AGCTGCTTAG CGACGAGAAC GCGACGGACG CCCAGCGCCG CGCGCTGCTG GTCGAGCAGT ACTGCCCGAG CCTGGTGGTC GGCCACCACA GCCAGGGTGG CGCGCCGGTG GTGTTCGAGC CGCATCCCAT CTACGACAAC CTGTTCGGTC GCATCGAATA CGGCACCGAT CAGGGGGCGC TCTACACCAG TTATCGCCAG TTGCGTCCGG GCGCGCTGCA CCGTGCCAAC GGCGGCTTTC TGATCCTCGA GGCGGAACGG CTGCTCGGCG AGCCCTTCGT CTGGGACGCC CTCAAGCGCG CCCTGCATTC GCGCCAATTG AAGATGGAGT CGCCGCTGGC CGAGATGGGC CGTCTGGCCA CCGTGACCCT CACCCCGCAG GTCATTCCGC TGCAGCTCAA GGTGGTGATC GTCGGCTCGC GGCAACTCTA TTACGCGCTG CAGGAGCTGG ATCCCGACTT CCAGGAGATG TTCCGCGTAT TGGTGGATTT CGACGAGGAG ATCCCTCTTG GCGACGACAG CCTCGAGCAG TTCGCCCAGT TGCTCAAGAC CCGCACCTCG GAGGAGGGCA TGGCGCCCCT GACCGGTCCG GCGGTGGCGC GCCTGGCGAC CTACAGCGCG CGTCTGGCCG AGCACCAGGG GCGGCTGTCG GCGCGCATCG GCGACCTGTT CCAACTGGTC AGCGAGGCCG ATTTCGTTCG CCAGCTGGCC GGTGAAGCGC TGACCGACGT CGGTCACATC GAGCGCGCCC TGAAGGCCAA GGAAACCCGT ACCGGACGGG TTTCGGCACG CGTTCTCGAC GACATGCTCG CCGGCATCAT CCTGATCGAC ACAGAAGGCG CGGCGATCGG CAAGTGCAAC GGGCTGACCG TGCTGGAGAT CGGCGACTCC GCCTTCGGTG TGCCGGCGCG GATCTCCGCT ACCGTCTATC CGGGCGGTTC GGGGATCGTC GACATCGAGC GCGAGGTCAA CCTCGGCCAG CCGATCCATT CCAAGGGGGT GATGATCCTC ACCGGCTATC TCGGCAGCCG CTATGCCCAG GAGTTTCCGC TGGAGATATC GGCGAGCATC GCCCTGGAGC AGTCCTACGG GTACGTCGAT GGCGACAGCG CCTCCCTCGG CGAGGTCTGC ACGCTGATCT CGGCGCTGTC GCGTACGCCG CTCAGGCAAT GCTTCGCCAT TACCGGTTCG ATCAACCAGT TCGGCGAGGT GCAGGCGGTC GGCGGGGTCA ACGAGAAGAT CGAGGGCTTC TTCCGCCTCT GCGAGGCGCG CGGACTGACC GGCGAGCAGG GGGTGATCAT CCCGCGCGCC AACGTCACCA ACCTGATGCT CGACGAACGT GTGCTGCAGG CGGTGCGCGC CGGGAACTTC CATGTCTACG CGGTGCGCCA GGTGGACGAG GCGCTCAGCC TGCTGCTCGG CGAGCCGGCC GGTACCCCGG ACGGGCAGGG CCGCTTCCCG GCCGGCAGCG TCAATGCGCG GGTGGTCGAG CGACTGCGGG AAATCGCCGA AATGGGCATG GAAGAGGATT CGGGCAAGCC GAAGGAGCCG CAGGTCGGCC TGTCGGAGTG A
|
Protein sequence | MPDSVAAGLR LAPDELTRPF DPAQFAFAST DELEPFRGVL GQERAVEALQ FGVAMPRPGY NVYVMGEPGT GRFSYVKRYL QAEAKRLETP SDWVYVNHFD EPREPRSLQL AAGSASEFIA DINLLIDNLL ATFPAVFETP TYQQKKSVID RGFNQRYDRA LDCIERQALD KGIALYRDSA NIAFTPMKDG KTLDEGEFAL LPEAERERFH GDIGYLEERL NEELSSLPQW KRESSNQLRQ LNEETITLAL QPLLAPLSEK YAENAGVCAY LQAMQVNLLR TVVELLSDEN ATDAQRRALL VEQYCPSLVV GHHSQGGAPV VFEPHPIYDN LFGRIEYGTD QGALYTSYRQ LRPGALHRAN GGFLILEAER LLGEPFVWDA LKRALHSRQL KMESPLAEMG RLATVTLTPQ VIPLQLKVVI VGSRQLYYAL QELDPDFQEM FRVLVDFDEE IPLGDDSLEQ FAQLLKTRTS EEGMAPLTGP AVARLATYSA RLAEHQGRLS ARIGDLFQLV SEADFVRQLA GEALTDVGHI ERALKAKETR TGRVSARVLD DMLAGIILID TEGAAIGKCN GLTVLEIGDS AFGVPARISA TVYPGGSGIV DIEREVNLGQ PIHSKGVMIL TGYLGSRYAQ EFPLEISASI ALEQSYGYVD GDSASLGEVC TLISALSRTP LRQCFAITGS INQFGEVQAV GGVNEKIEGF FRLCEARGLT GEQGVIIPRA NVTNLMLDER VLQAVRAGNF HVYAVRQVDE ALSLLLGEPA GTPDGQGRFP AGSVNARVVE RLREIAEMGM EEDSGKPKEP QVGLSE
|
| |