Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_52330 |
Symbol | |
ID | 7764068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5346025 |
End bp | 5348796 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643808046 |
Product | Type III restriction enzyme, res subunit |
Protein accession | YP_002802280 |
Protein GI | 226947207 |
COG category | [S] Function unknown |
COG ID | [COG3421] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.232434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAC GTGTTAAGAA TCACGTCACT GGCCGCCTGT CGCTGCGGCC GCCGCAAGCC GAATCCCTGG CACGGCTGGT ACGCGCGCTG GAAAGCGCGC CGGAAATGCT GGGCAAGGAT AGGGACGTTG CCTCGATTCT GGCGACCCTG AAAGCCGAAT TTCCGATGCT GGAGGACTTC GAGCGGGACT TCCCTTCGCT GTGCTTCGCC TTGGCTACCG GAGTCGGCAA GACGCGCTTG ATGGGGGCTT TTGTCGCCTA CCTGCATCTG GCACACGGCA TCAATAATTT CTTCGTGCTG GCGCCGAATC TCACCATCTA TAACAAGTTG ATTGCCGATT TCACGCCGAA CACGCCCAAG TACGTGTTCA AGGGCATCGG GGAGTTCGCC ATCAACGCGC CCAGGGTCAT CACCGGCGAC AACTACGACC AGCAAAACGT CGCGGGTGGC GAGTTGTTCG GCGAGGTGCG GATCAACATC TTCAATATCT CCAAGATCAA TTCCGAGGTG CGCGGCGGCA AGGAGCCACG CATCAAGCGC ATGCGCGAGG TGCTGGGCGA AAGCTACTTC AACCACCTGG CCAACCTGCC GGACCTGGTG TTGTTGATGG ACGAATCACA CCGCTATCGT GCCCAGGCCG GTATGCGGGC GATCAACGAA TTGCACCCGC TGTTCGGATT GGAGGTGACG GCCACGCCCT TTGTGGAATC CGCCAAGGCC CCCATACCTT TCAAGAATGT GGTGATGGAC TATCCCTTGG CGCGCGCCAT GGAAGATGGC TTCGTCAAAG AACCGGCCGT CGTCACCCAA CGCAACTTCA AGGCCAGCAA CCACGCTTCG GAAGAAATCG AGAAGATCAA GCTGGAGGAC GGCGTTCGCC TGCACGAAGC CACCAAGGTC GAGCTATTGA CCTATGCCCG CGAGAACGGC GTCAAGGTTG TCAAACCCTT CATCCTGGTG ATCGCGCGCG ATACGACGCA TGCCGGGCAG TTGAAGGCAT TGATCGAGTC CTCAGCATTC TTTGAAGCGC GCTACCAGGG CAAGGTCATT CAGGTGGACT CCAGCCGCAC CGGAGCCGAG GAAGAGAAGA TGATCGAGGC GTTGTTGAAT GTGGAGAATC CGGAAGAACC CACCGAGATC GTTATCCACG TCAACATGCT CAAGGAAGGC TGGGACGTCA CCAACCTCTA CACCATCGTG CCGCTACGTG CCGCCAATGC ACGCACGTTG ATCGAGCAAT CCATCGGGCG TGGCCTGCGC CTGCCTTATG GCAAGCGTAC GGGCGTGGCT GCGGTGGATC GCCTCAACAT CGTGGCGCAC GACAAGTTCC AGGAGATCAT CGACGAGGCC AACCGTGGCG ATTCGCCGAT CCGTCTCAAG CAAGTGATCC TGGAAGCGCC CAGTAGCGAG GACAAAAAGG TCAGCGTTCA AGTGCTGCCC AATCTGCTGA CCCAACTTGG TCTGCACGAC GAGCATGCGC CACATGTCCC GCCTGCGTTG GCAACGGTGG ATGCCGGTAC TGAGGTTGGT GGCGAACAAG TCAACGCACA AACTCAGCCA GTCTTTTCGA CCGAAGCAGA ATTCAAGGCG GCCCATGTCG TACGGGAAGT CCTTGCCACC TATGAGGTGA AACGTGACTT GGCACCGACC AGTGCGGCAT TGCTGAAGCC GGAAATCCAG CAAGAGATTC TGGCAGAAGT GGAAAAGCGC CTGAACCCCC AACAGGGGCA ATTGCTGCAA GGCGCCGACG ACCAGGTGCC AGCGCTGGAT CTGTCCGCCG TCGTGGCCAA GACCACTGAA ATTCTGGTTC AGCAGACCAT CGACATTCCG CGTATCGCCG TTGTGCCTAC CGGCGAAGTC ACCACAGGCT TTCATCCGTT CCGCCTGGGG GCCTTGCCCA ACTTCCAGCC AGGGCAGCGC GAGATCGTCG GCCAGACATT GCGTACCAGC GAACAATTCA CCCTGAACCG TGAAAGTGGC CTGCGGGAAA ACCGCTTTGA AGATTACATC GTCAAGAAGC TGATCGACTT CGACGACATC GACTACTTCA CGCAGGCCGA CTTGCTCTAC GACCTCGCCG GACAGGCTGC CGAGCATTAC CAGCAGCAGA ACTATGCGGA CAGTGAACTG CACGAGATTT TCGATACCTA CGGCAAGGAG CTCGCCCGCC TGATTCGTGC CCAGATGATG GAGCACTTCT GGGAAAAAGC CGCGGGCTAT GAGGTTCAGG TCAGCAGAGG CTTCACCGAG CTGAAACCAT GTAACTACAC CGCTACGGAG GGGCAGACGG CCCATAACCT GAGGGAAACC GTTACGGAAA CCAGCCGTAT CAAACAGATG CTGTTCGGTG GGTTCACCCG TTGCCTGTAC CCCTTGCAGA AGTTCGACTC GGACACCGAA CGGCGTTTTG CCTTGCTGCT GGAACGTGAC GCCCTCAAAT GGTTCAAGCC TGCCAAGGGC CAGTTCCAGA TCTACTACAA GCTGGGCAGC GAACAGCCGG AATACATCCC CGACTTCGTG GCCGAGTTGG ATGGAATGAT CCTGATGGTC GAAACCAAAG CGCGTGTCGA TCTGGCTTCC GCTGAAGTCC AGGCCAAGAG CGCTGCCGCC TCTCGATGGT GTCGGCACGC CAGTGAACAT GCGGCTGAAG TCGGCGGTAA GTCCTGGCGC TACCTCGTGG TTCCCCATGA TGAAGTCACT GAAGACAAAC GCCTGTCCGA CTACCTGCGG TTTGAAGTCA AGGACACTGC GGAAGGAGGC AGCCCAGCCT GA
|
Protein sequence | MSSRVKNHVT GRLSLRPPQA ESLARLVRAL ESAPEMLGKD RDVASILATL KAEFPMLEDF ERDFPSLCFA LATGVGKTRL MGAFVAYLHL AHGINNFFVL APNLTIYNKL IADFTPNTPK YVFKGIGEFA INAPRVITGD NYDQQNVAGG ELFGEVRINI FNISKINSEV RGGKEPRIKR MREVLGESYF NHLANLPDLV LLMDESHRYR AQAGMRAINE LHPLFGLEVT ATPFVESAKA PIPFKNVVMD YPLARAMEDG FVKEPAVVTQ RNFKASNHAS EEIEKIKLED GVRLHEATKV ELLTYARENG VKVVKPFILV IARDTTHAGQ LKALIESSAF FEARYQGKVI QVDSSRTGAE EEKMIEALLN VENPEEPTEI VIHVNMLKEG WDVTNLYTIV PLRAANARTL IEQSIGRGLR LPYGKRTGVA AVDRLNIVAH DKFQEIIDEA NRGDSPIRLK QVILEAPSSE DKKVSVQVLP NLLTQLGLHD EHAPHVPPAL ATVDAGTEVG GEQVNAQTQP VFSTEAEFKA AHVVREVLAT YEVKRDLAPT SAALLKPEIQ QEILAEVEKR LNPQQGQLLQ GADDQVPALD LSAVVAKTTE ILVQQTIDIP RIAVVPTGEV TTGFHPFRLG ALPNFQPGQR EIVGQTLRTS EQFTLNRESG LRENRFEDYI VKKLIDFDDI DYFTQADLLY DLAGQAAEHY QQQNYADSEL HEIFDTYGKE LARLIRAQMM EHFWEKAAGY EVQVSRGFTE LKPCNYTATE GQTAHNLRET VTETSRIKQM LFGGFTRCLY PLQKFDSDTE RRFALLLERD ALKWFKPAKG QFQIYYKLGS EQPEYIPDFV AELDGMILMV ETKARVDLAS AEVQAKSAAA SRWCRHASEH AAEVGGKSWR YLVVPHDEVT EDKRLSDYLR FEVKDTAEGG SPA
|
| |