Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_32900 |
Symbol | |
ID | 7762188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3367795 |
End bp | 3369843 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643806158 |
Product | Oligopeptidase B protein |
Protein accession | YP_002800422 |
Protein GI | 226945349 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCAAC CCCCCATCGC CCGAATCGAG AGCCATTGTG CCGATCCCTA CCGCTGGCTG GAGCAGCGCG ACGACCCACA GGTGCTCGCC TATCTGGAAG CGGAGAACGC CTACCTGGAG GCCGAGCTCG CGGATGTCGG CGCGCTGCGC GAGAGCCTGT TCCAGGAGGT CAAGGGCCGC ATCCGCGAAA CCGACCTGTC CCTGCCGGTA CCCTGGGGAC CCTGGCTCTA CTACCAGCGC ACCACAGCCG GCGACGAATA CCCGCGCCAC TACCGCTGCC CGCGCCCGGC GGACGGCTCG CTACGGACCG ACACCAGCCG CGAGGAACTG CTGCTCGACC CAAATGCCCT GGCCGACGGC GGCTACCTGT CGCTCGGCGC CTTCGAGATC AGCCCCGACC ACCGTTGCCT GGCCTACAGC CTGGATAGCA GCGGCGACGA GATCTACCGG CTGTTCGTCA GGGAGCTGGA CAGCGGCGTG CTGCACGAAC TGCCCTTCGA GGACTGCGAC GGCAACATGA CCTGGGCCAA CGACAGCCGG ACGCTGTTCT TCGCCGAACT GGACGACACC CAGCGCCCGC ACAGGCTGTA CCGCTACCGG CTCGGCGACG ACGAGACCGA GCTGGTGTTC AACGAGCCGG ACGGACGCTT CTTCATTCAT TGCTACCGCG CCAGCTCCGA GCGCCAGTTG CTCCTCCTGA GCAGCAGCAA GACCACCTGC GAGGCCTGGA CGCTGGATGC CGAACGCCCG CAGGAAGCCT TCGTCTGCCT GGCGCCGCGC CAGGAGGGCC ACGAGTACTA CCCCGACCAC GGGCGCATCG ACGGCGACTG GGGCTGGCTG ATCCGCAGCA ACCAGGACGG CATCGAATTC GCCGTCTACC AGGCCCCCGA GGGTGCGCCG GGGCGCGAGC ACTGGCGGCC GCGGATCGCT CACGACGCGG CGCGGATGAT CGAGGACCTC AGCCTGAACG CCGCGGGCTT CGTCCTCAGC CTGCGCGAGA AGGGTCTGCC GATCGTCGAG GTGCACCCGG CCGAGGCCAC GCCCTACCGC GTTGAACTGC CGGACGCCGC CTACAGCCTG GACGTGCAGG ACATCCTGGA ATTCGACAGC CCGGTGATCC GCCTGCGCTA CGAGGCGCTC AACCGCCCGG CGCAGATCCG CCAGTTGGAC CTGGCCAGCG GCGCCCAGCG AGTGCTCAAG GAAACCCCGG TGGAAGGACC GTTCGACGCC GACGACTATC TCAGCCTGCG GCTCTGGGCC GAGGCGGCCG ACGGCGCGCG CATCCCGGTC AGCCTGGTCG CCCGCCGCGA CATCCTCCGA GGCGAGGGGC AAAAGCGCCC CGCTCCGCTC TATCTCTATG GCTACGGCGC CTATGGCGAG AGCCTCGACC CCTGGTTCTC CCACGCCCGG CTGAGCCTCC TGGAGCGCGG CTTCGTCTTC GCCATCGCCC ATGTGCGCGG CGGCGGCGAA CTGGGCGAGG CCTGGTACCG CGCCGGCAAG CTGGAACACA AGGAAAACAC CTTCGGCGAC TTCATCGCCG TGGCCGAGCA CCTGATCGCC GAGGGCGTCA CCTGCGCCGA CCGGCTGGCG ATCAGCGGCG GCAGCGCCGG CGGCCTGCTG ATCGGCGCCG TGCTCAACCG GCGTCCGGAG CTGTTCGCCG CGGCGATCGC CGAGGTGCCC TTCGTCGACG TGCTGAACAC CATGCACAAC CCCGAGCTGC CGCTGACCGT CACCGAGTAC GACGAGTGGG GCGATCCGCG CGATCCCGAG GTCCATGCCC GGATCGCCGC CTACGCCCCC TACGAGAACG TGCGCGCCCA GGCCTACCCG GCGATCCTCG CGGTGGCCAG CTACCACGAC AGCCGGGTGC AGTACTGGGA GGCGGCGAAG TGGGTGGCCA GGCTGCGCGC CAGCAAGACC GACGCCAACC TGCTGCTGCT GAAGACCGAG TTCGGCGCCG GCCACGGCGG CATGAGCGGA CGCTATCAGG CACTCAGGGA CGTGGCGCTG GAATACGCCT TCCTGCTCAG GGTGCTCGGC CGGGTCTGA
|
Protein sequence | MPQPPIARIE SHCADPYRWL EQRDDPQVLA YLEAENAYLE AELADVGALR ESLFQEVKGR IRETDLSLPV PWGPWLYYQR TTAGDEYPRH YRCPRPADGS LRTDTSREEL LLDPNALADG GYLSLGAFEI SPDHRCLAYS LDSSGDEIYR LFVRELDSGV LHELPFEDCD GNMTWANDSR TLFFAELDDT QRPHRLYRYR LGDDETELVF NEPDGRFFIH CYRASSERQL LLLSSSKTTC EAWTLDAERP QEAFVCLAPR QEGHEYYPDH GRIDGDWGWL IRSNQDGIEF AVYQAPEGAP GREHWRPRIA HDAARMIEDL SLNAAGFVLS LREKGLPIVE VHPAEATPYR VELPDAAYSL DVQDILEFDS PVIRLRYEAL NRPAQIRQLD LASGAQRVLK ETPVEGPFDA DDYLSLRLWA EAADGARIPV SLVARRDILR GEGQKRPAPL YLYGYGAYGE SLDPWFSHAR LSLLERGFVF AIAHVRGGGE LGEAWYRAGK LEHKENTFGD FIAVAEHLIA EGVTCADRLA ISGGSAGGLL IGAVLNRRPE LFAAAIAEVP FVDVLNTMHN PELPLTVTEY DEWGDPRDPE VHARIAAYAP YENVRAQAYP AILAVASYHD SRVQYWEAAK WVARLRASKT DANLLLLKTE FGAGHGGMSG RYQALRDVAL EYAFLLRVLG RV
|
| |