Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_18310 |
Symbol | |
ID | 7760765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1812985 |
End bp | 1814985 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643804729 |
Product | Peptidase, U32 family |
Protein accession | YP_002799018 |
Protein GI | 226943945 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTAC CCAAGCACCA TCTGGAACTG CTCAGCCCGG CACGGGATGT GTCCATCGCC AAGGAGGCGA TCCTGCATGG CGCCGATGCC GTGTACATCG GCGGCCCCAG CTTCGGTGCG CGGCACAGCG CCGGCAACGA TGTGACGGAT ATCGCCGGCC TGGTGGAGTT CGCCCACCAG TTCCACGCCC GGGTGTTCGT CACCCTCAAC ACCATCCTGC ACGACGACGA GCTGGAACCA GCCCGCCAGT TGATCTGGCA GCTACACGAC GCGGGCATCG ACGCGCTGAT CGTGCAGGAC ATGGGCATCC TGGAAATGGA CATTCCGCCC ATCGAGCTGC ACGCCAGCAC CCAGTGCGAC ATCCGCACCC TGGAAAAGGC GAAATTCCTG GCCGACGCCG GTTTCTCCCA ACTGGTGCTA GCGCGCGAAC TCGGCCTTGC GGAGGTCCGC GCCATCGGCG CGCGAGTGGA CTCGACCATC GAGTTCTTCA TCCACGGCGC GCTGTGCGTG GCCTTCTCCG GGCAGTGCTA CATCTCCCAC GCGCAGACCG GGCGCAGCGC CAACCGCGGC GACTGCTCGC AGGCCTGCCG GCTGCCCTAC ACCCTGAAGG ACGACCAGGG CCGGGTGGTG GCCTACGACA AGCACCTGCT GTCGATGAAG GACAACAACC AGAGCGCCAA CCTGCGCGCG CTGATTGATG CCGGGGTGCG CTCCTTCAAG ATCGAGGGAC GCTACAAGGA CATGGGCTAC GTGAAGAACA TCACCGCCTA CTACCGGCAA CGGCTCGACG AGGTCCTGAG CGAGCGCCCG GAGCTGGCAC GAGCCTCCAG CGGCCGGACC GAGCACTTCT TCGTGCCGGA CCCGGACAAG ACCTTCCACC GCGGCAGCAC GGACTACTTC GTCAACGAGC GCAGGATCGA CATCGGCGCC TTCGATTCGC CGACCTTCAC CGGCCTGCCG GTGGGCGAGA TACTCAAGGT CCGCAAGAAG GATTTCATCG CCGGCACCAC GGAGCCGCTC GGCAACGGCG ACGGCCTGAA CGTGCTGGTC AAGCGCGAAG TGGTGGGCTT CCGTGCCAGC GTCGTCGAAC AGCTCGAACG CTCCGAGCGG GGCGGCAAGC CCCACTGGCA ATACCGCATC GAGCCCAACG AGATGCCGGC CGCGCTGAAG CAGTTGCGTC CGCATCATGC GCTGAACCGC AACCTGGACC ACGACTGGCA GCAGGCGCTG CAGAAGACTT CCGCCGAGCG CCGGGTCGGG GTGCGCTGGC AGGTCCGGCT GACGGAGGAG GCACTCGACC TGAACGTGAC CAGCGAGGAA GGCGTCGGCG CTTCGGCGAG TCTCGCCGGG CCCTTCGGAC TGGCGAAGAA GCCGGAACAG GCGCTCGAGC AGTTGCGCGA CCTGCTCGGT CAATTGGGCA CCACCCTCTA CCATGCCGAA GAAGTGGAGA TCGATGCACC GCAGGCATTC TTCGTCCCGA ACTCCCTGCT CAAGGCCCTG CGCCGCGAGG CCATCGAGGC CCTGACTGCC GCCCGCCTGG CCACCCACCG GCGCGGCAGC CGCAAGCCGG TCAGCGAGCC GCCGCCGGTG TACCCGGAAT CGCACCTGAC CTTCCTGGCC AACGTGTACA ACGAGAAGGC CCGGGCGTTC TATCGCCGCT ACGGCGTGCA ACTGATCGAC GCGGCCTACG AGGCTCACGC GGAGGCCGGC GAAGTGCCGG TGATGATCAC CAAGCACTGT CTGCGTTTCT CCTTCAACCT GTGCCCGAAA CAGGCCAAGG GCGTCACGGG CGTGCGCACC AGGGTCGCGC CGATGCAACT GGTGCATCAG GACGAGGTGC TGACCCTGAC GTTCGACTGC AAGGCCTGCG AGATGCACGT CATCGGCAGG ATGAAAGACC ACATCCTCGC CCAGCCCCAA CCCGGCAGCT CGACCGGTAT CGTCGGCCAG ATCAGTCCCG AGGAGTTGCT CAAGACCATC AGGAACAGAG CGCAACGCTG A
|
Protein sequence | MSLPKHHLEL LSPARDVSIA KEAILHGADA VYIGGPSFGA RHSAGNDVTD IAGLVEFAHQ FHARVFVTLN TILHDDELEP ARQLIWQLHD AGIDALIVQD MGILEMDIPP IELHASTQCD IRTLEKAKFL ADAGFSQLVL ARELGLAEVR AIGARVDSTI EFFIHGALCV AFSGQCYISH AQTGRSANRG DCSQACRLPY TLKDDQGRVV AYDKHLLSMK DNNQSANLRA LIDAGVRSFK IEGRYKDMGY VKNITAYYRQ RLDEVLSERP ELARASSGRT EHFFVPDPDK TFHRGSTDYF VNERRIDIGA FDSPTFTGLP VGEILKVRKK DFIAGTTEPL GNGDGLNVLV KREVVGFRAS VVEQLERSER GGKPHWQYRI EPNEMPAALK QLRPHHALNR NLDHDWQQAL QKTSAERRVG VRWQVRLTEE ALDLNVTSEE GVGASASLAG PFGLAKKPEQ ALEQLRDLLG QLGTTLYHAE EVEIDAPQAF FVPNSLLKAL RREAIEALTA ARLATHRRGS RKPVSEPPPV YPESHLTFLA NVYNEKARAF YRRYGVQLID AAYEAHAEAG EVPVMITKHC LRFSFNLCPK QAKGVTGVRT RVAPMQLVHQ DEVLTLTFDC KACEMHVIGR MKDHILAQPQ PGSSTGIVGQ ISPEELLKTI RNRAQR
|
| |