Gene Information |
Locus tag | Avin_00540 |
Symbol | |
ID | 7759021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 55825 |
End bp | 57186 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643802980 |
Product | Peptidase, U32 family |
Protein accession | YP_002797296 |
Protein GI | 226942223 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.210117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCTT CCGCCTTCCG TCCCGAACTG CTGTCTCCCG CCGGCACCCT CAAGTCCATG CGCTACGCCT TCGCCTACGG CGCCGATGCG GTGTACGCCG GGCAGCCGCG CTACAGCCTG CGGGTGCGCA ACAACGAATT CGATCATGCC CACCTGGCGC TCGGCATCGC GGAAGCCCAC GCCCAGGGCA GACGCTTCTA CGTGGTGGTC AACATCGCCC CGCACAACAC CAAGCTGAAG ACCTTCCTCA AGGACCTGCA GCCGGTGATC GACATGCAAC CGGATGCGCT GATCATGTCC GACCCGGGAC TGATCATGCT GGTGCGCGAG CACTTTCCGG AAATGGCCAT CCACCTCTCG GTGCAGGCCA ACGCGGTGAA CTGGGCCAGC GTGGAGTTCT GGCGCCGCCA GGGGCTGACC CGGACGATCC TCTCCCGCGA GCTGTCGTTG GAGGAGATCG GCGAGATGCG CGAGCGGGTG CCGGGCATGG AGCTGGAGGT GTTCGTCCAC GGCGCGCTGT GCATGGCCTA TTCCGGGCGC TGCCTGCTGT CCGGCTACAT CAACCACCGC GATCCCAATC AGGGCACCTG CACCAACGCC TGCCGCTGGG AGTACCGGGC GCACGAGGGC AGGGAAGACG AACTGGGCAA CATCGTCCAC GTCCAGGAGC CGGTCCGGGC GCAGCCGGCC GAGCCGACCC TGGGCAGCGG CGTGCCCACC GAACGGCTGA TGCTGCTCGA GGAGAGCAAG CGGCCGGGCG AGTACATGGA GGCCTTCGAG GACGAGCACG GCACCTACAT CATGAACTCC AAGGACCTGC GCGCCGTGCA GCACGTCGAG CGGCTGGTGA AGATGGGCGT GCATTCGCTG AAGATCGAGG GCCGCACCAA GAGCCACTAC TACGTGGCGC GCACCGCCCA GGTCTACCGC AAGGCGATCG ACGACGCGGT GGCCGGCCGG CCGTTCGACA AGTCGTTGAT GGACACCCTG GAATCGCTGG CCCATCGCGG CTACACCGAG GGCTTCCTGC GCCGCCACGT ACACGACGAA TACCAGAACT ATGCCCATGG CTATTCGCTG TCCGAACGCC AGCAGTTCGT CGGCGAGCTG ACCGGCGAGC GCCGCAACGG TCTGGCCGAG GTGCAGGTGA AGAACCGTTT CGCCCTCGGC GACCGCCTGG AGCTGATGAC CCCCCGGGGC AACCTGAACT TCCGCCTGGA GGCGCTGGAG AACAAGCGCG GCGAGCGCGC CGAGGTGGCC CCGGGCGACG GCCACACCCT CTACCTGCCG GTACCGGAAG GCGTGGACCT CGGCCATGCC CTCCTGATGC GCGAGCTGGA CGGCGCCACC ACCCGCGGCT GA
|
Protein sequence | MTASAFRPEL LSPAGTLKSM RYAFAYGADA VYAGQPRYSL RVRNNEFDHA HLALGIAEAH AQGRRFYVVV NIAPHNTKLK TFLKDLQPVI DMQPDALIMS DPGLIMLVRE HFPEMAIHLS VQANAVNWAS VEFWRRQGLT RTILSRELSL EEIGEMRERV PGMELEVFVH GALCMAYSGR CLLSGYINHR DPNQGTCTNA CRWEYRAHEG REDELGNIVH VQEPVRAQPA EPTLGSGVPT ERLMLLEESK RPGEYMEAFE DEHGTYIMNS KDLRAVQHVE RLVKMGVHSL KIEGRTKSHY YVARTAQVYR KAIDDAVAGR PFDKSLMDTL ESLAHRGYTE GFLRRHVHDE YQNYAHGYSL SERQQFVGEL TGERRNGLAE VQVKNRFALG DRLELMTPRG NLNFRLEALE NKRGERAEVA PGDGHTLYLP VPEGVDLGHA LLMRELDGAT TRG
|
| |
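Consistency check |
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length_bp = gene_length(55825, 57186)
print(length_bp)                           # 1362
print(expected_protein_length(length_bp))  # 453
```

Both results agree with the Gene Length (1362 bp) and Protein Length (453 aa) fields. Passing the full Gene sequence string to gc_fraction should give a value comparable to the reported GC content of 67%, since the helper skips the spaces between the 10-base blocks.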