Gene Information |
Locus tag | Avin_46970 |
Symbol | gcp |
ID | 7763560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4767396 |
End bp | 4768421 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807541 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002801777 |
Protein GI | 226946704 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
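The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The constant and function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 4767396
END_BP = 4768421
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1026
    print(expected_protein_length(GENE_LENGTH_BP))  # 341
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed for Avin_46970.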
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGTTC TGGGGCTGGA AACCTCCTGC GACGAAACCG GCGTCGCGCT TTACGACAGC CGGCGCGGCC TGCTGGCCGA CGCGCTGTTC AGCCAGATCG ATCTGCACCG CATCTATGGC GGGGTAGTGC CCGAACTGGC CTCGCGGGAT CACGTCAAGC GCATGCTGCC GCTCTTGCGC CAGGTGCTCG ACGAATCCGG CTGCCGCACC GGGGACATCG ACGGCATCGC CTATACCGCA GGACCCGGCC TGGTCGGCGC GCTGCTGGTC GGCGCCTCCT GCGCCCAGGC GCTGGCGCTG GCCTGGGGGG TTCCGGCGCT CGGCGTGCAT CACATGGAGG GTCATCTGCT GGCGCCGATG CTGGAGGAAC AGCCGCCGCA GTTTCCCTTC GTCGCCCTGC TGGTTTCCGG CGGTCATACC CAACTGGTGC GGGTCGACGG CATCGGTCGC TACCAGGTGC TCGGCGAGTC GCTGGACGAC GCCGCCGGCG AGGCTTTCGA TAAGACCGCC AAGCTGCTCG GCCTCGGTTA TCCCGGCGGT CCGGAGATCG CCCGCTTGGC ACAGGACGGC CGGCCCGGGC GTTTCGTCTT CCCGCGGCCG ATGACCGACC GGCCAGGCCT GGAGTTCAGC TTCAGCGGCC TCAAGACCTT CGCCCTGAAT ACCTGGCAGC ACTGCCGGGC GAGCGGCGAC GACGGCGAGC AATCGCGCCG CGATATCGCT CTGGCCTTCC AGCAGGCGGT GGTGGAGACG CTGATCATCA AGTGCCGGCG GGCGCTGAAG CAGACCGGCC TGAAGCGTCT GGTCATCGCC GGCGGGGTGA GTGCCAACCA GGCGTTGCGT TCGGCGCTGG AGCGGATGCT CGGCGAACTG GATGGCCAGG TGTTCTACGC CCGGCCGCGC TTCTGCACCG ACAACGGCGC GATGATCGCC TATGCTGGCT GCCAGCGTTT GCTGGCCGGC CAGCGGGATG GGCCGGCGAT TCAGGTCCAT GCGCGCTGGC CGATGGAGAC CCTGCCGGCG CTCTGA
|
Protein sequence | MLVLGLETSC DETGVALYDS RRGLLADALF SQIDLHRIYG GVVPELASRD HVKRMLPLLR QVLDESGCRT GDIDGIAYTA GPGLVGALLV GASCAQALAL AWGVPALGVH HMEGHLLAPM LEEQPPQFPF VALLVSGGHT QLVRVDGIGR YQVLGESLDD AAGEAFDKTA KLLGLGYPGG PEIARLAQDG RPGRFVFPRP MTDRPGLEFS FSGLKTFALN TWQHCRASGD DGEQSRRDIA LAFQQAVVET LIIKCRRALK QTGLKRLVIA GGVSANQALR SALERMLGEL DGQVFYARPR FCTDNGAMIA YAGCQRLLAG QRDGPAIQVH ARWPMETLPA L
|
| |
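The sequence fields above can be related to the GC content and Translation table entries with a short sketch. It assumes the sequences are pasted in exactly as shown, in space-separated blocks of ten characters, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# The amino-acid assignments are the same for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGCTCGTTC TGGGGCTGGA AACCTCCTGC"))  # MLVLGLETSC
```

Passing the full Gene sequence value to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields. The sketch does not handle ambiguous bases such as N.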