Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_10030 |
Symbol | |
ID | 7759948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 954447 |
End bp | 955397 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643803908 |
Product | histone deacetylase superfamily |
Protein accession | YP_002798210 |
Protein GI | 226943137 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0971633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCC CCTTGCCGCG GAGCGCCGCC ATGTCCTTAC CGCTGGTCTA CCACGACGAA TACAGCCCGT CCTTCCCGGA TGGCCACCGC TTTCCCATGG AAAAGTTCCG CCTGCTGCGC GACCACCTGG TCGACAGCGG CCTGACCAGC GACGCCGAGC TGCGCCGCCC CGAGCCCTGT CCGACGGACA TCCTCGCCCT GGCCCACGAC CCGGCCTACA TCGAGCGCTA CTGCAGCGGC GCACTGAGCC GCGAGGAACT GCGCCGCCTG GGCCTGCCCT GGACCCCGGC GCTGGCCCGA CGCACCGTGC TGGCGGTGGG CGGCTCGCTG CTGGCCGCGG AACTGGCCCT GGAACACGGA CTCGCCTGCC ACCTGGCCGG CGGTACCCAC CACGCCCACC ACGACCACCC TTCCGGCTTC TGCATCTTCA ACGACCTGGC GGTGGTTTCC CGCTACCTGC TGGCGAGCGG GAGGGTCGGC CGCGTGCTGA TCTTCGACTG CGACGTGCAC CAGGGCGACG GCACCGCGCG CATCCTCGAA GACACCCCGG AGGCGATCAC CGTGTCGCTG CACTGCGAGC AGAACTTCCC GGCGCGCAAG GCCAGAAGCG ACTGGGACAT CGGCCTGCCG CGCGGCATGG GCGATGCCGA CTACCTGAAG GTGGTCGACG ACGCGCTGAA CTACCTGCTG CCGCTGTACC AGCCGGACCT GGTGCTCTAC GACGCCGGCG TGGACGTGCA CCAGGACGAT GCCCTCGGTT ACCTGGCGCT CAGCGATGCC GGCCTCGCCG CCCGCGACGG CGCCGTGCTG CGCCACTGCC TCGCGCGCGG GATCGCGGTG CTCGGCGTGA TCGGCGGCGG CTACGACCGG GACCGCGCCG CCCTGGCGCG GCGCCACGGC ATTCTCCACC ATGGCGCCGC GCGCCTCTGG CGGGAACTGG GCCTCGGCTG A
|
Protein sequence | MNRPLPRSAA MSLPLVYHDE YSPSFPDGHR FPMEKFRLLR DHLVDSGLTS DAELRRPEPC PTDILALAHD PAYIERYCSG ALSREELRRL GLPWTPALAR RTVLAVGGSL LAAELALEHG LACHLAGGTH HAHHDHPSGF CIFNDLAVVS RYLLASGRVG RVLIFDCDVH QGDGTARILE DTPEAITVSL HCEQNFPARK ARSDWDIGLP RGMGDADYLK VVDDALNYLL PLYQPDLVLY DAGVDVHQDD ALGYLALSDA GLAARDGAVL RHCLARGIAV LGVIGGGYDR DRAALARRHG ILHHGAARLW RELGLG
|
| |