Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_47740 |
Symbol | |
ID | 7763636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4843480 |
End bp | 4844181 |
Gene Length | 702 bp |
Protein Length | 233 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643807618 |
Product | haloacid dehalogenase-like hydrolase protein |
Protein accession | YP_002801853 |
Protein GI | 226946780 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.127906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCC GCCTGATCAC CTTCGACCTC GACGACACCC TGTGGAACCT CGCCCCGGTC ATCGAAAGCG CCGAAGCCGA ACTGCGCGAC TGGCTGGGCC GCCATGCGCC GCTGCTCGGC GCGGTACCGG TCGAACACCT GCAGCCGATC CGCGCCCGCC TGGTCGAAGG CGACCCGTCG CTCAAGCACC GCATCAGCGA ACTGCGCCGC CGCGTGCTTT CGCATGCCCT GGAAGAGGCC GGCTATCCCG CGCCCGAAGC CCGCGAACTG GCCGGACTGG CCTTCGAGAC CTTCCTCGCC GCGCGCCACC GCATCGAGTT CTTCCCCGAG ACGCGGCCGA CCCTGGCAAT CCTCGCCGAC CGCTACCGCC TCGGCGTGCT GACCAACGGC AATGCCGATG TGCGCCGGCT GGGCCTGGCC GACTATTTCC ACTTCATCCT CTGCGCCGAG GATCTGGGCA TCGGCAAGCC CGACCCGCAG CCCTTCCATG AGGCCCTGCG GCGCGGCGGA GCGAGTGCCG AAGAAACGGT GCACATCGGC GATCATCCCG ACGACGACAT CCAGGGCGCC CAGCGCGCCG GTCTGCGCGC GATCTGGTAT AACCCCGGCG GCAAGCCCTG GAGCCACGCG GGAACACCGG ATGCGCAGAT CGCCAGCCTG GCGGAGCTGC CGGCGCTGCT CGGCCGCTGG CAGGCCGGCT GA
|
Protein sequence | MSIRLITFDL DDTLWNLAPV IESAEAELRD WLGRHAPLLG AVPVEHLQPI RARLVEGDPS LKHRISELRR RVLSHALEEA GYPAPEAREL AGLAFETFLA ARHRIEFFPE TRPTLAILAD RYRLGVLTNG NADVRRLGLA DYFHFILCAE DLGIGKPDPQ PFHEALRRGG ASAEETVHIG DHPDDDIQGA QRAGLRAIWY NPGGKPWSHA GTPDAQIASL AELPALLGRW QAG
|
| |