Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_26160 |
Symbol | hutI |
ID | 7761524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2672080 |
End bp | 2673291 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643805494 |
Product | imidazolonepropionase |
Protein accession | YP_002799767 |
Protein GI | 226944694 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.242908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA CCCGTTGCTG GTTCCACTGC CATGCCGCCA CCATGAAGGG CGGTCACTAC TCGGTGATCG AGGACGCCGC GATCCTCGCC CGCGGCGAGC GGATCGCCTG GATCGGCCCG CGCGACGCCT TGCCGCCCGG CGATTGCCAG GCCGTCGACC TCGGCGGCGC CTGGGTCACG CCGGGGCTGA TCGATTGCCA CACCCATCTG GTGTTCGCCG GCGAGCGCAG CGCCGAGTTC GAGCTGCGTC TGGAGGGTGC CAGCTATGCC GAGATCGCCG CCCATGGCGG CGGCATCCTC AGCACGCTGC GCGCCACCCG GGCGGCCAGC GAGGAGCAGT TGCTCGACGG CGCGTTGCGC CGCGCCCGCC AGTTGCTGCG CGACGGGGTG ACCTGCCTGG AGGTGAAGTC CGGCTATGGC CTCGACCTGC CCGGCGAGCG CAAGCAGTTG CGTGTCGCCC GCCAGCTCGC CGAGCGCCTG CCGCTGACGG TGCGCACCAC CTGCCTGGCC GCCCATGCCG TGCCGCCGGA ATACGCCGGG CGCGCCGACG CCTATATCGA GCATGTCTGC GCCGAGCTGC TCCCGGCGCT GGCCGACGAG GGCCTGGTGG ATGCGGTGGA CGCCTACTGC GAGCATCTGG CCTTCTCGCC CGAGCAGGTC GAGCGGCTGT TCCGGGCCGC CGAGGCCCTG GGCTTGCCGG TCAAGCTGCA TGCCGAGCAG CTTTCCGCGC TGGGCGGGAC GCGGGTCGCG GCGCGCCACC GGGCGTTGTC CGCCGATCAC CTGGAATACG CCGGCGAGGC GGACGTCGCC GCCCTGGCGA AGGCGGGCAC GGTGGCCGTG CTGCTGCCGG CGGCCTTCTA CCTGCTCGAC GAGAGCCGCC GGCCGCCGGT CGAGGCGCTG CGCCGGCACG CCGTGCCGAT GGCCCTGGCC AGCGACCTCA ACCCCGGCAC CGCGCCGGTG CTGTCGCTGC GCCTGATGCT GGCCATGGCC TGCACCTCCT TCGGCCTGAC CGCCGAGGAG GCCCTGGCCG GGGTAACCCT GCACGCGGCC CGCGCCCTCG GCCTGGCCGA CGACCATGGC AGCCTGGAGC CCGGCAAGTA CGCCGATTTC GTCGCCTGGG ACATCCATCG TCCGGCCGAG CTGGCCTACT GGCTGGGCGG CGAGCTGTCC AGGCGGATCG TCTTTCACGG AAAGGAAGTC AGCCATGGAT GA
|
Protein sequence | MTDTRCWFHC HAATMKGGHY SVIEDAAILA RGERIAWIGP RDALPPGDCQ AVDLGGAWVT PGLIDCHTHL VFAGERSAEF ELRLEGASYA EIAAHGGGIL STLRATRAAS EEQLLDGALR RARQLLRDGV TCLEVKSGYG LDLPGERKQL RVARQLAERL PLTVRTTCLA AHAVPPEYAG RADAYIEHVC AELLPALADE GLVDAVDAYC EHLAFSPEQV ERLFRAAEAL GLPVKLHAEQ LSALGGTRVA ARHRALSADH LEYAGEADVA ALAKAGTVAV LLPAAFYLLD ESRRPPVEAL RRHAVPMALA SDLNPGTAPV LSLRLMLAMA CTSFGLTAEE ALAGVTLHAA RALGLADDHG SLEPGKYADF VAWDIHRPAE LAYWLGGELS RRIVFHGKEV SHG
|
| |