Gene Information |
Locus tag | Avin_26300 |
Symbol | hutF |
ID | 7761538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2686061 |
End bp | 2687449 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643805508 |
Product | N-formimino-L-glutamate deiminase |
Protein accession | YP_002799781 |
Protein GI | 226944708 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02022] formiminoglutamate deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.670476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCG TCCTGTTCGC CGAACGCGCC CTGCTGGCGG GGGGCTGGGC GCACAATGTG CGCCTGCAGA TCGATACCGC CGGACGGATC GAACGGATCG AGCCCGATGC CGGCTCCGCG GGAGCCGAGC GCCTGCACGG ACCGCTCCTG CCCGGCATGC CGAACCTGCA TTCGCACGCC TTCCAGCGCG CCATGGCCGG GCTCGCCGAG GTGGCCGGCG GTTCCGCGGA CAGCTTCTGG TCCTGGCGCG AACAGATGTA CCGGCTGGTC GCCCGGCTCC TTCCGGAACA ACTGGAAACC ATCGCCCGCC AGCTCTACAT CGAGCTGCTC AAGGCCGGCT ACAGCAGCGT CGCCGAGTTC CACTACCTGC ACCATGACCT CCATGGCCAC CCCTACGCCG AGCGCGCCGA GCTGTCCCTG CGACTCGCTC GCGCCGCCGG CGAGGCCGGC ATCGGCCTGA CCCTGGTGCC GGTGCTCTAC AGCCATTCCA GCTTCGGCGG CCAGCCGCCG GGGGCCGCGC AGCGGCGTTT CGTCCTCGAT GTGGACGGCT ATCTCGGCCT CCTGGCGCGC CTGCGTCCGG TGCTCAGGCG GAACGGACAG CACCTGGCGC AGGGTTTCCA CTCCCTGCGC GCCGTCACCC CCGAACAGAT CGCCCGGGTG CTGGAAGATT CGCCCATCGA CGGCCCCATC CACCTGCACA TCGCCGAACA GCAGAAGGAA GTCGACGACT GCCTGACCTG GAGCGGCCGC CGCCCCCTGC AATGGCTGTT CGAGCATTGC CCGGTGGACC GGCGCTGGTG CCTGGTGCAC GCCACCCAGG CACAGCCGGA AGAACTCGCC CGCCTGGCCG CGAGCGGCGC GGTCGCCGGC CTGTGCCCCA CGACCGAGGC CAATCTCGGC GACGGCCTGT TTCCGGCCGC CGACTACCTG GCCCATGGCG GGCGCTTCGG CATCGGCTCG GACAGCCAGG TATCGGTCAG CCCGCTGGAA GAGCTGCGCT GGCTGGAATA CGGCCAGCGC CTGCGCGACC GTCGGCGCAA TCGCCTTGCC CGGCTGGAGC GTCCGGCGGT CGGCGCCGTG CTCTACCAGG CGGCCCTGAG CGGTGGCGCC CAGGCCCTCG GACAGCCGAT AGGCGCACTG GAAGTCGGCC GACGCGCCGA TCTGCTGGTG CTGGACGGCG ACGATCCCTA TCTGGCGAGC GCCGAGGGCG ATCAGTTGCT CAACCGCTGG CTGTTCGCCG GCAACGACCG CCAGGTGCGC GACCTGATGG TCGCCGGCCG CTGGGTGGTG CGCGAAGGCC GCCACGCGGA CGAGGAGCGC AGCGCCCGTG CCTTCGCCAA GGTGCTGGCG ACACTCTCGG CGACGCCCTT CGCCTCGGCG GCCAGGTGA
|
Protein sequence | MTTVLFAERA LLAGGWAHNV RLQIDTAGRI ERIEPDAGSA GAERLHGPLL PGMPNLHSHA FQRAMAGLAE VAGGSADSFW SWREQMYRLV ARLLPEQLET IARQLYIELL KAGYSSVAEF HYLHHDLHGH PYAERAELSL RLARAAGEAG IGLTLVPVLY SHSSFGGQPP GAAQRRFVLD VDGYLGLLAR LRPVLRRNGQ HLAQGFHSLR AVTPEQIARV LEDSPIDGPI HLHIAEQQKE VDDCLTWSGR RPLQWLFEHC PVDRRWCLVH ATQAQPEELA RLAASGAVAG LCPTTEANLG DGLFPAADYL AHGGRFGIGS DSQVSVSPLE ELRWLEYGQR LRDRRRNRLA RLERPAVGAV LYQAALSGGA QALGQPIGAL EVGRRADLLV LDGDDPYLAS AEGDQLLNRW LFAGNDRQVR DLMVAGRWVV REGRHADEER SARAFAKVLA TLSATPFASA AR
|
| |
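The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The gene sequence ends in TGA, so the second assumption fits this record. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the record for Avin_26300 (hutF).
    length_bp = gene_length(2686061, 2687449)
    print("Gene length (bp):", length_bp)
    print("Protein length (aa):", protein_length(length_bp))
```

Passing the full Gene sequence field to `gc_fraction` and rounding the result to a whole percentage gives a value that can be compared with the GC content field.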