Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5956 |
Symbol | hutU |
ID | 7381041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 971515 |
End bp | 973188 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643649469 |
Product | urocanate hydratase |
Protein accession | YP_002547700 |
Protein GI | 222106909 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAACC CACGCCACAA TATCCGCGAT GTGCGGGCTG CCACGGGCAC GGAGCTATCG GCCAAGAGCT GGATGACCGA AGCCCCCTTG CGCATGTTGA TGAACAATCT CGACCCCGAC GTAGCCGAGC GTCCGCATGA GCTGGTGGTC TATGGCGGCA TTGGCCGCGC CGCCCGCACA TGGGAGGATT TTGACAGGAT TGTTGCCACG CTGAAGACGC TGACCGAAGA AGAGACGCTG ATTGTGCAAT CGGGCAAGCC AGTGGGTGTG TTCAAAACCC ATAAAGACGC GCCACGGGTG CTTATTGCCA ATTCCAATCT CGTGCCGCAT TGGGCCACAT GGGACCATTT CAACGAGCTG GATAAGAAGG GACTGGCCAT GTATGGCCAG ATGACCGCAG GCTCGTGGAT CTACATTGGC ACCCAAGGCA TTGTGCAGGG CACCTATGAA ACCTTCGTGG AAGCGGGTCG CCAGCACTAT AACGGCAACC TCAAAGGTAA GTGGATACTA ACCGGCGGTC TCGGCGGAAT GGGTGGCGCG CAGCCGCTGG CAGCCGTTAT GGCCGGTGCC TGCTGCCTTG CGGTGGAATG CGATGAAACC CGCGTGGATT TCCGCCTGCG CACGCGCTAT GTTGATGCCA AGGCCCACAC GCTGGATGAA GCCCTAGCGT TGATTGACCA ATGGACAAAG GCAGGCGAAG CCAAATCCGT GGGCCTGATT GGCAATGCTG CCGACATTTT CCCTGAACTG GTCAAGCGCG GTATCCGTCC CGATATTGTC ACCGACCAGA CCTCGGCGCA CGATCCGATC AATGGTTATC TGCCCTCCGG CTGGACCGTT GCCGAATGGC GCGCCAAGCA GGAAAGCGAT CCGAAAGCGG TGGAGCGTGC CGCCCGCGCC TCGATGAAAG TGCATGTCGC CGCCATGGTG GATTTCTGGA ACATGGGCGT TCCCACGCTG GATTACGGCA ACAATATCCG CCAGGTCGCC AAGGAAGAAG GGCTGGAAAA TGCCTTTGCC TTCCCCGGCT TTGTGCCTGC CTATATCCGC CCGCTGTTTT GCCGGGGCAT TGGTCCGTTC CGTTGGGCTG CTCTTTCGGG TGATCCAGAG GATATTTACA AGACCGATGC CAAGGTGAAG GAATTGTTGC CAGACAACAA GCACCTGCAC AATTGGCTGG ATATGGCCCG CGAGCGCATT GCCTTCCAAG GCCTGCCCGC CCGCATCTGC TGGGTGGGTC TGGGCGATCG CCACAAACTG GGTCTGGCCT TCAACGAAAT GGTGCGCAGC GGCGAACTGA AAGCCCCCGT TGTCATTGGT CGTGACCATC TTGACAGCGG CTCCGTTGCC TCGCCCAACC GCGAAACCGA AGCGATGAAG GATGGCTCGG ATGCCGTATC CGACTGGCCG TTGCTGAATG CGCTGCTCAA TTGCGCATCC GGTGCCACAT GGGTATCGCT GCACCATGGC GGCGGTGTGG GCATGGGCTT TAGCCAGCAT TCGGGCATGG TGATCTGCGC CGATGGCACC GATGATGCGG CGCGGCGCCT GGAGCGGGTG TTGTGGAACG ACCCGGCCAC CGGCGTCATG CGCCATGCCG ATGCGGGCTA TGACATTGCA CTGGATTGCG CCAAGGAAAA AGGCCTGCGC CTGCCCGGCA TTCTGGGCAA TTGA
|
Protein sequence | MTNPRHNIRD VRAATGTELS AKSWMTEAPL RMLMNNLDPD VAERPHELVV YGGIGRAART WEDFDRIVAT LKTLTEEETL IVQSGKPVGV FKTHKDAPRV LIANSNLVPH WATWDHFNEL DKKGLAMYGQ MTAGSWIYIG TQGIVQGTYE TFVEAGRQHY NGNLKGKWIL TGGLGGMGGA QPLAAVMAGA CCLAVECDET RVDFRLRTRY VDAKAHTLDE ALALIDQWTK AGEAKSVGLI GNAADIFPEL VKRGIRPDIV TDQTSAHDPI NGYLPSGWTV AEWRAKQESD PKAVERAARA SMKVHVAAMV DFWNMGVPTL DYGNNIRQVA KEEGLENAFA FPGFVPAYIR PLFCRGIGPF RWAALSGDPE DIYKTDAKVK ELLPDNKHLH NWLDMARERI AFQGLPARIC WVGLGDRHKL GLAFNEMVRS GELKAPVVIG RDHLDSGSVA SPNRETEAMK DGSDAVSDWP LLNALLNCAS GATWVSLHHG GGVGMGFSQH SGMVICADGT DDAARRLERV LWNDPATGVM RHADAGYDIA LDCAKEKGLR LPGILGN
|
| |