Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_26180 |
Symbol | hutU |
ID | 7761526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2673383 |
End bp | 2675062 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643805496 |
Product | urocanate hydratase |
Protein accession | YP_002799769 |
Protein GI | 226944696 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.60353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAAG CCTTCCACAA GTACCGCGAC ATCGAGATCC GCGCCCCGCG CGGCACCGCC CTGAACGCCA GGAGCTGGCT CTGCGAGGCG CCGCTGCGCC TGTTGATGAA CAACCTCGAT CCCGAGGTGG CGGAGAACCC GAAGGAACTG GTGGTCTACG GCGGCATCGG CCGGGCGGCG CGCAACTGGG AGTGCTTCGA CAGGATCGTC GAGTGCCTGA AGAACCTGGA GGAGGACGAA ACCCTGCTGA TCCAGTCGGG CAAGCCGGTG GGGGTGTTCC GCACCCAGCG CGACGCGCCG CGGGTGCTGA TCGCCAACTC CAACCTGGTG CCGCACTGGG CCACCTGGGA ACACTTCCAC GAGCTGGATG CCAGGGGCCT GGCCATGTTC GGGCAGATGA CCGCCGGCAG TTGGATCTAC ATCGGCAGCC AGGGCATCGT CCAGGGCACC TTCGAGACCT TCGTCGAGGC CGGCCGCCAG CATTACGGCG GCGATTTGCG CGGGCGCTGG CTGCTCAGCG CGGGCCTGGG CGGCATGGGC GGCGCGCAGC CGCTGGCGGC GACCCTGGCC GGGGCCAGCG CGCTGCTGGT CGAGTGCCAG CAGAGCCGCA TCGACTTCCG CCTGAAGACC GGCTATCTGG ACGAGCAGGC GCGCGACCTG GACGACGCCC TGGCGCGCAT CGCCCGCTAC CGCGGCGAAG GTCGGGCCGT GTCGGTCGGC CTCTGCGCGA ATGCCGCGGA CATCCTGCCG GAGCTGGTGC GTCGTGGCGT GCGCCCGGAC CTGGTCACCG ACCAGACCAG CGCCCACGAT CCGCTCAACG GCTACCTGCC CAGGGGCTGG AGCTGGGCCG AGTACCGCGA GCGCGCCGCC CGCGAGCCGG CCGCGACCGT CGCGGCGGCC AAGCGCTCGA TGGCCGGGCA TGTGCGCGCC ATGCTGGCCT TCCACGAACG GGGCGTGCCG GTGTTCGACT ACGGCAACAA CATCCGCCAG ATGGCCAGGG ACGAGGGCGT GGAGAACGCC TTCGACTTCC CCGGCTTCGT CCCGGCCTAT ATCCGCCCGC TGTTCTGCCG GGGGATCGGT CCGTTCCGCT GGGTGGCGCT GTCGGGCGAG GCCGAGGACA TCTACCGTAC CGATGCCAGG GTCAAGGAAC TGATCCCCGA CGATCCGCAC CTGCACCGCT GGCTGGGCAT GGCCCGCGAG CGTATCCGCT TCCAGGGCCT GCCGGCGCGC ATCTGTTGGG TCGGCCTCGG CCAGCGCGCC CGGCTCGGGC TGGCCTTCAA CGAGATGGTC CGGCGCGGCG AACTCAAGGC CCCGGTGGTC ATCGGTCGCG ACCATCTCGA CTCCGGCTCG GTGGCCAGCC CGAACCGCGA GACCGAGGCC ATGCGCGACG GCTCGGACGC GGTGTCCGAC TGGCCGCTGC TCAACGCCCT GCTGAACACG GCGAGCGGCG CCACCTGGGT GTCCCTGCAC CACGGCGGCG GCGTCGGCAT GGGCTACTCG CAGCACGCCG GGGTGGCCAT CGTCTGCGAT GGCACGGACG AGGCCGCGGC ACGCATCGCC CGCGTCCTGC ACAACGATCC GGCCAGCGGG GTGATGCGCC ACGCCGATGC CGGTTACCCG GAAGCCATCG CCTGTGCCCG CGAGCGAGGC TTGAAGCTGC CGATGCTGGG CGACGCCTGA
|
Protein sequence | MREAFHKYRD IEIRAPRGTA LNARSWLCEA PLRLLMNNLD PEVAENPKEL VVYGGIGRAA RNWECFDRIV ECLKNLEEDE TLLIQSGKPV GVFRTQRDAP RVLIANSNLV PHWATWEHFH ELDARGLAMF GQMTAGSWIY IGSQGIVQGT FETFVEAGRQ HYGGDLRGRW LLSAGLGGMG GAQPLAATLA GASALLVECQ QSRIDFRLKT GYLDEQARDL DDALARIARY RGEGRAVSVG LCANAADILP ELVRRGVRPD LVTDQTSAHD PLNGYLPRGW SWAEYRERAA REPAATVAAA KRSMAGHVRA MLAFHERGVP VFDYGNNIRQ MARDEGVENA FDFPGFVPAY IRPLFCRGIG PFRWVALSGE AEDIYRTDAR VKELIPDDPH LHRWLGMARE RIRFQGLPAR ICWVGLGQRA RLGLAFNEMV RRGELKAPVV IGRDHLDSGS VASPNRETEA MRDGSDAVSD WPLLNALLNT ASGATWVSLH HGGGVGMGYS QHAGVAIVCD GTDEAAARIA RVLHNDPASG VMRHADAGYP EAIACARERG LKLPMLGDA
|
| |