Gene Information |
Locus tag | VC0395_A0823 |
Symbol | hutU |
ID | 5135821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 836308 |
End bp | 838005 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640532281 |
Product | urocanate hydratase |
Protein accession | YP_001216773 |
Protein GI | 147675330 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
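The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but this convention reproduces the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 836308
END_BP = 838005
GENE_LENGTH_BP = 1698
PROTEIN_LENGTH_AA = 565


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1698
print(expected_protein_length(GENE_LENGTH_BP))  # 565
```

Both values agree with the record: 1698 bp for the gene and 565 aa for the protein.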
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000435883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAGT CATCTGCACA AGGAACAAGA CTGGACACTC AGCGCACCAT TCGCGCCCCT CGTGGCACAC AGCTGCGCGC CAAATCTTGG CTCACTGAAG CCCCACTTCG CATGCTGATG AATAACCTCG ACCCCGATGT GGCCGAACAC CCTCATGCAC TGGTGGTGTA TGGTGGCATT GGCCGCGCGG CGCGTAACTG GGAGTGCTTT GACAAAATCG TCGAAGTGCT GGAGCGCCTC GAAGATGACC AGACTTTACT GGTGCAATCG GGTAAACCGG TGGGGGTTTT CCCCACCCAC AAAAATGCGC CACGCGTGTT GATTGCCAAC TCCAACTTAG TACCACACTG GGCAAACTGG GAGCATTTCA ACGAGCTCGA TAAACAAGGC TTGATGATGT ACGGCCAGAT GACGGCGGGA TCTTGGATTT ACATTGGCTC ACAAGGCATA GTGCAAGGCA CTTACGAAAC CTTTGTCGCG GTTGCGAAAA AGCATTTCAA TGGCGATGCC AAAGGCCGTT GGGTTTTGAC TGGCGGATTG GGCGGCATGG GCGGCGCGCA ACCTTTAGCG GCGACGATGG CAGGATTCTC GATGATTGCG GTGGAATGTG ATGAATCGCG CATCGACTAT CGTCTGCGCA CCGGTTATGT CGACAAAAAA GCCAACACGC TTGATGAAGC GCTGGCGATG ATCGCCGATA CCGATCGCCC AATTTCTGTC GGCTTACTGG GTAATGCCGC TGACATCTTC CCCGAATTAG TCAAACGCAA CATCACCCCT GATGTGGTGA CAGATCAAAC GTCGGCACAC GATCCACTCA ATGGCTATTT GCCACTCGGC TGGAGCATGG AAAAAGCTGC ACAGATGCGT CAACAAAATG AAGCTGAAGT CGTCAAAGCT GCCAAAGCTT CGATGGCGAT CCAAGTGCGC GCCATGCTTG ATTTGCAAAC TCGCGGCGCT GCGACGCTGG ACTATGGCAA TAACATTCGC CAAATGGCGC TGGAAGAAGG TGTTGCCAAT GCGTTCGACT TCCCCGGTTT TGTGCCGGCC TATATTCGTC CACTCTTCTG TGAAGGGATA GGTCCCTTCC GTTGGGCGGC ACTCTCTGGC GATCCGGAAG ACATTTACAA AACCGATCAA AAAGTCAAAG AGCTGATCCC TGACAACCCA CATCTGCATA ACTGGCTGGA TATGGCGCGT GAGCGAATCC ACTTCCAAGG TTTACCTGCC CGTATTTGCT GGGTCGGTTT AAAAGATCGC GCTCGCTTAG GCTTAGCTTT TAACGAAATG GTGAAAAATG GCGAGCTCAA AGCGCCAATC GTGATTGGTC GTGATCACCT CGATTCAGGC TCAGTCGCCA GCCCGAACCG CGAAACCGAA GGCATGTTGG ATGGTTCAGA TGCAGTCTCT GATTGGCCAC TGCTCAATGC CCTACTCAAC ACCGCAGGCG GAGCCACTTG GGTTTCTCTG CACCACGGTG GTGGCGTTGG TATGGGGTTC TCACAGCATT CCGGTATGGT GATTTGCTGC GATGGCAGTG ATGATGCCGC CGAACGTATC GCTCGTGTAC TGCACAATGA CCCAGCCACA GGCGTAATGC GCCACGCTGA TGCGGGCTAT GAGATTGCCA AACGCTGCGC GCAGCAACAA AAACTCGACT TACCTATGCT CAACGCTGAG CTGGCCAAAC TCAAGTGA
|
Protein sequence | MTQSSAQGTR LDTQRTIRAP RGTQLRAKSW LTEAPLRMLM NNLDPDVAEH PHALVVYGGI GRAARNWECF DKIVEVLERL EDDQTLLVQS GKPVGVFPTH KNAPRVLIAN SNLVPHWANW EHFNELDKQG LMMYGQMTAG SWIYIGSQGI VQGTYETFVA VAKKHFNGDA KGRWVLTGGL GGMGGAQPLA ATMAGFSMIA VECDESRIDY RLRTGYVDKK ANTLDEALAM IADTDRPISV GLLGNAADIF PELVKRNITP DVVTDQTSAH DPLNGYLPLG WSMEKAAQMR QQNEAEVVKA AKASMAIQVR AMLDLQTRGA ATLDYGNNIR QMALEEGVAN AFDFPGFVPA YIRPLFCEGI GPFRWAALSG DPEDIYKTDQ KVKELIPDNP HLHNWLDMAR ERIHFQGLPA RICWVGLKDR ARLGLAFNEM VKNGELKAPI VIGRDHLDSG SVASPNRETE GMLDGSDAVS DWPLLNALLN TAGGATWVSL HHGGGVGMGF SQHSGMVICC DGSDDAAERI ARVLHNDPAT GVMRHADAGY EIAKRCAQQQ KLDLPMLNAE LAKLK
|
| |