Gene VC0395_A0823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0823 
SymbolhutU 
ID5135821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp836308 
End bp838005 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content54% 
IMG OID640532281 
Producturocanate hydratase 
Protein accessionYP_001216773 
Protein GI147675330 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000435883 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAGT CATCTGCACA AGGAACAAGA CTGGACACTC AGCGCACCAT TCGCGCCCCT 
CGTGGCACAC AGCTGCGCGC CAAATCTTGG CTCACTGAAG CCCCACTTCG CATGCTGATG
AATAACCTCG ACCCCGATGT GGCCGAACAC CCTCATGCAC TGGTGGTGTA TGGTGGCATT
GGCCGCGCGG CGCGTAACTG GGAGTGCTTT GACAAAATCG TCGAAGTGCT GGAGCGCCTC
GAAGATGACC AGACTTTACT GGTGCAATCG GGTAAACCGG TGGGGGTTTT CCCCACCCAC
AAAAATGCGC CACGCGTGTT GATTGCCAAC TCCAACTTAG TACCACACTG GGCAAACTGG
GAGCATTTCA ACGAGCTCGA TAAACAAGGC TTGATGATGT ACGGCCAGAT GACGGCGGGA
TCTTGGATTT ACATTGGCTC ACAAGGCATA GTGCAAGGCA CTTACGAAAC CTTTGTCGCG
GTTGCGAAAA AGCATTTCAA TGGCGATGCC AAAGGCCGTT GGGTTTTGAC TGGCGGATTG
GGCGGCATGG GCGGCGCGCA ACCTTTAGCG GCGACGATGG CAGGATTCTC GATGATTGCG
GTGGAATGTG ATGAATCGCG CATCGACTAT CGTCTGCGCA CCGGTTATGT CGACAAAAAA
GCCAACACGC TTGATGAAGC GCTGGCGATG ATCGCCGATA CCGATCGCCC AATTTCTGTC
GGCTTACTGG GTAATGCCGC TGACATCTTC CCCGAATTAG TCAAACGCAA CATCACCCCT
GATGTGGTGA CAGATCAAAC GTCGGCACAC GATCCACTCA ATGGCTATTT GCCACTCGGC
TGGAGCATGG AAAAAGCTGC ACAGATGCGT CAACAAAATG AAGCTGAAGT CGTCAAAGCT
GCCAAAGCTT CGATGGCGAT CCAAGTGCGC GCCATGCTTG ATTTGCAAAC TCGCGGCGCT
GCGACGCTGG ACTATGGCAA TAACATTCGC CAAATGGCGC TGGAAGAAGG TGTTGCCAAT
GCGTTCGACT TCCCCGGTTT TGTGCCGGCC TATATTCGTC CACTCTTCTG TGAAGGGATA
GGTCCCTTCC GTTGGGCGGC ACTCTCTGGC GATCCGGAAG ACATTTACAA AACCGATCAA
AAAGTCAAAG AGCTGATCCC TGACAACCCA CATCTGCATA ACTGGCTGGA TATGGCGCGT
GAGCGAATCC ACTTCCAAGG TTTACCTGCC CGTATTTGCT GGGTCGGTTT AAAAGATCGC
GCTCGCTTAG GCTTAGCTTT TAACGAAATG GTGAAAAATG GCGAGCTCAA AGCGCCAATC
GTGATTGGTC GTGATCACCT CGATTCAGGC TCAGTCGCCA GCCCGAACCG CGAAACCGAA
GGCATGTTGG ATGGTTCAGA TGCAGTCTCT GATTGGCCAC TGCTCAATGC CCTACTCAAC
ACCGCAGGCG GAGCCACTTG GGTTTCTCTG CACCACGGTG GTGGCGTTGG TATGGGGTTC
TCACAGCATT CCGGTATGGT GATTTGCTGC GATGGCAGTG ATGATGCCGC CGAACGTATC
GCTCGTGTAC TGCACAATGA CCCAGCCACA GGCGTAATGC GCCACGCTGA TGCGGGCTAT
GAGATTGCCA AACGCTGCGC GCAGCAACAA AAACTCGACT TACCTATGCT CAACGCTGAG
CTGGCCAAAC TCAAGTGA
 
Protein sequence
MTQSSAQGTR LDTQRTIRAP RGTQLRAKSW LTEAPLRMLM NNLDPDVAEH PHALVVYGGI 
GRAARNWECF DKIVEVLERL EDDQTLLVQS GKPVGVFPTH KNAPRVLIAN SNLVPHWANW
EHFNELDKQG LMMYGQMTAG SWIYIGSQGI VQGTYETFVA VAKKHFNGDA KGRWVLTGGL
GGMGGAQPLA ATMAGFSMIA VECDESRIDY RLRTGYVDKK ANTLDEALAM IADTDRPISV
GLLGNAADIF PELVKRNITP DVVTDQTSAH DPLNGYLPLG WSMEKAAQMR QQNEAEVVKA
AKASMAIQVR AMLDLQTRGA ATLDYGNNIR QMALEEGVAN AFDFPGFVPA YIRPLFCEGI
GPFRWAALSG DPEDIYKTDQ KVKELIPDNP HLHNWLDMAR ERIHFQGLPA RICWVGLKDR
ARLGLAFNEM VKNGELKAPI VIGRDHLDSG SVASPNRETE GMLDGSDAVS DWPLLNALLN
TAGGATWVSL HHGGGVGMGF SQHSGMVICC DGSDDAAERI ARVLHNDPAT GVMRHADAGY
EIAKRCAQQQ KLDLPMLNAE LAKLK