Gene Cwoe_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3641 
Symbol 
ID8734096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3876549 
End bp3878204 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content74% 
IMG OID646504263 
Producturocanate hydratase 
Protein accessionYP_003395433 
Protein GI284045093 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.278935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCG CGGCGAGCGG CGGCGCGCTC CGCTGCCGCG GCTGGCAGCA GGAGGCCGCG 
CTGCGGATGC TGGAGAACAA CCTCCACCCG GACGTCGCCG AGAAGCCGTC CGAGCTGATC
GTCTATGGCG GCATCGGCAA GGCCGCGCGC GACCGGGCGA GCTACGACGC GATCGTGCGC
GAGCTGACCC GGCTCGGCGA CGAGCAGACG CTGCTGGTCC AGTCCGGCAA GCCGGTCGCG
GTCTTCGACA CGCACCCGCA CGCACCGCGC GTGCTGCTGG CCAACTCGAA CCTCGTCCCC
GACTGGGCGA ACTGGGAGAC GTTCCGCGAG CTCGACGCCG CCGGGCTGAC GATGTACGGC
CAGATGACGG CCGGGTCGTG GATCTACATC GGCTCGCAGG GGATCCTCCA GGGCACCTAC
GAGACGTTCG CGGCGATCGC GCGCAAGCGC TTCGGCGGGT CGCTCGCCGG CCGCCTCGTC
GTGACCGCCG GGCTCGGCGG GATGGGCGGC GCGCAGCCGC TCGCGGTCAC GCTCAACGAC
GGCTGCGCGC TGTGCGTCGA AGTCGACCTG CAGCGGATCG AGCGCCGCAT CCGGACGGGC
TACCTCGACG AGCGCGCCGC GGACCTCGAC GACGCGCTCG CGCGGCTGGA GACGGCGCAG
GCCGAGCGCC GCCCGCTGTC GATCGGCCTG CTCGGCAACG TCGCCGACGT GCTGCCGGAG
CTGGTCCGCC GCGGCGTCGC GATCGACGTC GTGACCGATC AGACGAGCGC TCACGATCCG
CTCACCGGCT ACATCCCCGC CGGCCTGACC GTCGAGCAGG CCGACACGCT GCGCACGCGC
GACCAGGACG ACTACCTGCG CCGCGTCGGC GAGAGCGCGG TGACGCACGT CGGCGCGATC
CGCGCGCTCC AGCAGGCGGG CGCCGAGGCG TTCGACTACG GCAACGCGCT GCGTGGCCTC
GCCGCCGCGC ACGGCGACGC CGACGCGTTC TCCTACCCGG GCTTCGTGCC GGCGTACATC
CGCCCGCTCT TCTGCGAGGG CAAGGGCCCG TTCCGCTGGG TCGCGCTGTC GGGCGACCCG
GAGGACATCC GCAGGACGGA CGCGGCGATC CTCGACCTCT TCGGCGACCA GGAGCACGTC
GCGCGCTGGA TCCGGCTCGC GGGCGAGAAG GTGCGGTTCC AGGGGCTGCC GGCGCGGATC
TGCTGGCTCG GCTACGGCGA GCGCGACCGT GCCGGGCTGC GCTTCAACGA GATGGTCGCG
AGCGGCGAGC TGCGCGCGCC GATCGTGATC GGCCGCGACC ACCTCGACGG CGGCTCGGTC
GCCTCGCCCG AGCGCGAGAC GGAGGCGATG CGCGACGGCT CCGACGCGAT CGCCGACTGG
CCGCTGCTGA ACGCGCTGAT CAACACGGCG TGCGGCGCCA CGTGGGTCTC GATCCACCAC
GGCGGCGGTG TCGGGATGGG CAAGTCGATC CATGCCGGCC AGGTGGTCGT CGCCGACGGC
ACCGCCGGCG CGGCCGAGCG GATCCGGCGC ACGCTGACGG CCGACCCGGG GATGGGGATC
GTCCGCCACG TCGACGCCGG CTATCCCGAG GCGATCGACG CCGCGCGGCG GCTCGGCGTG
CACGTGCCGA TGCTCGACGG CCCTCCGGCG GCCTGA
 
Protein sequence
MNVAASGGAL RCRGWQQEAA LRMLENNLHP DVAEKPSELI VYGGIGKAAR DRASYDAIVR 
ELTRLGDEQT LLVQSGKPVA VFDTHPHAPR VLLANSNLVP DWANWETFRE LDAAGLTMYG
QMTAGSWIYI GSQGILQGTY ETFAAIARKR FGGSLAGRLV VTAGLGGMGG AQPLAVTLND
GCALCVEVDL QRIERRIRTG YLDERAADLD DALARLETAQ AERRPLSIGL LGNVADVLPE
LVRRGVAIDV VTDQTSAHDP LTGYIPAGLT VEQADTLRTR DQDDYLRRVG ESAVTHVGAI
RALQQAGAEA FDYGNALRGL AAAHGDADAF SYPGFVPAYI RPLFCEGKGP FRWVALSGDP
EDIRRTDAAI LDLFGDQEHV ARWIRLAGEK VRFQGLPARI CWLGYGERDR AGLRFNEMVA
SGELRAPIVI GRDHLDGGSV ASPERETEAM RDGSDAIADW PLLNALINTA CGATWVSIHH
GGGVGMGKSI HAGQVVVADG TAGAAERIRR TLTADPGMGI VRHVDAGYPE AIDAARRLGV
HVPMLDGPPA A