Gene Cmaq_1541 details

Gene Information

Locus tag: Cmaq_1541
Symbol:
ID: 5709056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1621142
End bp: 1623001
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 45%
IMG OID: 641276049
Product: urocanate hydratase
Protein accession: YP_001541354
Protein GI: 159042102
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
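
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1621142
end_bp = 1623001
gene_length_bp = 1860
protein_length_aa = 619


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```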


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.284209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTAC CCCAGAGGTA CAGGGGTAGG CCTATTGAGG AGCTTATATC CTCAGGTTAC 
TATGATCCCG AGTCCAGGAC TGTTAAAGCC ATTAAGGGTG ATGAGATTCA CGTACATAGT
AGGGATTGGC AGATTGAGGG TCCATTAAGA ATGCTCTTTC ACGTGCTTGA TCCTATGGTT
GCTAAGGATC CTAAGAATTT AATAGTCTAC GGTGGTACCG GTAAGGCTGC TAGGTCGTGG
GATGATTTCG AGGATATTGT GGATGCTTTA CTTACCATGG ATAGTGAGGA TACACTTGTA
ATCCAGAGTG GCCAACCTGT GGCTATTTGG AGGCTTAGTA GGAGTGCGCC CAGGGTTTTA
ATGAGTAATG CAATGCTTGT TCCAAAGTGG GCTGATTGGA GGGTTTTCTG GGAGCTTGAG
GCTAAGGGTT TAATCAGTTT CCACCAGATG ACTGCAGGAT GCTGGGCCTA CATTGGTACA
CAGGGTATTC TTCAAGGTAC CTATGAAACA ATTGGAGCCG CCGCTGATAG GCATTTTAAT
GGTTCATTAG AGGGTAGGCT AGTGGTTAGC GCTGGCCTTG GTAACATGGG TGGTGCCCAA
CCATTAGCGA TTAAAATGCT TGGTGGCGTT GCGTTAATAG CCGATGTGGA TGTTAATATG
ATTCGAAGAA TGATTGATAC AGGTTACTTA GATACTTGGA CTGATAACAT TGATAAGGCA
ATCGACATGG CTATTGATGC TAAGGAGAAG AAGCAGGCAG TAAGCATAGG TGTTCTGGCT
AACGCCGTTG ATTTACTTGA GAAACTCATC AAGGATAATA TAGTGCCGGA AATATTAACC
GATCAAACAC CTGCACATGA CCCATTATCA TACGTACCTA AAGGCCTAAC CATTGAGCAG
GCAATGCAAT TAAGGAAGAG TGACCCTGAG AAATACATGG TGATGGCTAA GGAAACAATG
AAGAGGCATG TTCAATTAAT GCTTGAACTA CAGAGAATGG GTTCCGTGAC CTTTGAGTAC
GGTAATAATT TAAGAAAGCA GGCTTATGAT GCTGGCGTCA ATGATGCATT CAAGATACCT
GGCCAAATGG AGTACATGAG GCCCTTATTT GAGGAGGGAC GTGGCCCATT TAGGTGGACT
AGCCTGGTGG GTGATCCTAA TGATATTTAT AAACTGGATG ACGTGTTATT AACCTTATTT
GAAAGAAACA GTAAATTGAT TAGGTGGATT AAGGCGGCTC ATCAATATGT TAAGTTCCAG
GGGTTACCAG CCAGGGTTGT TTATTTAGGC TATGGTGAGA GGGCATTATT CGGTAAAGTG
GTTAGTGAGC TTGTTAGGAA GGGTGAATTA CATGGACCCA TATGGTTTGG TAGAGACCAC
TTAGACAGTG GCTCTGTGGC ATCACCCTTC AGGGAGACTG AAGGCATGCT TGATGGCTCC
GATGCAATTG GTGACTGGCC AATACTCAAC TACGCATTAA ACACTGCAGC AGGGGCAACC
TGGACATGCT TCCACCACGG TGGTGGAGTC GGTATTGGCT TCTCAATACA CGCTGGTTTC
GGTATGGTTG TTGATGGCAG TGAATTAGCT GAGGAGAAGG CGCTCAGGGT GTTTACTGTT
GATCCAGGTT CCGGTGTTGT TAGGCATGCT CATGCCGGCT ACCCCAAGTC ATTAATGGTT
GCCAGGGAGA GGGGGATTAG GATACCGATA ATTAATAGGC TTGAGGAGAA GAGTACACGT
GTAATTGAGG AGGCTTATAG AGAGGGTAGG GTAAGTAAGT TTACGTATGA GAGGGTTAAG
AAGGATTTAG AGGAGTATAA GGCTAGGAGA CAGAATTATC GGACTCCATT TAATACATGA
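
The gene sequence is printed in groups of 10 bases, 60 per line. A short sketch of how such a block could be joined into one string and its GC fraction computed is given here; the helper names are hypothetical, and only the first line of the sequence is used for illustration, so the result is not the whole-gene GC content reported in the table.

```python
def clean_sequence(block):
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence block, copied as printed.
first_line = "ATGTCAGTAC CCCAGAGGTA CAGGGGTAGG CCTATTGAGG AGCTTATATC CTCAGGTTAC"
seq = clean_sequence(first_line)

print(len(seq))
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Passing the full block (all lines of the gene sequence) to `clean_sequence` should give a string whose length matches the gene length listed in the table.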
 
Protein sequence
MSVPQRYRGR PIEELISSGY YDPESRTVKA IKGDEIHVHS RDWQIEGPLR MLFHVLDPMV 
AKDPKNLIVY GGTGKAARSW DDFEDIVDAL LTMDSEDTLV IQSGQPVAIW RLSRSAPRVL
MSNAMLVPKW ADWRVFWELE AKGLISFHQM TAGCWAYIGT QGILQGTYET IGAAADRHFN
GSLEGRLVVS AGLGNMGGAQ PLAIKMLGGV ALIADVDVNM IRRMIDTGYL DTWTDNIDKA
IDMAIDAKEK KQAVSIGVLA NAVDLLEKLI KDNIVPEILT DQTPAHDPLS YVPKGLTIEQ
AMQLRKSDPE KYMVMAKETM KRHVQLMLEL QRMGSVTFEY GNNLRKQAYD AGVNDAFKIP
GQMEYMRPLF EEGRGPFRWT SLVGDPNDIY KLDDVLLTLF ERNSKLIRWI KAAHQYVKFQ
GLPARVVYLG YGERALFGKV VSELVRKGEL HGPIWFGRDH LDSGSVASPF RETEGMLDGS
DAIGDWPILN YALNTAAGAT WTCFHHGGGV GIGFSIHAGF GMVVDGSELA EEKALRVFTV
DPGSGVVRHA HAGYPKSLMV ARERGIRIPI INRLEEKSTR VIEEAYREGR VSKFTYERVK
KDLEEYKARR QNYRTPFNT
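
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons); `translate` is an illustrative helper, not a library function, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene and the first 10 residues of the protein.
print(translate("ATGTCAGTAC CCCAGAGGTA CAGGGGTAGG"))  # MSVPQRYRGR
```

The last 30 bases of the gene, `CAGAATTATC GGACTCCATT TAATACATGA`, translate the same way to `QNYRTPFNT`, followed by the `TGA` stop codon, which matches the end of the protein sequence.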