Gene Sbal223_0102 details


Gene Information

Locus tag: Sbal223_0102
Symbol:
ID: 7087367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 117632
End bp: 119299
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 54%
IMG OID: 643459026
Product: urocanate hydratase
Protein accession: YP_002356066
Protein GI: 217971315
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
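
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 117632
end_bp = 119299
gene_length_bp = 1668
protein_length_aa = 555

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1


def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )


print(span_bp, codon_count, expected_protein_aa, record_is_consistent())
```

Under these assumptions the span is 1668 bp, which gives 556 codons and 555 residues, matching the listed gene and protein lengths.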


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAGC GACACGACCC AAGCCGCCGC ATTATTGCAC CGCATGGAAC AAGATTAAGC 
TGCAAAAGCT GGTTGACCGA AGCGCCAATG CGCATGTTAA TGAACAACTT ACATCCCGAT
GTCGCCGAGC GCCCAGAAGA CTTAGTCGTC TATGGTGGTA TCGGCCGCGC CGCTCGCGAC
TGGGATTGCT ATGACAAAAT CATCGAAGTC TTACAACGCC TCGAAGATGA CGAAACCTTA
TTAGTGCAAT CGGGCAAACC TGTGGGCGTA TTTCGCACCC ATGCCGATGC ACCGCGCGTG
CTGATTGCTA ACTCAAACCT AGTGCCACAT TGGGCGAACT GGGAGCATTT CAACGAGTTA
GATAAGCTAG GTTTGGCCAT GTACGGCCAG ATGACCGCAG GTTCTTGGAT CTACATTGGT
ACACAAGGCA TAGTTCAAGG TACCTACGAG ACCTTTGTGT CTGTAGCGAA ACAGCACTTT
GAGGGTATCT CCAAAGGTAA ATGGATCCTC ACCGGCGGGT TAGGCGGCAT GGGCGGCGCG
CAAACGCTGG CGGGCACTAT GGCTGGCTTC TCGGTGTTAG CCTGTGAAGT CGACGAGACT
CGCATCGATT TCCGTTTGCG CACCCGCTAT GTTGACAAAA AAGCCACTTC GCTCGATGAA
GCATTGGCGA TGATTGAAGA GGCAAACCAA GCTGGTAAGC CTGTATCTGT TGGCTTACTA
GCAAATGCCG CCGATGTGTT TGCCGAACTG GTTAAGCGCG GCGTTACACC TGATGTCGTA
ACTGACCAAA CCTCGGCCCA CGATCCATTA AACGGTTATT TGCCGCAGGG TTGGACTATG
GCAGAGGCCG CAGCCATGCG TAAAACCGAC GAAGCGGGCG TAGTGAAAGC AGCAAAAGCC
TCGATGGCGG TGCAAGTACA AGCCATGCTC GACCTGCAAA CCGCGGGTGC AGCAACGCTC
GATTACGGAA ACAACATTCG CCAAATGGCG TTTGAAGTGG GCGTTGAAAA CGCCTTTGAT
TTCCCAGGCT TTGTGCCTGC ATACATTCGC CCGCTGTTCT GTGAGGGCAT TGGCCCGTTC
CGCTGGGTAG CACTGTCTGG CGATCCAGAA GATATCTATA AAACCGACGC CAAAGTGAAA
GAACTTATTC CGGATAATCC ACATCTGCAC AATTGGTTAG ACATGGCGCG TGAGCGTATC
GCCTTCCAAG GTCTGCCTGC GCGTATCTGC TGGGTCGGCT TAAAAGATCG CGCTCGTTTA
GCGTTAGCCT TTAACGAAAT GGTCAAAAAT GGTGAGTTGT CGGCGCCTGT GGTGATTGGC
CGCGATCACT TAGATTCTGG CTCTGTTGCC AGCCCGAACC GCGAAACCGA ATCTATGCTG
GACGGCTCAG ATGCGGTATC CGATTGGCCA TTATTGAATG CACTACTCAA CACCGCCAGC
GGCGCGACTT GGGTATCTTT GCACCACGGC GGCGGCGTCG GCATGGGCTT TAGCCAACAT
TCGGGTGTGG TGATTGTGTG TGACGGTACC GATGCGGCGG CAAAACGGGT TGGCCGTGTG
CTGTGGAATG ACCCAGCGAC AGGCGTGATG CGCCATGCCG ATGCGGGCTA CGAGATTGCG
AAAAACTGCG CCAAAGAGCA GGGGCTCGAC TTACCTATGC AAGAGTAG
 
Protein sequence
MDKRHDPSRR IIAPHGTRLS CKSWLTEAPM RMLMNNLHPD VAERPEDLVV YGGIGRAARD 
WDCYDKIIEV LQRLEDDETL LVQSGKPVGV FRTHADAPRV LIANSNLVPH WANWEHFNEL
DKLGLAMYGQ MTAGSWIYIG TQGIVQGTYE TFVSVAKQHF EGISKGKWIL TGGLGGMGGA
QTLAGTMAGF SVLACEVDET RIDFRLRTRY VDKKATSLDE ALAMIEEANQ AGKPVSVGLL
ANAADVFAEL VKRGVTPDVV TDQTSAHDPL NGYLPQGWTM AEAAAMRKTD EAGVVKAAKA
SMAVQVQAML DLQTAGAATL DYGNNIRQMA FEVGVENAFD FPGFVPAYIR PLFCEGIGPF
RWVALSGDPE DIYKTDAKVK ELIPDNPHLH NWLDMARERI AFQGLPARIC WVGLKDRARL
ALAFNEMVKN GELSAPVVIG RDHLDSGSVA SPNRETESML DGSDAVSDWP LLNALLNTAS
GATWVSLHHG GGVGMGFSQH SGVVIVCDGT DAAAKRVGRV LWNDPATGVM RHADAGYEIA
KNCAKEQGLD LPMQE
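
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The following is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in their permitted start codons). The helper `translate` and the constant names are hypothetical, and only the first 30 and the last 9 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base groups of the gene sequence listed above.
gene_start = "ATGGACAAGC GACACGACCC AAGCCGCCGC"
# Last three codons of the gene sequence listed above.
gene_end = "CAAGAGTAG"

print(translate(gene_start))  # MDKRHDPSRR
print(translate(gene_end))    # QE*
```

The first ten residues match the start of the listed protein sequence, and the final codon TAG is a stop codon, which is why the protein ends with the residues Q and E.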