Gene Shewmr4_0099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0099 
Symbol 
ID4250978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp109241 
End bp110911 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content55% 
IMG OID638116641 
Producturocanate hydratase 
Protein accessionYP_732237 
Protein GI113968444 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAGC GACACGACCC AAGCCGCCGC ATTATTGCGC CCCATGGTAG CCAATTAAGC 
TGCAAAAGCT GGTTAACCGA AGCGCCAATG CGCATGTTAA TGAACAACTT ACATCCAGAT
GTGGCCGAGC GCCCAGAGGA TTTAGTGGTT TACGGTGGTA TCGGCCGCGC GGCCCGCGAC
TGGGACTGCT ACGACAAGAT TATCGAAGTA CTAAAACGCC TCGAAGACGA CGAAACCTTA
ATGGTGCAAT CGGGTAAACC TGTGGGCGTA TTCCGTACCC ACGCCGATGC TCCGCGCGTG
CTGATTGCTA ACTCTAACCT AGTGCCACAT TGGGCAAACT GGGAACACTT CAACGAGTTA
GATAAGCAAG GTCTGGCGAT GTATGGCCAG ATGACTGCGG GTTCGTGGAT TTATATCGGG
ACTCAGGGCA TTGTCCAAGG TACCTACGAA ACCTTCGTTG CCGTGGCAAA ACAACACTTT
GGCGGTGTGG CTGCCGGTAA GTGGATCCTC ACCGGTGGTT TAGGTGGTAT GGGCGGCGCG
CAAACGCTGG CCGGTACTAT GGCTGGCTTC TCTGTGCTTG CCTGTGAAGT CGATGAAACC
CGTATCGATT TTCGTCTGCG TACCCGTTAT GTGGATAAAA AGGCCACTTC ATTAGACGAA
GCCTTGGCTA TGATCAACGA CGCCAATGCC TCGGGCAAAC CCGTATCTGT GGGCCTGCTG
GCTAACGCCG CCGATATCTT TGCCGAATTG GTGAAGCGTG GCATTACTCC TGATGTTGTC
ACCGACCAAA CCTCTGCCCA CGATCCGCTG AACGGCTATT TGCCACAGGG CTGGACCATG
GCGCAAGCGG CTGACATGCG TAAAACCGAT GAAGCCGCCG TCGTTAAAGC GGCTAAAGCC
TCAATGGCGG TACAGGTGCA AGCCATGCTG GATCTGCAAG CGGCAGGCGC GGCGACCTTA
GACTATGGTA ACAACATTCG CCAAATGGCC TTCGAAACTG GCGTTAAAAA CGCGTTCGAT
TTCCCTGGTT TTGTGCCTGC TTATATTCGC CCACTCTTCT GCGAAGGTAT CGGCCCGTTC
CGTTGGGTAG CCCTGTCTGG CGATCCTGAG GATATCTACA AGACCGACGC CAAAGTGAAG
GAGCTGATCC CCGACAATCC TCATTTGCAC AACTGGTTAG ATATGGCCCG TGAACGTATC
GCCTTCCAAG GTCTACCTGC GCGTATCTGC TGGGTGGGCC TAAAGGACCG TGCGCGTTTA
GCCCAAGCCT TTAACGAGAT GGTGAAAAAC GGCGAGCTGT CGGCGCCCAT CGTCATTGGT
CGTGACCACT TAGACTCAGG CTCAGTCGCC AGCCCTAACC GTGAAACCGA ATCTATGATG
GACGGTTCAG ACGCAGTATC TGACTGGCCA TTACTGAACG CTCTGCTAAA CACTGCCAGC
GGCGCGACTT GGGTGTCGCT GCACCACGGC GGTGGCGTAG GCATGGGCTT TAGCCAACAC
TCGGGTGTAG TGATCGTGTG TGACGGTACT GAGGCCGCCG CTAAACGTGT TGGCCGCGTA
CTGTGGAACG ACCCTGCGAC AGGCGTGATG CGCCATGCGG ATGCGGGTTA CGAGATCGCG
AAAAACTGCG CAAAAGAGCA GGGCTTAGAT CTGCCCATGC TTAAAGACTA G
 
Protein sequence
MDKRHDPSRR IIAPHGSQLS CKSWLTEAPM RMLMNNLHPD VAERPEDLVV YGGIGRAARD 
WDCYDKIIEV LKRLEDDETL MVQSGKPVGV FRTHADAPRV LIANSNLVPH WANWEHFNEL
DKQGLAMYGQ MTAGSWIYIG TQGIVQGTYE TFVAVAKQHF GGVAAGKWIL TGGLGGMGGA
QTLAGTMAGF SVLACEVDET RIDFRLRTRY VDKKATSLDE ALAMINDANA SGKPVSVGLL
ANAADIFAEL VKRGITPDVV TDQTSAHDPL NGYLPQGWTM AQAADMRKTD EAAVVKAAKA
SMAVQVQAML DLQAAGAATL DYGNNIRQMA FETGVKNAFD FPGFVPAYIR PLFCEGIGPF
RWVALSGDPE DIYKTDAKVK ELIPDNPHLH NWLDMARERI AFQGLPARIC WVGLKDRARL
AQAFNEMVKN GELSAPIVIG RDHLDSGSVA SPNRETESMM DGSDAVSDWP LLNALLNTAS
GATWVSLHHG GGVGMGFSQH SGVVIVCDGT EAAAKRVGRV LWNDPATGVM RHADAGYEIA
KNCAKEQGLD LPMLKD