Gene Shew_3757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_3757 
Symbol 
ID4920749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp4473477 
End bp4475147 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content61% 
IMG OID640165383 
Producturocanate hydratase 
Protein accessionYP_001095882 
Protein GI127514685 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0181663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA GACACGACCC TAGCCGTCGC ATCATAGCGC CACATGGCAC TACGCTGAGC 
TGTAAGAGCT GGCTCACCGA GGCGCCGATG CGCATGCTGA TGAACAACCT GCACCCAGAT
GTCGCCGAGC GTCCGGAAGA CCTGGTGGTC TACGGCGGTA TCGGCCGTGC GGCCCGTGAC
TGGCAATGCT ATGACAAGAT CGTAGAGGTG CTGCAGCGCC TAGAAGAAGA TGAGACCCTG
CTGGTACAGT CGGGCAAGCC TGTGGGCGTG TTCAAGACCC ACAGCAACGC GCCGCGCGTC
ATCATCGCCA ACTCTAACCT GGTGCCACAC TGGGCCAACT GGGAACACTT CAACGAGCTG
GATAAGAAGG GCCTGGCCAT GTATGGCCAG ATGACTGCAG GTTCTTGGAT CTACATCGGC
TCTCAGGGCA TTGTCCAGGG CACCTACGAG ACCTTCGTGG CCATGGCCAA GCAACACTTT
GGTGGCGATG CCAGCGGCAA GTGGATCCTC ACCGGCGGCC TGGGTGGCAT GGGTGGCGCT
CAGCCACTGG CCGGCACCAT GGCGGGCTAC TCTGTACTGG CCTGTGAGGT GGACGAGACT
CGCATCGACT TCCGTCTACG TACCCGTTAT GTGGACAAGA AGGCCACTAG CCTGGATGAG
GCGCTGGCGA TGATCGACGA AGCCAACAAG AGCGGCAAGC CAGTGTCTGT CGGCCTGCTG
GCCAACGCCG CCGACATCTT CGCCGAGCTG GTAGAGCGTG GCATCACCCC GGATGTAGTG
ACCGACCAGA CCTCGGCCCA CGATCCACTA AACGGCTATC TGCCACAGGG TTGGACCCTG
GAATACGCCG CCGAGATGCG TAAGCAAGAT GAGGCGGCCG TGGTTAAGGC GGCCAAGCAG
TCGATGGCGG TACAGGTTAA AGCCATGCTG GCCCTGCAGG CGGCAGGTGC GGCCACCACA
GACTATGGTA ACAACATTCG CCAGATGGCG TTCGAAGAGG GCGTGGAAAA TGCCTTCGAC
TTCCCAGGCT TCGTGCCCGC CTATGTGCGC CCACTCTTCT GCGAAGGCAT AGGTCCCTTC
CGCTGGGCGG CGCTCTCTGG CGATCCGGAA GATATCTACA AGACAGATGC CAAGGTGAAG
GAGCTGATCC CAGACAACCC ACACCTGCAT AACTGGCTGG ACATGGCGCG TGAGCGCATC
GCTTTCCAGG GCCTGCCGGC ACGTATCTGC TGGGTCGGCC TGAAAGACAG GGCGCGTCTG
GCTAAGGCCT TCAACGAGAT GGTGAAAAAC GGTGAGCTGT CGGCGCCAAT CGTGATCGGT
CGTGACCATC TGGATTCTGG CTCTGTGGCA AGCCCTAACC GCGAGACCGA ATCTATGCTT
GACGGCAGCG ATGCGGTATC GGATTGGCCG CTGATGAACG CCCTACTTAA CACGGCAAGC
GGCGCGACCT GGGTGTCTCT GCACCATGGC GGCGGCGTCG GCATGGGCTT CAGCCAACAC
TCGGGTGTGG TGATCGTTGC CGATGGTACC GACGAGGCCG AGGCGCGTCT GGGCCGTGTG
CTGTGGAACG ACCCTGCCAC TGGCGTGATG CGTCATGCGG ATGCTGGCTA TGAGATCGCC
AAGCAGTGCG CCAAGGAGCA GGGCCTGGAT TTGCCTATGC TCGACCTATA A
 
Protein sequence
MDKRHDPSRR IIAPHGTTLS CKSWLTEAPM RMLMNNLHPD VAERPEDLVV YGGIGRAARD 
WQCYDKIVEV LQRLEEDETL LVQSGKPVGV FKTHSNAPRV IIANSNLVPH WANWEHFNEL
DKKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVAMAKQHF GGDASGKWIL TGGLGGMGGA
QPLAGTMAGY SVLACEVDET RIDFRLRTRY VDKKATSLDE ALAMIDEANK SGKPVSVGLL
ANAADIFAEL VERGITPDVV TDQTSAHDPL NGYLPQGWTL EYAAEMRKQD EAAVVKAAKQ
SMAVQVKAML ALQAAGAATT DYGNNIRQMA FEEGVENAFD FPGFVPAYVR PLFCEGIGPF
RWAALSGDPE DIYKTDAKVK ELIPDNPHLH NWLDMARERI AFQGLPARIC WVGLKDRARL
AKAFNEMVKN GELSAPIVIG RDHLDSGSVA SPNRETESML DGSDAVSDWP LMNALLNTAS
GATWVSLHHG GGVGMGFSQH SGVVIVADGT DEAEARLGRV LWNDPATGVM RHADAGYEIA
KQCAKEQGLD LPMLDL