Gene TM1040_3007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3007 
Symbol 
ID4076580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3175510 
End bp3177198 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content60% 
IMG OID638008336 
Producturocanate hydratase 
Protein accessionYP_615001 
Protein GI99082847 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.237465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATC CCCGCAAGAA TACCCGTGAC ATTTTCCCTG CCACTGGCAC CGAGATCACT 
GCGAAGTCCT GGTTGACGGA GGCTCCGATG CGGATGCTGA TGAACAACCT GCACCCCGAT
GTGGCTGAAA ACCCGCATGA GCTCGTGGTT TATGGCGGGA TTGGCCGGGC GGCACGCACG
TGGCAGGATT TCGACCTCAT CGTCGAGACC CTCAAGACTC TTGAAGAAGA TCAGACCCTG
ATGGTGCAGT CCGGCAAACC CGTCGGCGTC TTTCAGACCC ACAAGGACGC GCCGCGCGTG
TTGATCGCCA ACTCCAACCT CGTGCCGCAT TGGGCCAATT GGGATCACTT CAACGAGCTC
GATAAGAAGG GTCTGATGAT GTACGGCCAG ATGACCGCTG GTTCGTGGAT TTACATCGGC
ACCCAGGGCA TCGTGCAGGG CACCTATGAG ACCTTTGCCG AGGCGGGCCG TCAGCACTAT
GGCGGGGACC TGACGGGAAA ATGGATCCTC ACCGGCGGTC TTGGGGGGAT GGGGGGGGCG
CAGCCTTTGG CTGCGGTTTT TGCTGGTGCT TGCTGCCTTG CGGTGGAGTG CAACCCCGAC
TCGATCGATT TCCGCCTGCG CACCAAATAC CTTGATGAGA AGGCTGAAAC GCTGGACGAA
GCGCTGGAGA TGATTGAGCG TTGGACCAAG GCGGGCGAGG CCAAATCCGT TGGCCTTTTG
GGCAATGCGG CGGATGTGTT TGCAGAGCTT GTGGAGCGTG CGAAGGCGGG TGGCATGCGC
CCCGACATCG TGACGGATCA GACCTCAGCG CATGACCCGG TCAACGGTTA TCTGCCGCAA
GGCTGGACGA TGGCCGAATG GAGAGAGAAG CGCGAAACGG ACAAAAAGGC GGTTGAGAAA
GCCTCTCGGG CGTCGATGAA GGCTCATGTG AAGGCCATGG TGGATTTCCA CGAGATGGGG
ATTCCCACCG TCGATTATGG CAACAACATC CGTCAAGTCG CGCTGGAAGA GGGGCTGGAG
ACGGCGTTCT CATTCCCCGG ATTTGTGCCA GCCTACATTC GCCCGCTGTT CTGCAAGGGG
ATCGGTCCCT TCCGTTGGTG TGCGCTCTCG GGGGATCCCG AGGATATCCG CAAGACCGAT
GCCAAGATGA AAGAGCTGTT CCCGGAGAAC GAGAGCCTGC ACCGCTGGCT CGACATGGCG
CAGGACCGCA TCGCCTTTCA GGGGCTACCG GCGCGGATCT GCTGGATCGG CCTTGGAGAT
CGCCACAAGG CGGGGCTTGC CTTCAACGAA ATGGTGCGCA ACGGCGAATT GTCGGCGCCG
GTCGTGATTG GCCGAGATCA TCTTGACTCG GGTTCCGTGG CATCGCCCAA CCGTGAAACC
GAAGCGATGA TGGATGGATC GGATGCGGTC TCTGACTGGC CATTGCTCAA TGCGCTTTTG
AACACGGCCT CGGGCGCGAC ATGGGTTTCG CTGCATCATG GCGGCGGTGT TGGCATGGGG
TTTTCGCAGC ACTCTGGCGT GGTGATCTGC TGTGACGGCA CAGAGGATGC AGATCGCCGG
ATCGGGCGCG TTCTGTGGAA CGACCCGGCG ACCGGCGTAA TGCGTCACGC AGACGCAGGC
TATGAGATCG CGAAGGACTG CGCCAAAGAG CACGGATTGA ACCTGCCCGG TATCCTGCGC
TCGGAATGA
 
Protein sequence
MSDPRKNTRD IFPATGTEIT AKSWLTEAPM RMLMNNLHPD VAENPHELVV YGGIGRAART 
WQDFDLIVET LKTLEEDQTL MVQSGKPVGV FQTHKDAPRV LIANSNLVPH WANWDHFNEL
DKKGLMMYGQ MTAGSWIYIG TQGIVQGTYE TFAEAGRQHY GGDLTGKWIL TGGLGGMGGA
QPLAAVFAGA CCLAVECNPD SIDFRLRTKY LDEKAETLDE ALEMIERWTK AGEAKSVGLL
GNAADVFAEL VERAKAGGMR PDIVTDQTSA HDPVNGYLPQ GWTMAEWREK RETDKKAVEK
ASRASMKAHV KAMVDFHEMG IPTVDYGNNI RQVALEEGLE TAFSFPGFVP AYIRPLFCKG
IGPFRWCALS GDPEDIRKTD AKMKELFPEN ESLHRWLDMA QDRIAFQGLP ARICWIGLGD
RHKAGLAFNE MVRNGELSAP VVIGRDHLDS GSVASPNRET EAMMDGSDAV SDWPLLNALL
NTASGATWVS LHHGGGVGMG FSQHSGVVIC CDGTEDADRR IGRVLWNDPA TGVMRHADAG
YEIAKDCAKE HGLNLPGILR SE