Gene TM1040_1497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1497 
Symbol 
ID4077053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1602053 
End bp1603507 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content60% 
IMG OID638006810 
Productphenylhydantoinase 
Protein accessionYP_613492 
Protein GI99081338 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.204674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAG TCATCAAGAA CGGCACCATT GTGACCGCGG ATCTAACCTA TAAGGCGGAT 
GTCCTGATCG AGGGCGGCGT CATCACCGAA ATCGGCCCGG ATCTGAAGGG CGATGAGGTG
CTGGATGCCA GCGGTTGCTA TGTCATGCCT GGCGGGATTG ATCCGCATAC CCATCTGGAG
ATGCCGTTCA TGGGCACCTA TTCTTCGGAT GATTTTGAAA GTGGCACCCG TGCCGGGCTT
GCGGGCGGGA CCACCATGGT GGTCGATTTT GCGCTTCCGC AGCCGGGTGA GAGCCTGCTC
GATGCGCTCA AGCGCTGGGA CAACAAGTCG ACTCGCGCCA ATTGTGACTA TTCCTTCCAT
ATGGCGGTGA CCTGGTGGGG CGAGCAGGTC TTTGATGAGA TGAAGACCGT CATCGAGACC
CGGGGCATCA ACACCTTCAA GCATTTCATG GCCTACAAGG GCGCCTTGAT GGTGAATGAT
GATGAGCTTT ATGCGTCATT TCAGCGTCTT GCGGAGTTGG GTGGCATCGC CATGGTGCAT
GCCGAGAACG GCGATGTGGT GGCGGAGTTG AGTGCCAAGC TTTTGGCCGA GGGCAATACC
GGCCCCGAAG CGCATGCTTA TTCGCGCCCG CCGCAGGTGG AAGGCGAAGC CACCAACCGG
GCGGTCATGA TTGCGGATAT GGCCGGTGTG CCGCTCTATG TGGTGCACAC CTCCTGTGAA
GAGAGCCACG AGGCCATTCG TCGGGCGCGC ATGCTTGGCA AGCGGGTCTG GGGCGAGCCG
CTCATCCAGC ATCTGACACT GGATGAGAGC GAGTATTTCA ACCCCGATTG GGATCACGCT
GCGCGGCGCG TGATGTCGCC ACCGTTTCGC AACAAACAGC ATCAGGACAG CCTCTGGGCG
GGGCTTCAGT CCGGGTCCCT GTCGGTGGTC GCGACCGATC ACTGCGCGTT CTCGACCGAG
CAGAAAAGAT ACGGCGTGGG CGATTTCACC AAGATCCCCA ACGGCACGGG CGGGCTTGAG
GACCGGATGC CGATGCTCTG GACGCATGGT GTCGAAACTG GCCGCCTGAC GCCCAATGAG
TTTGTCGCGG TGACCTCGAC CAATATTGCC AAGATCCTGA ACTGCTATCC CAAGAAGGGG
GCTGTGCTTG TGGGGGCGGA TGCAGATCTG GTGGTCTGGG ATCCCAAGAA AACCAAGACG
ATCTCTGCCG AGAGCCAGCA ATCTGCCATT GATTACAACG TGTTCGAGGG CAAAGAGGTG
AAGGGCCTGC CGCGCTACAC CCTGACCCGT GGACAGGTCG CCGTGATGGA CGGTGAGATC
AAGACCCAGG AAGGTCACGG CAAATTTGTG GAGCGCGCGC CCAACACCGT GGTGAACAAG
GCGCTCAGCA CCTGGAAAGA GCTGACCGCG CCGCGCCCGG TTGAGCGCAG CGGCATTCCC
GCCACTGGCG TCTAA
 
Protein sequence
MTTVIKNGTI VTADLTYKAD VLIEGGVITE IGPDLKGDEV LDASGCYVMP GGIDPHTHLE 
MPFMGTYSSD DFESGTRAGL AGGTTMVVDF ALPQPGESLL DALKRWDNKS TRANCDYSFH
MAVTWWGEQV FDEMKTVIET RGINTFKHFM AYKGALMVND DELYASFQRL AELGGIAMVH
AENGDVVAEL SAKLLAEGNT GPEAHAYSRP PQVEGEATNR AVMIADMAGV PLYVVHTSCE
ESHEAIRRAR MLGKRVWGEP LIQHLTLDES EYFNPDWDHA ARRVMSPPFR NKQHQDSLWA
GLQSGSLSVV ATDHCAFSTE QKRYGVGDFT KIPNGTGGLE DRMPMLWTHG VETGRLTPNE
FVAVTSTNIA KILNCYPKKG AVLVGADADL VVWDPKKTKT ISAESQQSAI DYNVFEGKEV
KGLPRYTLTR GQVAVMDGEI KTQEGHGKFV ERAPNTVVNK ALSTWKELTA PRPVERSGIP
ATGV