Gene TM1040_1500 details


Gene Information

Locus tag: TM1040_1500
Symbol:
ID: 4077056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1606078
End bp: 1607382
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 59%
IMG OID: 638006813
Product: dihydropyrimidine dehydrogenase
Protein accession: YP_613495
Protein GI: 99081341
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein
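
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1606078
end_bp = 1607382
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: inclusive coordinates, so both endpoints belong to the gene.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed
# to be the stop codon, which encodes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # span matches "Gene Length"
print(remainder == 0)                             # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)   # matches "Protein Length"
```

Under these assumptions the three fields are mutually consistent: 1305 bp is 435 codons, of which 434 encode residues.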


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.190066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATC TTACAACAGA ATTCCTCGGT ATCAAATCGC CGAATCCTTT CTGGCTGGCC 
TCCGCGCCGC CCACCGACAA AGAATATAAC GTGCGCCGCG CCTTTGAGGC GGGCTGGGGC
GGTGTGGTCT GGAAGACCCT CGGCGCCGAA GGGCCACCGG TTGTCAACGT GAACGGGCCG
CGGTATGGCG CGATCTGGGG GGCGGACCGC CGTCTTCTGG GGCTCAACAA CATCGAACTG
ATCACGGATC GCCCGCTGGA TGTGAACCTC GAGGAGATGA CCCGCGTCAA AAAGGACTAC
CCGGATCGCG CGCTGATTGC GTCGATCATG GTGCCCTGTG AAGAGGCGGC TTGGAAAGCG
ATCCTGCCGC GTGTGGCGGA AACAGGATGT GACGGGATCG AGCTCAACTT TGGCTGCCCG
CATGGGATGG CCGAGCGCGG CATGGGTTCT GCTGTGGGGC AGGTTCCGGA ATACATTCAG
ATGGTCACCG AATGGTGCAA ACAGTATTAT GACAAGCCGG TGATCGTGAA GCTCACGCCC
AATATCACCG ACATTCGTCA TCCGGCGCGG GCCGCGAAGG CCGGCAATGC CGATGCTGTG
TCTCTGATCA ACACCATCAA TTCGATCACC TCGGTCAACC TTGATGCAAT GTCGCCCGAA
CCGATGATTG GCGGCAAGGG CACCCATGGC GGCTATTGCG GCCCGGCGGT GAAACCGATC
GCCATGAATA TGGTGGCCGA AATTTCCCGC GATCCGCAAA CCGCAGGTCT GCCTATTTCC
GCCATTGGCG GCGTGACAAC ATGGCGCGAT GCGGCGGAGT TCATCGCTCT TGGGGCTGGC
AATGTGCAGG TTTGCACGGC GGCCATGACC TATGGGTTCA AGGTCGTTGA AGAGATGATT
TCGGGCCTGT CGGATTGGAT GGACGAGAAG GGCTATTCCT CGATCGAGGA CTTCCGTGGC
ATGGCGGTTC CGAATGTGAC CGACTGGCAG TATCTGGACC TCAACTATGT GACCAAGGCC
AAGATCTCTC AGGATGACTG CATCAAATGC GGACGTTGCT ATGCGGCCTG CGAGGATACC
TCGCATCAGG CGATTGAGAT GTCGGCGGAT CGGACCTTTA CCGTGAAGGA CGACGAATGC
GTGGCGTGTA ACCTGTGCGT CAACGTCTGT CCGGTTGAAG GCTGTATCAC CATGGAAGAG
GTTGCCGTGG GCGCCATTGA TGAACGCACC GGCAAGGTGG TGAGCGGCGA ATATGGCAAC
TGGACCCAGC ACCCTAATAA TCCGTCTGCA ACGGCTGCGG AATAA
 
Protein sequence
MADLTTEFLG IKSPNPFWLA SAPPTDKEYN VRRAFEAGWG GVVWKTLGAE GPPVVNVNGP 
RYGAIWGADR RLLGLNNIEL ITDRPLDVNL EEMTRVKKDY PDRALIASIM VPCEEAAWKA
ILPRVAETGC DGIELNFGCP HGMAERGMGS AVGQVPEYIQ MVTEWCKQYY DKPVIVKLTP
NITDIRHPAR AAKAGNADAV SLINTINSIT SVNLDAMSPE PMIGGKGTHG GYCGPAVKPI
AMNMVAEISR DPQTAGLPIS AIGGVTTWRD AAEFIALGAG NVQVCTAAMT YGFKVVEEMI
SGLSDWMDEK GYSSIEDFRG MAVPNVTDWQ YLDLNYVTKA KISQDDCIKC GRCYAACEDT
SHQAIEMSAD RTFTVKDDEC VACNLCVNVC PVEGCITMEE VAVGAIDERT GKVVSGEYGN
WTQHPNNPSA TAAE
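
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for internal codons (the table differs in which codons may act as starts; this gene begins with ATG, so that does not matter here). The helper names `clean_sequence`, `translate`, and `gc_fraction` are hypothetical and not part of the source database. Only the first and last rows of the sequence are used as examples, which works because every full row holds 60 bases and therefore preserves the reading frame.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments as in the
# standard code, which translation table 11 shares for internal codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of G and C bases; the record's rounding rule is not stated."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene and protein sequences, copied from above.
first_gene_row = "ATGGCTGATC TTACAACAGA ATTCCTCGGT ATCAAATCGC CGAATCCTTT CTGGCTGGCC"
first_protein_row = "MADLTTEFLG IKSPNPFWLA"
last_gene_row = "TGGACCCAGC ACCCTAATAA TCCGTCTGCA ACGGCTGCGG AATAA"
last_protein_row = "WTQHPNNPSA TAAE"

print(translate(first_gene_row) == clean_sequence(first_protein_row))
print(translate(last_gene_row) == clean_sequence(last_protein_row))
```

To check the whole record, the full gene sequence could be passed to `translate` and `gc_fraction` in the same way and compared against the listed protein sequence and GC content.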