Gene TM1040_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1550 
Symbol 
ID4075848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1656807 
End bp1657865 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content64% 
IMG OID638006863 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_613545 
Protein GI99081391 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.11793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.477736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGA GCGAGAAACT TGCCCTCGGG CTGATGCATC GCCTTGATCC CGAAACCGCG 
CATGGTCTGT CGATCAAGGC GCTCAGAGCG GGGCTGACGC CACGCCCTGG TCCGGTGACA
TCACCGCGCC TGCGCACGGA TGTGGCGGGT CTTTCGCTGC CGAACCCGGT GGGGCTTGCA
GCCGGGTTTG ACAAGAACGC CGAAGCGCTT GCTCCGCTTT CAGAGGCTGG CTTTGGATTT
ATCGAAGTGG GGGCCGCCAC GCCGCGCCCG CAACCGGGCA ACCCCAAGCC GCGCCTTTTT
CGTCTGAGCG AAGATCGCGC CGCCATCAAC CGCTTTGGCT TTAACAATGA GGGCATGGAC
ACGATCGGGA AGCGCCTCGC GCAGCGTCCA AAGTCGGGCG TGATCGGCCT CAACCTCGGG
GCCAACAAGG ACAGCGAGGA CCGCGCGCAG GATTTCGCCC GCGTGCTCAG CCATTGCGGC
GCGCATCTGG ATTTTGCCAC CGTGAACGTG TCGTCGCCCA ACACAGAGAA ACTGCGCGAT
CTGCAGGGCA AGGATGCTCT TGCTTCACTG CTGGCAGGGG TCATTGACGC CCGAGAGGCC
CTGCAGCGCC CCATCCCGGT CTTTCTCAAG ATTGCGCCGG ATCTCGACAT ATCCGGGCTT
GATGACATTG CCGAGGTCGC GCGTGACAGC GGCATTGATG CGGTGATCGC CACCAACACG
ACGCTTTCGC GCGACGGCCT GAAAAGCACG CACCGGGACG AGATGGGCGG CCTCTCGGGC
GCGCCCCTAT TTGAGCGCTC GACACGGGTG CTGGCGCAGC TTTCGCAACG TCTGGATGGG
GCGGTGCCGA TCATCGGCGT CGGGGGCATC TCCACGGCTG AAGGCGCCTA TGCCAAGATC
CGCGCTGGAG CCTCGGCGGT GCAGCTTTAT ACGGCGCTGG TCTACGGCGG GCTGTCGCTG
GCCTCTGAGG TCGCTTCGGG TCTTGATGCA TTGCTCGCGC GAGACGGGTT TTCAAATGTC
GCGGAAGCGG TTGGCACAGG GCGTGCGGAC TGGCTCTGA
 
Protein sequence
MKLSEKLALG LMHRLDPETA HGLSIKALRA GLTPRPGPVT SPRLRTDVAG LSLPNPVGLA 
AGFDKNAEAL APLSEAGFGF IEVGAATPRP QPGNPKPRLF RLSEDRAAIN RFGFNNEGMD
TIGKRLAQRP KSGVIGLNLG ANKDSEDRAQ DFARVLSHCG AHLDFATVNV SSPNTEKLRD
LQGKDALASL LAGVIDAREA LQRPIPVFLK IAPDLDISGL DDIAEVARDS GIDAVIATNT
TLSRDGLKST HRDEMGGLSG APLFERSTRV LAQLSQRLDG AVPIIGVGGI STAEGAYAKI
RAGASAVQLY TALVYGGLSL ASEVASGLDA LLARDGFSNV AEAVGTGRAD WL