Gene TM1040_1776 details


Gene Information

Locus tag: TM1040_1776
Symbol:
ID: 4076805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1869523
End bp: 1870563
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 62%
IMG OID: 638007091
Product: dihydroorotase
Protein accession: YP_613771
Protein GI: 99081617
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0418] Dihydroorotase
TIGRFAM ID: [TIGR00856] dihydroorotase, homodimeric type
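
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The record states neither; both are assumptions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1869523, 1870563)
print(length, protein_length_aa(length))  # prints "1041 346"
```

Under these assumptions the result agrees with the listed gene length (1041 bp) and protein length (346 aa).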


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.130496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.212063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA GCCTGACGAT CACCCGCCCC GACGACTGGC ATCTGCATCT GCGCGACGGC 
GACATGCTGC GCGCGGTGCT GCCGGAAACA GCCCGCCATT TTGGCCGCGC CATCATCATG
CCCAATCTCG TGCCCCCGGT TGTCACCGGC GCCGAGGCCA GCGCCTATCG CGACCGCATT
CTTGCCGCAC TCCCTGAGGG CATGACGTTC GAGCCCCTGA TGACGCTCTA TCTCACCGAG
GACACGGACC CTGCAGATGT CGCAGCCGCC CATGCCTCGG GTCTGGTCAA AGCCGTCAAG
CTCTACCCCG CTGGCGCCAC CACCAACTCC TCGTCCGGTG TGCGCGATTT CGACAAGGTC
CGCCCCGTGC TTGAGAAAAT GGCCGAAATC GGCCTGCCGC TCTGCACCCA TGGCGAGGTC
ACCGACCACG ACATCGACAT CTTTGACCGC GAGGCCGTCT TTATCGATCG CGTGCTGGAC
CCGATCCGCC AATCCACACC GGGCCTGCGT GTGGTGATGG AGCATATCAC CACCAAGGAC
GCGGCGGATT ATGTGCGATC GCAGGACAAG GATCTTGGCG CGACAATCAC CACGCACCAC
CTGATCATCA ATCGCAACCA CATCCTCGTG GGCGGGATCA AGCCACACTA TTACTGCCTG
CCTGTCGCCA AGCGCGAAGA GCACCGCCTC GCCCTGCGCC AAGCTGCGAC CTCCGGTGAT
GCGCGGTTCT TCCTTGGTAC CGACTCAGCG CCCCACACCG ATGCCAACAA GCTCCAGACC
TGTGGCTGCG CGGGTTGTTT CACGGCGACC AACACCATGG CTCTGCTGGC CCATGTATTT
GAGGAAGAAG GCGCGCTCGA CAAGCTCGAA GGGTTTGCCT CCAAGAACGG CCCCGCCTTT
TATCGCTTAC CCGAAAACGA TGGTCAGATC ACACTGGTGA AACAGGACGC GCCCGTCGCT
TTCCCGGAAC AGATCGACAC GCCGGACGGC CCCGTGACCG TCTTTGATCC AAGCTTTGCG
GTGCATTGGA CCGTCACCTG A
 
Protein sequence
MTQSLTITRP DDWHLHLRDG DMLRAVLPET ARHFGRAIIM PNLVPPVVTG AEASAYRDRI 
LAALPEGMTF EPLMTLYLTE DTDPADVAAA HASGLVKAVK LYPAGATTNS SSGVRDFDKV
RPVLEKMAEI GLPLCTHGEV TDHDIDIFDR EAVFIDRVLD PIRQSTPGLR VVMEHITTKD
AADYVRSQDK DLGATITTHH LIINRNHILV GGIKPHYYCL PVAKREEHRL ALRQAATSGD
ARFFLGTDSA PHTDANKLQT CGCAGCFTAT NTMALLAHVF EEEGALDKLE GFASKNGPAF
YRLPENDGQI TLVKQDAPVA FPEQIDTPDG PVTVFDPSFA VHWTVT
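
The gene and protein sequences above can be related to each other with a short script. This is a minimal sketch, not the method used to produce this record: it assumes the blocks of ten letters are separated only by whitespace, and it uses the standard codon assignments, which translation table 11 shares for elongation codons. Alternative start codons are not handled, which is harmless here because the gene begins with ATG. All names in the code are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in conventional TCAG codon order (third base varies fastest).
# Assumption: table 11 matches the standard code for these assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates the 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCCAGA GCCTGACGAT CACCCGCCCC"))  # prints "MTQSLTITRP"
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and GC content.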