Gene TM1040_3072 details


Gene Information

Locus tag: TM1040_3072
Symbol:
ID: 4075166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 41351
End bp: 42661
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 63%
IMG OID: 638004573
Product: dihydroorotase
Protein accession: YP_611308
Protein GI: 99078050
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
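
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in this record, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(41351, 42661)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.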


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.490545
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGC TTTTCCTCAA CGCCCGCCTG ATCGATCCCG AAACAGGCAC CGACGCGCCT 
GGCAGCCTCC TGGTGCAACG TGGCAAGATC CTCGCCCGCG CTGATCAAAG CGACAAGGAG
ATGTTTCTTG CGGACAACGG TCTGCGCACC AAAGACGTGC AGATGGTGGA CTGCAACGGC
AAATGCCTTG CCCCCGGGAT CGTGGACATC GGCGTCAAGG TTTGCGAGCC GGGCGAGCGG
CACAAGGAGA GCTACAAATC CGCAGGGCTT GCAGCTGCCG CGGGTGGTGT GACCACCATC
GTGACCCGCC CTGACACCTC CCCCTGCATC GACAGCCCTG AGACGCTGGA ATTCGTCACG
CGGCGCGCGC AAGCAGATGC ACCGGTCAAT GTCCTGCCGA TGGCGGCTCT GACTAAGGGG
CGCGAAGGTC GTGAGATGAC CGAAATCGGC TTTTTGCTGG ACGCTGGCGC CGTGGCCTTC
ACCGATTGCG ATCATGTGGT CACAAGTACC AAGGTGCTGT CGCGCGCCCT GACCTATGCC
AAAAGCTGCG GCGCGCTGGT CATTGCGCAT CCGCAGGAAC CCGGCCTCTC TCAGGGGGCG
GCAGCCACAT CCGGAAAGTT CGCGGCGCTG CGCGGGCTGC CTTCTGTGTC TCCGATGGCC
GAGCGCATGG GGCTTGATCG CGATATCGCA TTGCTGGAGA TGACCGGCGC CAAGTATCAC
GCCGATCAGA TCACCACCGC GCGCGCGCTG CCCGCGCTGG AACGCGCCAA GGCAAACGGG
CTCGACATTA CGGCGGGGAC ATCCATCCAC CATCTCACCC TGAATGAGCT GGACGTGGCC
GACTATCGCA CCTTCTTCAA GGTGAAGCCG CCGCTTCGGT CCGAAGATGA TCGCCTCGCG
GTGGTCGAGG CGGTACGCAG CGGGCTCATT GATGTGATCT CCTCCATGCA CACACCGCAG
GACGAAGAAA GCAAGCGGTT GCCCTTTGAA GAGGCCGCCG CCGGTGCGGT TGCGCTCGAG
ACCCTGTTGC CAGCGGCAAT GCGGCTCTAT CACGCCGAGC TTCTGGACCT GCCAACGCTG
TTTCGTGCCA TGGCGCTTAA CCCGTCTCGA CGGCTTGGGC TTGCCTCCGG ACGACTGAGC
GCGGGCGCAC CTGCGGATCT CGTGCTGTTT GACCCCGACG CCCCCTTGGT GCTGGATCGT
TTCAAGCTGC AGTCGAAATC CAAGAACACG CCTTTTGACA CCCAGCGGAT GCAGGGACGT
GTCTTGGCAA CCTATGTGGC CGGTGAGCCC GTTTATCGAA AGGACGCATG A
 
Protein sequence
MTTLFLNARL IDPETGTDAP GSLLVQRGKI LARADQSDKE MFLADNGLRT KDVQMVDCNG 
KCLAPGIVDI GVKVCEPGER HKESYKSAGL AAAAGGVTTI VTRPDTSPCI DSPETLEFVT
RRAQADAPVN VLPMAALTKG REGREMTEIG FLLDAGAVAF TDCDHVVTST KVLSRALTYA
KSCGALVIAH PQEPGLSQGA AATSGKFAAL RGLPSVSPMA ERMGLDRDIA LLEMTGAKYH
ADQITTARAL PALERAKANG LDITAGTSIH HLTLNELDVA DYRTFFKVKP PLRSEDDRLA
VVEAVRSGLI DVISSMHTPQ DEESKRLPFE EAAAGAVALE TLLPAAMRLY HAELLDLPTL
FRAMALNPSR RLGLASGRLS AGAPADLVLF DPDAPLVLDR FKLQSKSKNT PFDTQRMQGR
VLATYVAGEP VYRKDA
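
As a consistency check, the gene sequence can be translated and compared with the protein sequence listed above. This is a minimal sketch in plain Python, not the method used to produce this record; `translate` and `gc_content` are hypothetical helper names. It uses the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ only in permitted start codons), and it ignores the spaces and line breaks that group the sequence into blocks of ten.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# amino acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACGACGC TTTTCCTCAA CGCCCGCCTG"))
```

Applied to the last line of the gene sequence, `translate` should return the final 16 residues of the protein, VLATYVAGEPVYRKDA, with the terminal TGA read as the stop codon.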