Gene Information |
Locus tag | TM1040_3072 |
Symbol | |
ID | 4075166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 41351 |
End bp | 42661 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638004573 |
Product | dihydroorotase |
Protein accession | YP_611308 |
Protein GI | 99078050 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
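The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 41351
end_bp = 42661
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
residues = codons - 1

print(span_bp, remainder, residues)
```

Under these assumptions the computed span agrees with the listed Gene Length, and the residue count agrees with the listed Protein Length.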
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.490545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC TTTTCCTCAA CGCCCGCCTG ATCGATCCCG AAACAGGCAC CGACGCGCCT GGCAGCCTCC TGGTGCAACG TGGCAAGATC CTCGCCCGCG CTGATCAAAG CGACAAGGAG ATGTTTCTTG CGGACAACGG TCTGCGCACC AAAGACGTGC AGATGGTGGA CTGCAACGGC AAATGCCTTG CCCCCGGGAT CGTGGACATC GGCGTCAAGG TTTGCGAGCC GGGCGAGCGG CACAAGGAGA GCTACAAATC CGCAGGGCTT GCAGCTGCCG CGGGTGGTGT GACCACCATC GTGACCCGCC CTGACACCTC CCCCTGCATC GACAGCCCTG AGACGCTGGA ATTCGTCACG CGGCGCGCGC AAGCAGATGC ACCGGTCAAT GTCCTGCCGA TGGCGGCTCT GACTAAGGGG CGCGAAGGTC GTGAGATGAC CGAAATCGGC TTTTTGCTGG ACGCTGGCGC CGTGGCCTTC ACCGATTGCG ATCATGTGGT CACAAGTACC AAGGTGCTGT CGCGCGCCCT GACCTATGCC AAAAGCTGCG GCGCGCTGGT CATTGCGCAT CCGCAGGAAC CCGGCCTCTC TCAGGGGGCG GCAGCCACAT CCGGAAAGTT CGCGGCGCTG CGCGGGCTGC CTTCTGTGTC TCCGATGGCC GAGCGCATGG GGCTTGATCG CGATATCGCA TTGCTGGAGA TGACCGGCGC CAAGTATCAC GCCGATCAGA TCACCACCGC GCGCGCGCTG CCCGCGCTGG AACGCGCCAA GGCAAACGGG CTCGACATTA CGGCGGGGAC ATCCATCCAC CATCTCACCC TGAATGAGCT GGACGTGGCC GACTATCGCA CCTTCTTCAA GGTGAAGCCG CCGCTTCGGT CCGAAGATGA TCGCCTCGCG GTGGTCGAGG CGGTACGCAG CGGGCTCATT GATGTGATCT CCTCCATGCA CACACCGCAG GACGAAGAAA GCAAGCGGTT GCCCTTTGAA GAGGCCGCCG CCGGTGCGGT TGCGCTCGAG ACCCTGTTGC CAGCGGCAAT GCGGCTCTAT CACGCCGAGC TTCTGGACCT GCCAACGCTG TTTCGTGCCA TGGCGCTTAA CCCGTCTCGA CGGCTTGGGC TTGCCTCCGG ACGACTGAGC GCGGGCGCAC CTGCGGATCT CGTGCTGTTT GACCCCGACG CCCCCTTGGT GCTGGATCGT TTCAAGCTGC AGTCGAAATC CAAGAACACG CCTTTTGACA CCCAGCGGAT GCAGGGACGT GTCTTGGCAA CCTATGTGGC CGGTGAGCCC GTTTATCGAA AGGACGCATG A
|
Protein sequence | MTTLFLNARL IDPETGTDAP GSLLVQRGKI LARADQSDKE MFLADNGLRT KDVQMVDCNG KCLAPGIVDI GVKVCEPGER HKESYKSAGL AAAAGGVTTI VTRPDTSPCI DSPETLEFVT RRAQADAPVN VLPMAALTKG REGREMTEIG FLLDAGAVAF TDCDHVVTST KVLSRALTYA KSCGALVIAH PQEPGLSQGA AATSGKFAAL RGLPSVSPMA ERMGLDRDIA LLEMTGAKYH ADQITTARAL PALERAKANG LDITAGTSIH HLTLNELDVA DYRTFFKVKP PLRSEDDRLA VVEAVRSGLI DVISSMHTPQ DEESKRLPFE EAAAGAVALE TLLPAAMRLY HAELLDLPTL FRAMALNPSR RLGLASGRLS AGAPADLVLF DPDAPLVLDR FKLQSKSKNT PFDTQRMQGR VLATYVAGEP VYRKDA
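The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC percentage, and translate a nucleotide string into protein. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters outside TCAG are reported as 'X'.
    Trailing bases that do not form a full codon are ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Usage: the first three blocks of the gene sequence listed above.
prefix = clean_sequence("ATGACGACGC TTTTCCTCAA CGCCCGCCTG")
print(translate(prefix))  # MTTLFLNARL
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` or `translate` would allow a comparison against the GC content and Protein sequence fields of the record.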