Gene Information |
Locus tag | Saro_2039 |
Symbol | |
ID | 3917686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2175675 |
End bp | 2177105 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444791 |
Product | uracil-DNA glycosylase superfamily protein |
Protein accession | YP_497312 |
Protein GI | 87200055 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
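The coordinates and lengths listed above agree with each other, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming 1-based inclusive coordinates and that the 1431 bp gene length includes the stop codon. Both assumptions fit the values listed but are not stated by the record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(2175675, 2177105)
print(gene_len, protein_length(gene_len))  # prints: 1431 476
```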
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCGG ACAGGCGCCG GTTTGAGGCC TTCCGCGTAC AATTCGCCGC GTCGGACGAT TTCGAAGGCT GGCGTGACGC CGCCCGCCGC ATGATTCGCG CGAAGATTGC CCCAGACCAG GTGATGTGGG AATCTCCCGC CGACCAGTCT GCCGATATCT TCGCCCGAGG CGGTGTCGCC CTGCCATCGC CGCCGACGGA CGCTCCCCAA CCGCGCGCGT CGAAGGACTT CCTTCAACTT GCGCAAAGCG TCATCCTTCA TTCTGGCAGT AAACGGTTTT CTACTCTTTA TCGTACGCTT TGGCGGCTCC AGTCTCGGCC CCGGCTGATG GACGACAAGG CCGATGCCGA CGTGCGGGCG ATGGAGGACC TTGCCCGGCA GGTGCGGCGC GACATCCACA AGATGCGCGC CTTCGTCCGC TTCCGCAGCG TCGAGGGCGA GGCGGGAGAG CGATATGTTG CCTGGTTCGA GCCCGAGCAT CACATCCTGC GCGCGAATGC GGGCTTCTTC GTCCGTCGCT TCACCACCAT GCAGTGGTCG ATCCTAACCC CGCGCGGTAG CCTGCACTGG GATGGCGAGA CGCTGCACGA GGGGCCGCCG GCCACCCGCG CCGATGCGCC TTCCGGCGAT CCGGTCGAAG GGCTGTGGCG TACCTACTAT GCATCGATCT TCAATCCCGC GCGTTTGAAG GTCGGGGCCA TGCTCAAGGA AATGCCCCGC AAATACTGGA AGAACATGCC GGAGGCGGCG CTCATTCCCG AATTGATCGC AGGGGCACAA TCACGCGAGG CACGAATGGT ACAGGCTGGC GAGCAGGATC TCGGTGAGAC GCCAGTGAGC ATCGATGCCA TCGGCGCGGC CATCCTGGCC TGCCGTCGGT GCGACATCGG CTGCAATGGC ACCCGGGCGG TCATGGGCGA GGGACCGCAC GACGCGGCCC TGATGATCAT CGGCGAGCAG CCCGGCGAAC AGGAAGAGGC GCAGGGCCGG CCTTTTGTCG GTCCTGCCGG TCAACTGCTG CGCACCCATC TCGAACATGC CGGCATTCCG GCGGAGCGCG CTTACGTCAC CAATGCGGTC AAGCACTTCA AGTTCATGCC GCAGGGAAAG CGTCGCCTGC ACCAGAACCC GTCAGCCAGG GAAATCGACG TGTGCCGCTG GTGGCTCGAG GGTGAACGCG GGCTGGTTCG CCCGCGCCTG ATCCTCGCGC TGGGGGCAAG TGCGGCGCGC AGCCTGCTGG GCAGGACTGT CAGCGTCCAG AAGGTGCGGG GTGCACCGCA TGTGCTGGAC GATGGCAGCG AACTGTGGAT CACCACCCAC CCCAGCTACC TCCTGCGCCT GGACGACGGC GGGCGTTCGG AAGAAGAAGC CAGATTTTCA AATGACTTGC AGAAGGTAGC TGCGCGGCTT TCGCAGATTT CGTCCGGCTA G
|
Protein sequence | MMPDRRRFEA FRVQFAASDD FEGWRDAARR MIRAKIAPDQ VMWESPADQS ADIFARGGVA LPSPPTDAPQ PRASKDFLQL AQSVILHSGS KRFSTLYRTL WRLQSRPRLM DDKADADVRA MEDLARQVRR DIHKMRAFVR FRSVEGEAGE RYVAWFEPEH HILRANAGFF VRRFTTMQWS ILTPRGSLHW DGETLHEGPP ATRADAPSGD PVEGLWRTYY ASIFNPARLK VGAMLKEMPR KYWKNMPEAA LIPELIAGAQ SREARMVQAG EQDLGETPVS IDAIGAAILA CRRCDIGCNG TRAVMGEGPH DAALMIIGEQ PGEQEEAQGR PFVGPAGQLL RTHLEHAGIP AERAYVTNAV KHFKFMPQGK RRLHQNPSAR EIDVCRWWLE GERGLVRPRL ILALGASAAR SLLGRTVSVQ KVRGAPHVLD DGSELWITTH PSYLLRLDDG GRSEEEARFS NDLQKVAARL SQISSG
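The protein sequence is the conceptual translation of the gene sequence. Although the gene lies on the minus strand, the gene sequence is listed in coding orientation: it begins ATG ATG CCG, matching MMP. The sketch below shows that translation and a GC-content calculation. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names are illustrative, and the 10-base spacing used in the record is stripped before processing.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence in the record.
print(translate("ATGATGCCGG ACAGGCGCCG G"))  # prints: MMPDRRR
```

Passing the full gene sequence to `translate` and `gc_percent` lets the results be compared with the protein sequence and the GC content given in the record.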