Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3581 |
Symbol | |
ID | 5077730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 199847 |
End bp | 201334 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481305 |
Product | thymidine phosphorylase |
Protein accession | YP_001165967 |
Protein GI | 146275807 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTGA AAATCAAGCG CATCGCGATC GACACTCACC CGGAGAACAC CGCGTTCCTC CTGCGCAGGA GGAACGGCTA TTCGCCCGAA CAGTTCGTCG CGCTGCGCAA GATCGAGATC ACCGGCGGCG ATGCGTCGAT CCTCGCCACC CTGGCGCTGA TCGATGACGA GAGCCTGCTC GAACCGCACA TGATCGGCCT TGGCGAACAG GCCTTCCGCC GCCTCGGCCT GCCCGAAGGC GCGGAAGTAA CCTTCCGCCA GGCGCCCGTC CCGCACAGCC TCGAACACGT CCGTCGCAAG ATCGACGGCG ACGAACTGGA AGAAACCGAG ATCGCGGAGA TCATCCGCGA CATCGCCGGA TACCGCTATT CCCCGATGGA GATCGGGGCC TTCCTCGTCG CCTGCGCCGG GTTCATGTCC ACGCACGAAA CGCTTGCCCT CACCCGCGCA ATGGCCGGCG TCGGCCGCCA GATGCACTGG CCGTCCGAAA TCGTCGTCGA CAAGCACTGC ATCGGCGGCA TCCCCGGCAA CCGCACCTCG ATGATCATCG TGCCGATCAT CGCCGCGCAC GGGCTGACCA TGCCCAAGAC CTCGTCGCGC GCGATCACCT CGCCATCGGG CACGGCCGAC ACGATGGAAG TCCTCGCCTC GGTGGACCTG CCAGAGGACC GGCTCGTCTC CATCGTGGCA AAGGAACACG CGGTGCTTGC CTGGGGCGGC CGGGTGAACC TGTCGCCCGC CGACGACGTG CTGATCACCG TCGAGCGTCC GTTGCGGATC GACACCTTCG ACCAGATGGT CGCCTCGATC CTGTCGAAGA AGCTGGCCGC CGGCTCGACC CACCTTCTCA TCGACATTCC CGTCGGCCCC ACCGCCAAGG TCCGCACCAC GCGCGAGGCG ATCCGCCTGC GCAAGCTGTT CGAATACGTC GGCCATCGTC TCGGCCTCGT TCTCGACATC GTCGTCACCG ACGGCTCGCA GCCGGTGGGC CGGGGCGTGG GCCCCGTGCT CGAGGCGCGC GACGTGATGG CCGTCCTGCG CAACGAGGAC GACGCACCCC GGGACTTGCG CGAACGCGCC GTCATGCTCG CGGGCCGGGT GCTCGAATTC GATCCCGCGC TGGCGGGCGG CAAGGGCTAT GCCCGCGCGA TGGAACTGCT CGGTTCCGGT GCCGCGCTGG CGGCGATGGA GCGCCTGATC GATGCGCAGG GCCGGTGCCG CGAAGTGATC CTTCCCGGCA GCCACGTCCA CGATATCTGC GCGCCAGCCG GCGGGACGGT CATGTCCATC GATTGCCACC TGATCGCGCG CATCGCCCGC CTTGCCGGCG CGCCGATGGA CAAGGGCGCG GGAATCGACC TGCTGCACAA GGTGGGCGAC CGGGTGCGCG CTGACGAAGT GCTCTATCGC ATCCACGCCC ACTCTCCGAC CGGCCTCGAA TATGCGCGCG AACTGGCCGT GGCGAGTTCC GGTTACGTGG TCGGATGA
|
Protein sequence | MALKIKRIAI DTHPENTAFL LRRRNGYSPE QFVALRKIEI TGGDASILAT LALIDDESLL EPHMIGLGEQ AFRRLGLPEG AEVTFRQAPV PHSLEHVRRK IDGDELEETE IAEIIRDIAG YRYSPMEIGA FLVACAGFMS THETLALTRA MAGVGRQMHW PSEIVVDKHC IGGIPGNRTS MIIVPIIAAH GLTMPKTSSR AITSPSGTAD TMEVLASVDL PEDRLVSIVA KEHAVLAWGG RVNLSPADDV LITVERPLRI DTFDQMVASI LSKKLAAGST HLLIDIPVGP TAKVRTTREA IRLRKLFEYV GHRLGLVLDI VVTDGSQPVG RGVGPVLEAR DVMAVLRNED DAPRDLRERA VMLAGRVLEF DPALAGGKGY ARAMELLGSG AALAAMERLI DAQGRCREVI LPGSHVHDIC APAGGTVMSI DCHLIARIAR LAGAPMDKGA GIDLLHKVGD RVRADEVLYR IHAHSPTGLE YARELAVASS GYVVG
|
| |