Gene Saro_3581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3581 
Symbol 
ID5077730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp199847 
End bp201334 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content67% 
IMG OID640481305 
Productthymidine phosphorylase 
Protein accessionYP_001165967 
Protein GI146275807 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTTGA AAATCAAGCG CATCGCGATC GACACTCACC CGGAGAACAC CGCGTTCCTC 
CTGCGCAGGA GGAACGGCTA TTCGCCCGAA CAGTTCGTCG CGCTGCGCAA GATCGAGATC
ACCGGCGGCG ATGCGTCGAT CCTCGCCACC CTGGCGCTGA TCGATGACGA GAGCCTGCTC
GAACCGCACA TGATCGGCCT TGGCGAACAG GCCTTCCGCC GCCTCGGCCT GCCCGAAGGC
GCGGAAGTAA CCTTCCGCCA GGCGCCCGTC CCGCACAGCC TCGAACACGT CCGTCGCAAG
ATCGACGGCG ACGAACTGGA AGAAACCGAG ATCGCGGAGA TCATCCGCGA CATCGCCGGA
TACCGCTATT CCCCGATGGA GATCGGGGCC TTCCTCGTCG CCTGCGCCGG GTTCATGTCC
ACGCACGAAA CGCTTGCCCT CACCCGCGCA ATGGCCGGCG TCGGCCGCCA GATGCACTGG
CCGTCCGAAA TCGTCGTCGA CAAGCACTGC ATCGGCGGCA TCCCCGGCAA CCGCACCTCG
ATGATCATCG TGCCGATCAT CGCCGCGCAC GGGCTGACCA TGCCCAAGAC CTCGTCGCGC
GCGATCACCT CGCCATCGGG CACGGCCGAC ACGATGGAAG TCCTCGCCTC GGTGGACCTG
CCAGAGGACC GGCTCGTCTC CATCGTGGCA AAGGAACACG CGGTGCTTGC CTGGGGCGGC
CGGGTGAACC TGTCGCCCGC CGACGACGTG CTGATCACCG TCGAGCGTCC GTTGCGGATC
GACACCTTCG ACCAGATGGT CGCCTCGATC CTGTCGAAGA AGCTGGCCGC CGGCTCGACC
CACCTTCTCA TCGACATTCC CGTCGGCCCC ACCGCCAAGG TCCGCACCAC GCGCGAGGCG
ATCCGCCTGC GCAAGCTGTT CGAATACGTC GGCCATCGTC TCGGCCTCGT TCTCGACATC
GTCGTCACCG ACGGCTCGCA GCCGGTGGGC CGGGGCGTGG GCCCCGTGCT CGAGGCGCGC
GACGTGATGG CCGTCCTGCG CAACGAGGAC GACGCACCCC GGGACTTGCG CGAACGCGCC
GTCATGCTCG CGGGCCGGGT GCTCGAATTC GATCCCGCGC TGGCGGGCGG CAAGGGCTAT
GCCCGCGCGA TGGAACTGCT CGGTTCCGGT GCCGCGCTGG CGGCGATGGA GCGCCTGATC
GATGCGCAGG GCCGGTGCCG CGAAGTGATC CTTCCCGGCA GCCACGTCCA CGATATCTGC
GCGCCAGCCG GCGGGACGGT CATGTCCATC GATTGCCACC TGATCGCGCG CATCGCCCGC
CTTGCCGGCG CGCCGATGGA CAAGGGCGCG GGAATCGACC TGCTGCACAA GGTGGGCGAC
CGGGTGCGCG CTGACGAAGT GCTCTATCGC ATCCACGCCC ACTCTCCGAC CGGCCTCGAA
TATGCGCGCG AACTGGCCGT GGCGAGTTCC GGTTACGTGG TCGGATGA
 
Protein sequence
MALKIKRIAI DTHPENTAFL LRRRNGYSPE QFVALRKIEI TGGDASILAT LALIDDESLL 
EPHMIGLGEQ AFRRLGLPEG AEVTFRQAPV PHSLEHVRRK IDGDELEETE IAEIIRDIAG
YRYSPMEIGA FLVACAGFMS THETLALTRA MAGVGRQMHW PSEIVVDKHC IGGIPGNRTS
MIIVPIIAAH GLTMPKTSSR AITSPSGTAD TMEVLASVDL PEDRLVSIVA KEHAVLAWGG
RVNLSPADDV LITVERPLRI DTFDQMVASI LSKKLAAGST HLLIDIPVGP TAKVRTTREA
IRLRKLFEYV GHRLGLVLDI VVTDGSQPVG RGVGPVLEAR DVMAVLRNED DAPRDLRERA
VMLAGRVLEF DPALAGGKGY ARAMELLGSG AALAAMERLI DAQGRCREVI LPGSHVHDIC
APAGGTVMSI DCHLIARIAR LAGAPMDKGA GIDLLHKVGD RVRADEVLYR IHAHSPTGLE
YARELAVASS GYVVG