Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0756 |
Symbol | deoA |
ID | 5706960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 841700 |
End bp | 842980 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270275 |
Product | thymidine phosphorylase |
Protein accession | YP_001535666 |
Protein GI | 159036413 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0537084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGATT TTACGGCAGT TGACATCATT CGGGCCAAAC GGGACGGCGA GGCGTTCTCC GACGCGCAGG TCGACTGGGT GGTGGATGCC TATACCCGAG GTCAGGTGGC GGACGAACAG ATGGCCGCGC TGGCGATGGC GATCCTGCTG AACGGGATGA CCACGCCGGA GATCGCCCGG TGGACCGCGG CGATGATCGC CAGCGGTGAG CGACTGGACC TCGCCGCGGT CGATCGGCCG ACCGTCGACA AACACTCCAC CGGTGGTGTC GGAGACAAGA TCACCCTGCC GCTCACCCCG CTGGTGGCGG CCTGCGGCGC GGCGGTGCCG CAGCTGAGCG GTCGGGGACT CGGGCACACC GGTGGCACGC TCGACAAGCT GGAGTCCATC CGGGGCTGGC GGGCGTCGCT GGGCAACGAG GAGTTCATCG CCCAGCTCCG CGGTGTCGGC GCGGTGATCT GCGCGGCCGG CGCCGGTCTC GCGCCGGCCG ACCGCAAGCT GTACGCGCTG CGGGACGTGA CCGGCACGGT GGAGGCCATC CCACTCATCG CCAGTTCGAT TATGAGCAAG AAGATCGCTG AGGGGACCGG TGCCCTGGTC CTGGACGTCA AGGTCGGCTC CGGTGCGTTC ATGAAGTCGG TCGACCAGGC CCGGCAACTG GCCCGCACCA TGGTCGAGTT GGGCAGCGCG CACGACGTAC GCACGGTTGC CCTGCTCACC GACATGTCCA CCCCACTAGG GCTGGCCATC GGCAACGCGG TCGAGGTGGC CGAATCGGTC GAGGTACTGG CGGGTGGTGG ACCGGCCGAC GTGGTCGAGC TGACCCTGGC CCTGGCCCGG GAGATGCTCG AAGCCGCCGG TCTACCGGAT GCCGACCCGG CGACGGCGCT GCGCGACGGC CGGGCGATGG ACTCCTGGCG GGCGATGCTG CGGGCCCAGG GCGGGGATCC GGATGCCCCG CTGCCCACGG CGCCGGAGAC CGAGGTGGTG CGCGCCGACA CCGACGGTGT GGTGGCGGAG GTCGACGCGT TCGGCATGGG GGTGGCCGCG TGGCGGCTCG GCGCCGGGCG GGCCCGCAAG GAGGACCCGG TGTCCGCGCC GGCGGGCGTG CTGCTGCGCA AACGCCCCGG CGATCCGGTC CGGGCCGGTG ACCCGCTCTT CGAGCTGCGG GCCGAGGACG CCGCTCGGAT CCCGGCGGCC CGGGAGGAGG CGGTGCGGGC GGTGCGTATT GCGGCGTCGA CCCCGGAGCC GAGGCCACTG GTGCTCGAAC GAATCGGCTG A
|
Protein sequence | MVDFTAVDII RAKRDGEAFS DAQVDWVVDA YTRGQVADEQ MAALAMAILL NGMTTPEIAR WTAAMIASGE RLDLAAVDRP TVDKHSTGGV GDKITLPLTP LVAACGAAVP QLSGRGLGHT GGTLDKLESI RGWRASLGNE EFIAQLRGVG AVICAAGAGL APADRKLYAL RDVTGTVEAI PLIASSIMSK KIAEGTGALV LDVKVGSGAF MKSVDQARQL ARTMVELGSA HDVRTVALLT DMSTPLGLAI GNAVEVAESV EVLAGGGPAD VVELTLALAR EMLEAAGLPD ADPATALRDG RAMDSWRAML RAQGGDPDAP LPTAPETEVV RADTDGVVAE VDAFGMGVAA WRLGAGRARK EDPVSAPAGV LLRKRPGDPV RAGDPLFELR AEDAARIPAA REEAVRAVRI AASTPEPRPL VLERIG
|
| |