Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3535 |
Symbol | deoA |
ID | 4595717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3745401 |
End bp | 3746687 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778143 |
Product | thymidine phosphorylase |
Protein accession | YP_924722 |
Protein GI | 119717757 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00606286 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGACC ATGACGCCGT CGAGGTGATC GCCGCCAAGC GCGACCGCCA CGAGCTGACC GACAGCCAGA TCGACTGGGT GGTCGACGCC TACACCAGGG GTGCGGTCGC CGACGAGCAG ATGTCGTCGC TCGCGATGGC CATCCTGCTC AACGGGATGA ACCGGCGCGA GATCGCCCGC TGGACCGCCG CGATGATCGC GTCGGGGGAG CGGATGGACT TCTCCTCGCT CTCGCGGCCG ACCGCCGACA AGCACTCCAC CGGCGGGGTC GGCGACAAGA TCACGCTGCC GCTCGCGCCG CTGGTCGCCG CCTGCGGGGT CGCCGTCCCG CAGCTCTCCG GGCGCGGCCT GGGCCACACG GGTGGCACCC TCGACAAGCT CGAGGCCATC CCCGGCTGGC GGGCGGCCCT GTCGAACGAC GAGATGATGG CGCAGCTCGA GTCGGTGGGT GCGGTGATCT GCGCGGCCGG CGATGGGCTG GCGCCCGCGG ACAAGAAGCT CTACGCGCTG CGCGACGTGA CCGGCACCGT CGAGGCGATC CCGCTGATCG CCTCCTCGAT CATGTCCAAG AAGATCGCCG AGGGCACCGG CTCACTGGTG CTCGACGTCA AGGTCGGCAC CGGCGCGTTC ATGAAGGACA TCGACTCCGC GCGCGAGCTC GCCGAGACGA TGGTCGCGCT CGGCACGGAC GCGGGCGTCC ACACGGTCGC GCTCCTGACC GACATGTCTA CCCCCCTGGG GCGCACCGCC GGCAACGCGA TCGAGGTCGC CGAGTCGGTG GAGGTGCTCG CCGGCGGCGG CCCGGCCGAC GTCGTGGAGC TGACCCTGGC GCTGGCCCGC GAGATGCTGG CCGGCGCGGG TCGCGACGAC GTCGACCCGG CCGACAAGCT GGCCGACGGC TCCGCGATGG ACGCCTGGAA GGCGATGATC CGGGCCCAGG GCGGCGACCC CGACGCCGCG CTCCCGCAGG CGCGGGAGAG CCATGTCGTC AGTGCTCCCG CGTCCGGCGT GCTGACCCGG CTGGACGCGA TGGCCGTCGG GCTGGCCGCC TGGCGGCTGG GCGCCGGCCG GGCCCGCAAG GAGGACCCGG TGCAGGCCGG CGCCGGCGTC GTCTGGCACG CCCGCCCCGG GGACGCCGTC ACCGAGGGGC AGCCGCTGTT CACGCTGCTC ACCGACGACG AGCACCGGTT CGAGCGGGCC CTGGACTCAC TCGGGGGCGG CTACGACATC GCGCCCGCGG ACTCGCCGTA CACCCCGACG CCGCTGGTGA TCGACCGGAT CGCCTGA
|
Protein sequence | MPDHDAVEVI AAKRDRHELT DSQIDWVVDA YTRGAVADEQ MSSLAMAILL NGMNRREIAR WTAAMIASGE RMDFSSLSRP TADKHSTGGV GDKITLPLAP LVAACGVAVP QLSGRGLGHT GGTLDKLEAI PGWRAALSND EMMAQLESVG AVICAAGDGL APADKKLYAL RDVTGTVEAI PLIASSIMSK KIAEGTGSLV LDVKVGTGAF MKDIDSAREL AETMVALGTD AGVHTVALLT DMSTPLGRTA GNAIEVAESV EVLAGGGPAD VVELTLALAR EMLAGAGRDD VDPADKLADG SAMDAWKAMI RAQGGDPDAA LPQARESHVV SAPASGVLTR LDAMAVGLAA WRLGAGRARK EDPVQAGAGV VWHARPGDAV TEGQPLFTLL TDDEHRFERA LDSLGGGYDI APADSPYTPT PLVIDRIA
|
| |