Gene Information |
Locus tag | Arth_1142 |
Symbol | deoA |
ID | 4446375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1238756 |
End bp | 1240078 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688948 |
Product | thymidine phosphorylase |
Protein accession | YP_830636 |
Protein GI | 116669703 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
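The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single untranslated stop codon; the record itself states neither, and the variable names are hypothetical.

```python
# Values copied from the Gene Information rows above.
start_bp = 1238756
end_bp = 1240078
reported_gene_length = 1323      # "Gene Length" row, in bp
reported_protein_length = 440    # "Protein Length" row, in aa

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed lengths agree with the reported 1323 bp and 440 aa, and the gene length is a whole number of codons.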
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.698076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGACACAAA CCCCAAGCAA CACCGAAGCG TTCGACGCCG TCGACATCAT CCGCATCAAG CGGGACAGGG GAGTCCTGAG CCCGGCCCAG ATCGACTGGA CCATCGACGC CTATACCCGC GGCGCGATCG CCGACGAGCA GATGGCCGCC CTGAACATGG CCATCCTGCT CAACGGCATG GACCGCGCGG AAATTTCGCG CTGGACCGCC GCGATGATCG CGTCCGGCGA ACGGATGGAC TTCTCCGGAC TGAGGCGGGC CGACGGCAGC GTGAAGCCGA CGTCGGACAA GCACTCCACC GGGGGAGTGG GGGACAAGAT CACGCTGCCG CTGGCACCGC TTGTTGCTGT ATTTGGCGTG GCTGTTCCAC AGCTCTCCGG CCGGGGCCTG GGCCACACCG GCGGCACACT GGACAAGCTG GAAGCCATCC CGGGCTGGCG CGCAGGGCTG AGCAACGACG AAATGATGGC TCAGCTCCAA AACGTCGGGG CAGTGATCTG CGCCGCGGGT GCCGGGCTGG CCCCTGCGGA CAAGAAGCTC TATGCCCTGC GCGACGTCAC GGGAACGGTG GAAGCCATCC CGCTGATTGC GTCCTCCATC ATGAGCAAGA AGATCGCGGA GGGCACCGGC TCACTGGTCC TCGACGTCAA GGTGGGAAGT GGCGCCTTTA TGAAGGACGA GGCCCGCGCC AGGGAACTGG CCGAGACGAT GGTGGCCTTG GGCAAGGATG CCGGTGTCAA CACCGTGGCC CTTTTGACTA ACATGAACAC GCCACTGGGC CTGACCGCCG GAAACTCGAT CGAGGTTGAG GAATCCGTGG AGGTGCTGGC CGGCGGCGGT CCGGAAGACG TCGTGGAACT GACGGTCCGG CTGGCGGAGG AGATGCTCGC CTGCGCCGGC GTTCATGACG CCGACCCGCG CGCCGCGCTC AAGGACGGCC GGGCCATGGA CGTGTGGAAC CGCATGATCG AGGCACAAGG TGGGGACCCC CGGGCCAAGC TTCCCGTTGC CAGGGAATCC GAAGTCGTCT ACGCGCCGGC CGACGGTGTC CTCGTGGAGC TGGATGCACT GGCCGTTGGC GTGGCCGCGT GGCGGCTAGG TGCCGGCCGG GCGCGCAAGG AGGACCAGGT CCAGGCGGGT GCCGGCGTCC GGCTGCACGC CAAGCCCGGT GCCGTGGTCC GCGCAGGCGA GCCCCTGATG ACGCTACTGA CGGACACCCC GGAGAAGTTC GAGCGGGCCA AGGAAGCCCT CGAAGGGTCC GTCGTCATCG CACCGGAGGG CTCCCGGCCG GCGCAGAAAC TCATCATCGA CCGCATCGCC TGA
Protein sequence | MTQTPSNTEA FDAVDIIRIK RDRGVLSPAQ IDWTIDAYTR GAIADEQMAA LNMAILLNGM DRAEISRWTA AMIASGERMD FSGLRRADGS VKPTSDKHST GGVGDKITLP LAPLVAVFGV AVPQLSGRGL GHTGGTLDKL EAIPGWRAGL SNDEMMAQLQ NVGAVICAAG AGLAPADKKL YALRDVTGTV EAIPLIASSI MSKKIAEGTG SLVLDVKVGS GAFMKDEARA RELAETMVAL GKDAGVNTVA LLTNMNTPLG LTAGNSIEVE ESVEVLAGGG PEDVVELTVR LAEEMLACAG VHDADPRAAL KDGRAMDVWN RMIEAQGGDP RAKLPVARES EVVYAPADGV LVELDALAVG VAAWRLGAGR ARKEDQVQAG AGVRLHAKPG AVVRAGEPLM TLLTDTPEKF ERAKEALEGS VVIAPEGSRP AQKLIIDRIA
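The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It assumes the amino acid assignments and start codon set of NCBI translation table 11 (the "Translation table" value above), under which an initiating GTG is read as methionine. The helper names `clean`, `translate_cds` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Codon table built in NCBI order (bases T, C, A, G). Table 11 uses the
# same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: start codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence from its first base up to the first stop."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An initiating codon is translated as methionine.
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
head = "GTGACACAAA CCCCAAGCAA CACCGAAGCG"
print(translate_cds(head))  # MTQTPSNTEA
print(round(gc_fraction(head), 2))
```

The translated fragment matches the first ten residues of the protein sequence, with the GTG start codon read as M. Passing the full gene sequence to `gc_fraction` would give the value to compare with the reported GC content.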