Gene Information |
Locus tag | Achl_1215 |
Symbol | deoA |
ID | 7292660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 1337067 |
End bp | 1338398 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643589620 |
Product | thymidine phosphorylase |
Protein accession | YP_002487295 |
Protein GI | 220911986 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
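The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic, using hypothetical function names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, if has_stop is True,
    that the last codon is a stop codon that is not translated.
    """
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length(1337067, 1338398)
print(length)                  # 1332
print(protein_length(length))  # 443
```

Both results match the "Gene Length" and "Protein Length" fields listed in the record.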
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.000539787 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGAGA CCCGCGCAAC GGACAGCATT GCCGAAGCAT TCGACGCCGT CGACATCATC CGCGTCAAGC GGGACAAGGG CACGCTGAGC CCGGAGCAGA TCGACTGGAC CATCGATGCC TACACCCGCG GCGTCATCGC GGATGAGCAG ATGGCCGCGC TGAACATGGC CATCCTGCTC AATGGCATGG ACCGGACCGA GATTGCGCGC TGGACGGCCG CGATGATCGC ATCCGGCGAA CGGATGGACT TCTCCAGCCT CCGTCGCCCC GACGGCGGCC TGAAATACAC GTCGGACAAG CACTCCACCG GCGGCGTGGG AGACAAGATC ACCCTGCCGC TGGCTCCGCT CGTGGCGGTA TTCGGCGTTG CAGTCCCGCA GCTGTCCGGC CGCGGCCTGG GGCACACCGG CGGCACCCTG GACAAGCTGG AGGCGATTCC CGGCTGGCGG GCGTCCCTGA GCAACGACGA AATACTCGCC CAGCTCCAGG ACGTGGGCGC TGTCATCTGC GCCGCCGGGG CAGGCCTGAC CCCGGCCGAT AAGAAGCTGT ACGCCCTGCG CGACGTCACC GGCACGGTGG AGGCCATCCC GCTGATCGCC TCGTCCATCA TGAGCAAGAA AATCGCCGAG GGCACCGGTT CCCTGGTGCT CGATGTGAAG GTGGGCAGCG GCGCCTTCAT GAAGGATGAG GCAAAGGCCC GCGAGCTGGC GGAGACCATG GTGGCCCTGG GCCAGGACGC CGGCGTGAAC ACGGTGGCAC TGCTTACCAA CATGGGCACC CCGCTCGGCC TGACCGCCGG GAACGCGATT GAAGTCGAGG AGTCGGTGGA GGTGCTGGCG GGCGGCGGCC CGGCCGACGT CGTCGAACTG ACGGTCAGGC TCGCCGAGGA AATGCTCGCC TGCGCGGGAG TGCGCGACGC CGATCCGGCC GCTGCGCTCA AGGACGGGCG CGCCATGGAC GTCTGGAACA GGATGATCCG TGCCCAGGGA GGTGACCCCG CCGCGAAGCT GCCGGTGGCC CGTGAGTCAG AGGTGCTCTA CGCTCCCGCC GACGGCGTCC TGGTGGAACT GGATGCCCTC GCCGTGGGCG TGGCCGCCTG GCGATTGGGC GCCGGACGTG CCCGCAAGGA GGATGCGGTG CAGGCCGGCG CAGGGGTGCG CATGCATGCC AAGCCGGGCG CACTGGTCCG GGCAGGTGAA CCGCTGATGA CCCTGCTCAC GGACACCCCC GAACGCTTCG ACAGGGCAAA GGAAGCGCTG GAGCACGCAG CGGTCATCGC ACCGGAGGGG TCCCGGCCGG CACAGCAGTT GATCATCGAC CGAATAGCAT AG
|
Protein sequence | MTETRATDSI AEAFDAVDII RVKRDKGTLS PEQIDWTIDA YTRGVIADEQ MAALNMAILL NGMDRTEIAR WTAAMIASGE RMDFSSLRRP DGGLKYTSDK HSTGGVGDKI TLPLAPLVAV FGVAVPQLSG RGLGHTGGTL DKLEAIPGWR ASLSNDEILA QLQDVGAVIC AAGAGLTPAD KKLYALRDVT GTVEAIPLIA SSIMSKKIAE GTGSLVLDVK VGSGAFMKDE AKARELAETM VALGQDAGVN TVALLTNMGT PLGLTAGNAI EVEESVEVLA GGGPADVVEL TVRLAEEMLA CAGVRDADPA AALKDGRAMD VWNRMIRAQG GDPAAKLPVA RESEVLYAPA DGVLVELDAL AVGVAAWRLG AGRARKEDAV QAGAGVRMHA KPGALVRAGE PLMTLLTDTP ERFDRAKEAL EHAAVIAPEG SRPAQQLIID RIA
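The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal Python sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the sequence is given 5' to 3' on the coding strand. The function names `translate` and `gc_fraction` are hypothetical helpers.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces used to group the sequence in blocks of ten, and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 and last 12 bases of the gene sequence listed above.
print(translate("ATGACAGAGA CCCGCGCAAC G"))  # MTETRAT
print(translate("CGAATAGCAT AG"))            # RIA
```

The two printed fragments match the beginning and end of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the "GC content" field to be checked the same way.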