Gene Information |
Locus tag | CPS_1970 |
Symbol | deoA |
ID | 3520590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 2038824 |
End bp | 2040140 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637284432 |
Product | thymidine phosphorylase |
Protein accession | YP_268700 |
Protein GI | 71280126 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02643] thymidine phosphorylase; [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
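The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2038824, 2040140)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1317 438
```

Under these assumptions the results agree with the Gene Length (1317 bp) and Protein Length (438 aa) fields.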
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.380336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTAA CCGCTAAAAT CATCCCTCAA GAGATTATTC GACTGAAACG TGATGGTAAA ATATTAGATG AACAAGCGAT AAATGGCTTT GTTTCAGGCC TTGTTGATGG CAACTTTTCA GATAGCCAAG TTGGCGCTAT GGCCATGGCT ATTTTTCAAC AGGGTATGTC GATTGATGAA AGAGTCAATT TTACCAAGGC TATGATGCGC TCAGGAGAGG TTCTTAGCTG GGAAGGTTTT GATGGTCCTA TTGTTGATAA GCATTCAACG GGTGGTGTCG GTGATAAAGT TAGCTTTATG TTAGCTGCTA TCGTTGCTGC TTGCGGCGGT TATGTTCCTA TGATTTCAGG GCGTGGTTTG GGCCATACTG GTGGTACCGC AGATAAACTT GAAAGTATTG CTGGTTTTAA TGTACAACCC AGTATTAGTG AATTTAAACG TATTGTTAAA GACGTTGGTG TTGCCATTAT TTCACAAACC GATAATTTAG CACCCGCCGA TAAACGTTTG TATTCTATTC GTGATGTGAC CGCTACAGTT GAATCTATTC CATTGATCAC CGCTTCTATC TTATCGAAGA AGCTTGCCGC AGGACTTGAT GTCTTAGTGA TGGATGTCAA GGTTGGCAAT GGTGCCATGA TGAATAATTT AGATGATGCT AAAGCACTTG CACAAAGCAT TACCAGTGTT GCTAACGGAG CGGGCGTTAA AACACAGGCT ATTATTACCG ATATGAATCA AGTCTTGGGT ACTAGTGCCG GTAACGCCAT AGAAATGTAT GAAACAGTTA AATACTTAAC CGGTAAACAA CGAGAGCCGC GCTTACATAA AATTGTGCAA GCACTTGCTA GTGCAATGCT TATTAATACA AATCTAGCGA GTAGTGAAAA AGATGCCCGT GAGAAAATTG ATAAGGTATT AAATTCAGGC TTAGCGGCTG AAAAATTTGA TCGCATGGTA TCCGCGTTAG GTGGACCGAA AAACTTTATA GAAAAGCCTT GGGACTCAAT GAAAAAGGCA AATGTTATTA CTGAGGTGCG AGCATTACAG CATGGTTACA TTGCGCAGAC GGACACTCGT GCTATTGGCA TGTCGGTGGT TGGTTTAGGT GGAGGGAGAA CGGCACCTAC ACAACAGGTA GATCACAGTG TTGGTTTTGA TCGGATATTA CCACTGGGTG TACAAGTGAA TCGTGGCGAA GTGATTGCAC GTTTACATGC TAAAGATGAA GACTCAGCCA ATAGAGCCAT TGAGCAATTT AATAATGCGA TTACTTACTC TGAAGAGAGC CCTGAGCTAC CACCGGTAAT CTACTAA
|
Protein sequence | MSVTAKIIPQ EIIRLKRDGK ILDEQAINGF VSGLVDGNFS DSQVGAMAMA IFQQGMSIDE RVNFTKAMMR SGEVLSWEGF DGPIVDKHST GGVGDKVSFM LAAIVAACGG YVPMISGRGL GHTGGTADKL ESIAGFNVQP SISEFKRIVK DVGVAIISQT DNLAPADKRL YSIRDVTATV ESIPLITASI LSKKLAAGLD VLVMDVKVGN GAMMNNLDDA KALAQSITSV ANGAGVKTQA IITDMNQVLG TSAGNAIEMY ETVKYLTGKQ REPRLHKIVQ ALASAMLINT NLASSEKDAR EKIDKVLNSG LAAEKFDRMV SALGGPKNFI EKPWDSMKKA NVITEVRALQ HGYIAQTDTR AIGMSVVGLG GGRTAPTQQV DHSVGFDRIL PLGVQVNRGE VIARLHAKDE DSANRAIEQF NNAITYSEES PELPPVIY
|
| |
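The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate a coding sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which is not modelled here). All helper names are hypothetical, and the examples use only the first three and last three blocks of the gene sequence.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGTCTGTAA CCGCTAAAAT CATCCCTCAA"
last_blocks = "CCTGAGCTAC CACCGGTAAT CTACTAA"
print(translate(first_blocks))  # MSVTAKIIPQ
print(translate(last_blocks))   # PELPPVIY
```

The two translated fragments match the first block and the final residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give the value that the GC content field reports as a rounded percentage.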