Gene CPS_1970 details


Gene Information

Locus tag: CPS_1970
Symbol: deoA
ID: 3520590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 2038824
End bp: 2040140
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 42%
IMG OID: 637284432
Product: thymidine phosphorylase
Protein accession: YP_268700
Protein GI: 71280126
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02643] thymidine phosphorylase
            [TIGR02644] pyrimidine-nucleoside phosphorylase
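
The length fields in this record are mutually consistent, and a little arithmetic shows it. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2038824, 2040140)
print(length)                     # 1317
print(protein_length_aa(length))  # 438
```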


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.380336
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCTGTAA CCGCTAAAAT CATCCCTCAA GAGATTATTC GACTGAAACG TGATGGTAAA 
ATATTAGATG AACAAGCGAT AAATGGCTTT GTTTCAGGCC TTGTTGATGG CAACTTTTCA
GATAGCCAAG TTGGCGCTAT GGCCATGGCT ATTTTTCAAC AGGGTATGTC GATTGATGAA
AGAGTCAATT TTACCAAGGC TATGATGCGC TCAGGAGAGG TTCTTAGCTG GGAAGGTTTT
GATGGTCCTA TTGTTGATAA GCATTCAACG GGTGGTGTCG GTGATAAAGT TAGCTTTATG
TTAGCTGCTA TCGTTGCTGC TTGCGGCGGT TATGTTCCTA TGATTTCAGG GCGTGGTTTG
GGCCATACTG GTGGTACCGC AGATAAACTT GAAAGTATTG CTGGTTTTAA TGTACAACCC
AGTATTAGTG AATTTAAACG TATTGTTAAA GACGTTGGTG TTGCCATTAT TTCACAAACC
GATAATTTAG CACCCGCCGA TAAACGTTTG TATTCTATTC GTGATGTGAC CGCTACAGTT
GAATCTATTC CATTGATCAC CGCTTCTATC TTATCGAAGA AGCTTGCCGC AGGACTTGAT
GTCTTAGTGA TGGATGTCAA GGTTGGCAAT GGTGCCATGA TGAATAATTT AGATGATGCT
AAAGCACTTG CACAAAGCAT TACCAGTGTT GCTAACGGAG CGGGCGTTAA AACACAGGCT
ATTATTACCG ATATGAATCA AGTCTTGGGT ACTAGTGCCG GTAACGCCAT AGAAATGTAT
GAAACAGTTA AATACTTAAC CGGTAAACAA CGAGAGCCGC GCTTACATAA AATTGTGCAA
GCACTTGCTA GTGCAATGCT TATTAATACA AATCTAGCGA GTAGTGAAAA AGATGCCCGT
GAGAAAATTG ATAAGGTATT AAATTCAGGC TTAGCGGCTG AAAAATTTGA TCGCATGGTA
TCCGCGTTAG GTGGACCGAA AAACTTTATA GAAAAGCCTT GGGACTCAAT GAAAAAGGCA
AATGTTATTA CTGAGGTGCG AGCATTACAG CATGGTTACA TTGCGCAGAC GGACACTCGT
GCTATTGGCA TGTCGGTGGT TGGTTTAGGT GGAGGGAGAA CGGCACCTAC ACAACAGGTA
GATCACAGTG TTGGTTTTGA TCGGATATTA CCACTGGGTG TACAAGTGAA TCGTGGCGAA
GTGATTGCAC GTTTACATGC TAAAGATGAA GACTCAGCCA ATAGAGCCAT TGAGCAATTT
AATAATGCGA TTACTTACTC TGAAGAGAGC CCTGAGCTAC CACCGGTAAT CTACTAA
 
Protein sequence
MSVTAKIIPQ EIIRLKRDGK ILDEQAINGF VSGLVDGNFS DSQVGAMAMA IFQQGMSIDE 
RVNFTKAMMR SGEVLSWEGF DGPIVDKHST GGVGDKVSFM LAAIVAACGG YVPMISGRGL
GHTGGTADKL ESIAGFNVQP SISEFKRIVK DVGVAIISQT DNLAPADKRL YSIRDVTATV
ESIPLITASI LSKKLAAGLD VLVMDVKVGN GAMMNNLDDA KALAQSITSV ANGAGVKTQA
IITDMNQVLG TSAGNAIEMY ETVKYLTGKQ REPRLHKIVQ ALASAMLINT NLASSEKDAR
EKIDKVLNSG LAAEKFDRMV SALGGPKNFI EKPWDSMKKA NVITEVRALQ HGYIAQTDTR
AIGMSVVGLG GGRTAPTQQV DHSVGFDRIL PLGVQVNRGE VIARLHAKDE DSANRAIEQF
NNAITYSEES PELPPVIY
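
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be checked against the GC content and protein sequence fields. The helper names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs only in its permitted start codons).

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon to its amino acid, with codons enumerated in T, C, A, G order.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = clean_sequence(
    "ATGTCTGTAA CCGCTAAAAT CATCCCTCAA GAGATTATTC GACTGAAACG TGATGGTAAA"
)
print(translate(first_line))  # MSVTAKIIPQEIIRLKRDGK
print(round(gc_fraction(first_line), 2))
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.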