Gene Noca_3535 details


Gene Information

Locus tag: Noca_3535
Symbol: deoA
ID: 4595717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3745401
End bp: 3746687
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 73%
IMG OID: 639778143
Product: thymidine phosphorylase
Protein accession: YP_924722
Protein GI: 119717757
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
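
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Coordinates copied from the record; treating them as 1-based inclusive
# is an assumption, not something the record states.
START_BP = 3745401
END_BP = 3746687


def gene_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive [start, end] interval."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: total codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Both results agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields.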


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00606286
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGACC ATGACGCCGT CGAGGTGATC GCCGCCAAGC GCGACCGCCA CGAGCTGACC 
GACAGCCAGA TCGACTGGGT GGTCGACGCC TACACCAGGG GTGCGGTCGC CGACGAGCAG
ATGTCGTCGC TCGCGATGGC CATCCTGCTC AACGGGATGA ACCGGCGCGA GATCGCCCGC
TGGACCGCCG CGATGATCGC GTCGGGGGAG CGGATGGACT TCTCCTCGCT CTCGCGGCCG
ACCGCCGACA AGCACTCCAC CGGCGGGGTC GGCGACAAGA TCACGCTGCC GCTCGCGCCG
CTGGTCGCCG CCTGCGGGGT CGCCGTCCCG CAGCTCTCCG GGCGCGGCCT GGGCCACACG
GGTGGCACCC TCGACAAGCT CGAGGCCATC CCCGGCTGGC GGGCGGCCCT GTCGAACGAC
GAGATGATGG CGCAGCTCGA GTCGGTGGGT GCGGTGATCT GCGCGGCCGG CGATGGGCTG
GCGCCCGCGG ACAAGAAGCT CTACGCGCTG CGCGACGTGA CCGGCACCGT CGAGGCGATC
CCGCTGATCG CCTCCTCGAT CATGTCCAAG AAGATCGCCG AGGGCACCGG CTCACTGGTG
CTCGACGTCA AGGTCGGCAC CGGCGCGTTC ATGAAGGACA TCGACTCCGC GCGCGAGCTC
GCCGAGACGA TGGTCGCGCT CGGCACGGAC GCGGGCGTCC ACACGGTCGC GCTCCTGACC
GACATGTCTA CCCCCCTGGG GCGCACCGCC GGCAACGCGA TCGAGGTCGC CGAGTCGGTG
GAGGTGCTCG CCGGCGGCGG CCCGGCCGAC GTCGTGGAGC TGACCCTGGC GCTGGCCCGC
GAGATGCTGG CCGGCGCGGG TCGCGACGAC GTCGACCCGG CCGACAAGCT GGCCGACGGC
TCCGCGATGG ACGCCTGGAA GGCGATGATC CGGGCCCAGG GCGGCGACCC CGACGCCGCG
CTCCCGCAGG CGCGGGAGAG CCATGTCGTC AGTGCTCCCG CGTCCGGCGT GCTGACCCGG
CTGGACGCGA TGGCCGTCGG GCTGGCCGCC TGGCGGCTGG GCGCCGGCCG GGCCCGCAAG
GAGGACCCGG TGCAGGCCGG CGCCGGCGTC GTCTGGCACG CCCGCCCCGG GGACGCCGTC
ACCGAGGGGC AGCCGCTGTT CACGCTGCTC ACCGACGACG AGCACCGGTT CGAGCGGGCC
CTGGACTCAC TCGGGGGCGG CTACGACATC GCGCCCGCGG ACTCGCCGTA CACCCCGACG
CCGCTGGTGA TCGACCGGAT CGCCTGA
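
A GC content figure such as the one in the record can be recomputed from a sequence laid out in space-separated 10-base blocks. The following is a minimal sketch: `gc_fraction` is a hypothetical helper, and only the first two blocks of the gene sequence are used here, so the printed value describes that 20-base sample and not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
sample = "ATGCCTGACC ATGACGCCGT"
print(gc_fraction(sample))  # 0.6
```

Passing the full gene sequence to the same function gives the value to compare against the GC content field.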
 
Protein sequence
MPDHDAVEVI AAKRDRHELT DSQIDWVVDA YTRGAVADEQ MSSLAMAILL NGMNRREIAR 
WTAAMIASGE RMDFSSLSRP TADKHSTGGV GDKITLPLAP LVAACGVAVP QLSGRGLGHT
GGTLDKLEAI PGWRAALSND EMMAQLESVG AVICAAGDGL APADKKLYAL RDVTGTVEAI
PLIASSIMSK KIAEGTGSLV LDVKVGTGAF MKDIDSAREL AETMVALGTD AGVHTVALLT
DMSTPLGRTA GNAIEVAESV EVLAGGGPAD VVELTLALAR EMLAGAGRDD VDPADKLADG
SAMDAWKAMI RAQGGDPDAA LPQARESHVV SAPASGVLTR LDAMAVGLAA WRLGAGRARK
EDPVQAGAGV VWHARPGDAV TEGQPLFTLL TDDEHRFERA LDSLGGGYDI APADSPYTPT
PLVIDRIA
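
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last stretches of the gene. It assumes the codon-to-amino-acid assignments of table 11 are the same as those of the standard code (the two tables differ in which start codons they allow), and `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene sequence (first 18 bases) and its final line.
print(translate("ATGCCTGACC ATGACGCC"))             # MPDHDA
print(translate("CCGCTGGTGA TCGACCGGAT CGCCTGA"))   # PLVIDRIA
```

The two outputs match the first six and last eight residues of the protein sequence listed above.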