Gene Namu_1318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1318 
Symbol 
ID8446914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1446805 
End bp1448094 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content73% 
IMG OID645040451 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_003200710 
Protein GI258651554 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.615956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCT TCGCCGCGGT GGACATCATC CGGGCCAAGC GGGACGGTCA GGAGCTGCGC 
ACCGAGCAGA TCCAGTGGGT GATCGACGCC TACACCCGGC ACGAGGTGGC CGAGGAGCAG
ATGTCGGCGC TGCTGATGGC CATCCTGCTC AACGGGATGT CCGGTGCCGA GATCGCCGCC
TGGACCACCG CCATGATCGA GTCCGGCCGG CGGATGGACC TGTCCGCCGT GCCCCGGCCG
ACCGTGGACA AGCACTCCAC CGGCGGGGTG GGGGACAAGA TCTCCTTGCC GCTGGTCCCG
CTGGTCGCCG CGTGCGGGGC GGCCGTGCCG CAGCTGTCCG GCCGCGGGTT GGGCCACACC
GGCGGCACCC TGGACAAGAT GGAGTCGATC GCCGGGTGGC GGGCCACCCT GTCCCCGCAG
GAGATGACCG ACCAGCTCAG CACCCTCGGC GCCGTCATCT GCGCCGCGTC GGAGGACCTG
GCCCCGGCCG ACCGCAAGCT CTACGCGCTG CGGGACGTCA CCGGGACCGT CGAATCGATC
CCGATGATCG CCGCGTCGAT CATGAGCAAG AAGATTGCCG AGGGCACCTC GGCCCTGGTG
CTGGACGTCA AGGTCGGCAC CGGCGCGTTC ATGAAGCGGG CCGACCACGC CGCCGAGCTC
GCCCGCACCA TGGTCGAACT CGGGACCCGG GAAGGGGTCC GGACCAGCGC GCTGCTCACC
GACATGTCCA CCCCGCTGGG GGTGGCGGTG GGCAACGCGA TCGAGGTCGC CGAATCGGTC
GAGGTGCTGG CCGGTGGCGG GCCGGCCGAT GTGGTCGAGC TGACGGTCGC ACTGGCCCGG
CAGATGGTTG AGCTGGCCGG GCTGTCGGTC GACCCGGCCG ACGTGCTGGC CTCCGGGCAG
GCCATGGACT GCTGGCGGGC GATGGTCAGC GCCCAGGGCG GCGACCCGGA CGCGCCGCTG
CCCACGCCCC GGTTCCGCGA GGTGGTGGTC GCCGACGAGG CGGGGGTGGT CGCCGAGCTG
GACGCGCTGC CCGTCGGCCT GGCCGCCTGG CGGCTGGGTG CCGGCCGGGC CCGCAAGGAG
CATGCCGTGC AGGCCGCCGC CGGGGTGCTG TGCCTGGCCA AGCCGGGGCA GACCGTGCGG
ATCGGCCAGC CGCTGTTCGA GCTGCACACC AACACCCCGG ACAAGTTCCC GGCCGCCCTG
GCCGACCTGG ACGGTGCGAT CCGGATCGCC CCCGCGGGCA CCACCGTGCA CCGGCCCGCG
CTCGTCCAGG ACGTGATCAC CGCCGGCTGA
 
Protein sequence
MSTFAAVDII RAKRDGQELR TEQIQWVIDA YTRHEVAEEQ MSALLMAILL NGMSGAEIAA 
WTTAMIESGR RMDLSAVPRP TVDKHSTGGV GDKISLPLVP LVAACGAAVP QLSGRGLGHT
GGTLDKMESI AGWRATLSPQ EMTDQLSTLG AVICAASEDL APADRKLYAL RDVTGTVESI
PMIAASIMSK KIAEGTSALV LDVKVGTGAF MKRADHAAEL ARTMVELGTR EGVRTSALLT
DMSTPLGVAV GNAIEVAESV EVLAGGGPAD VVELTVALAR QMVELAGLSV DPADVLASGQ
AMDCWRAMVS AQGGDPDAPL PTPRFREVVV ADEAGVVAEL DALPVGLAAW RLGAGRARKE
HAVQAAAGVL CLAKPGQTVR IGQPLFELHT NTPDKFPAAL ADLDGAIRIA PAGTTVHRPA
LVQDVITAG