Gene YPK_0237 details

Gene Information

Locus tag: YPK_0237
Symbol:
ID: 6089405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 275768
End bp: 277066
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 52%
IMG OID: 641595297
Product: cytosine deaminase
Protein accession: YP_001719003
Protein GI: 170022498
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
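
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the database record; it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 275768
END_BP = 277066
GENE_LENGTH_BP = 1299
PROTEIN_LENGTH_AA = 432


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 277066 - 275768 + 1 gives 1299 bp, and 1299 / 3 - 1 gives 432 aa, which agrees with the listed Gene Length and Protein Length.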


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGGGC AATCTGTACC GAATTATTCA TTAACGACGG TAAATAATGT GCGTCGTATA 
GGGTTAGCGG GGCTGTGGCA GATAACGATA GCGGAAGGCA AAATCAGCCT GATTCAGCCT
CAGCCGGAGC AATCCCTAAC AGGGCAAGGT GTGCTGGATG CGCAGGGCGG TTTGGCGCTC
CCGCCATTTA TTGAGCCGCA TATTCATCTG GATACGACTC AAACCGCTGG GCAACCGAGT
TGGAATCAGT CGGGAACTCT GTTTGAGGGA ATTGAACGTT GGGCGGAGCG CAAAGCGCGA
CTAACGCGCG AGGATGTGAA GCAGCGGGCC TGGCAAACAT TGAAATGGCA GATAGCCAAC
GGTATCCAGC ATGTCCGAAC GCATGTTGAT GTTTCCGATC CACACTTGAC GGCATTGAGC
GCCATGTTGG AGGTGAAAGA AGAGGTTAGC CCTTGGGTTG ATATGCAGAT TGTGGCTTTC
CCACAAGAGG GCATCCTCTC TTATCCTGAT GGTGCGGCGT TATTGGAAGA GGCACTGCGC
TTGGGTGCGG ATGCGGTGGG GGCGATCCCC CATTTTGAAT TTACCCGCGA ATATGGCGTG
GAATCTTTGC ATATCGCGTT TGCATTGGCG CAAAAATATC AGCGGCTGGT GGATGTTCAT
TGCGATGAAA CGGATGATGA GCAATCGCGC TTTATCGAAA CCGTGGCGGC GCTGGCCCTT
CGCGAAAATA TGGGCGCACG GGTAACCGCC AGCCATACCA CGGCGATGCA CTCTTATAAT
GGTGCCTATA CATCACGGCT GTTCCGCTTG CTGAAATTAT CGGGCATTAA TTTTGTCGCC
AATCCGCTGG TGAATATTCA TCTGCAAGGC CGTTTTGATA CTTATCCGAA GCGGCGTGGT
ATCACGCGGG TCAAAGAGAT GTTGGCTGCC GAGATTAATG TGTGCTTCGG CCATGACGAT
GTGTTCGACC CGTGGTATCC GCTAGGTACC GCCAATATGT TGCAGGTGCT GCATATGGGA
TTACATATTT GTCAGTTGAT GGGATATGAA CAAATTAATG ATGGTCTGAA TCTGATAACC
ACCCACAGTG CCAAGACGCT GAATTTAGCA GATTACGGTT TGCAGCCTGG TAACCGCGCC
AACCTGATTA TTCTACCCGC AGAAAATGGT TTTGATGCTC TACGGCGGCA GGTTCCCGTC
AGTTACTCTA TTCGTCATGG TGTGGTGATC GCGAAGACGC AACCCGCAGA GAGCCGGATC
TATTTGGGGC AGGAAGAGAT CGTGGATTTT CGGCGTTAA
 
Protein sequence
MAGQSVPNYS LTTVNNVRRI GLAGLWQITI AEGKISLIQP QPEQSLTGQG VLDAQGGLAL 
PPFIEPHIHL DTTQTAGQPS WNQSGTLFEG IERWAERKAR LTREDVKQRA WQTLKWQIAN
GIQHVRTHVD VSDPHLTALS AMLEVKEEVS PWVDMQIVAF PQEGILSYPD GAALLEEALR
LGADAVGAIP HFEFTREYGV ESLHIAFALA QKYQRLVDVH CDETDDEQSR FIETVAALAL
RENMGARVTA SHTTAMHSYN GAYTSRLFRL LKLSGINFVA NPLVNIHLQG RFDTYPKRRG
ITRVKEMLAA EINVCFGHDD VFDPWYPLGT ANMLQVLHMG LHICQLMGYE QINDGLNLIT
THSAKTLNLA DYGLQPGNRA NLIILPAENG FDALRRQVPV SYSIRHGVVI AKTQPAESRI
YLGQEEIVDF RR
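
The sequences above are printed in space-separated blocks of 10 characters. The following sketch, which is not part of the database record, shows one way to strip that layout and compute a GC fraction or split a gene sequence into codons. The helper names are hypothetical, and the GC calculation assumes a plain count of G and C over all bases, which may differ from how the database derives its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (simple count, an assumption)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(text: str) -> list[str]:
    """Split a cleaned sequence into complete 3-base codons from position 0."""
    seq = clean_sequence(text)
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two blocks and the final block of the gene sequence in this record.
first_blocks = "ATGGCAGGGC AATCTGTACC"
last_block = "CGGCGTTAA"

print(codons(first_blocks)[:5])   # leading codons, beginning with the start codon
print(codons(last_block)[-1])     # final codon of the gene
print(gc_fraction("ATGGCAGGGC"))  # GC fraction of the first 10-base block
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared against the GC content field, and `len(clean_sequence(...))` can be compared against the Gene Length field.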