Gene GWCH70_2254 details


Gene Information

Locus tag: GWCH70_2254
Symbol:
ID: 7978417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2305895
End bp: 2307196
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 47%
IMG OID: 644799068
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_002950228
Protein GI: 239827604
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
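
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive (a common convention that the record does not state) and that the gene ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 2305895
END_BP = 2307196
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2307196 - 2305895 + 1 = 1302 bp, and 1302 / 3 - 1 = 433 aa, which agrees with the listed gene and protein lengths.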


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGG TGGATTTAAT TGCCAAAAAA AGGGATGGCC ACGCTCTTAC AAAAGAGGAA 
ATTGAATTTA TCATTCGCGG CTATACAAAC GGGGATATTC CGGATTATCA AATGAGCGCA
TTTGCGATGG CGGTGTTTTT CCGCGGCATG ACCGAAGAAG AAACGGCGGC GCTGACGATG
GCGATGGTTC ATTCGGGAGA TGTCATCGAT TTATCGAAAA TCGAAGGAAT TAAAGTCGAT
AAACACTCAA CCGGCGGCGT CGGCGATACG ACAACGCTAG TGTTAGGCCC GCTCGTTGCG
TCTGTCGGTG TGCCGGTCGC GAAAATGTCG GGCCGCGGCC TCGGGCATAC AGGCGGGACG
ATTGATAAAT TAGAATCGGT TCCTGGTTTT CATGTAGAAA TCGATAATGA CCAATTTATT
GAGCTTGTTA ATAAAAATAA AATCGCGATT ATCGGCCAGA CAGGCAATTT AACACCGGCT
GATAAAAAGC TGTATGCGCT CCGCGACGTC ACCGCGACGG TAAACAGCAT TCCGCTCATC
GCTTCGTCGA TTATGAGCAA AAAAATCGCC GCTGGTGCGG ATGCGATTGT TTTAGATGTC
AAAACAGGCG CTGGTGCATT TATGAAAGAT TTAGAAGGGG CAAAACAACT CGCAAAAGCG
ATGGTTGAAA TCGGCAAGCG CGTCGGCCGG AAAACGATGG CGGTTATTTC CGATATGAGC
CAGCCGCTCG GATACGCAGT TGGCAACGCG CTCGAAGTAA AAGAAGCGAT TGATACACTC
AAAGGAAAAG GGCCGGAAGA TTTACAAGAG CTATGTTTGA CGCTTGGAAG CTATATGGTA
TATTTAGCGG AAAAAGCATC TTCATTAGAG GAAGCGCGTG CGTTATTAGA AACATCGATT
CAAGAAGGAA AAGCGCTAGA AACGTTCAAA GTGTTTCTAA AAGCGCAAGG CGGCGATGCA
TCGGTTGTCG ATGACCCATC CAAACTGCCG CAAGCAAAAT ATCAATGGGA GTTAGAAGCG
CCGGAAGATG GATATGTCGC CGAAATTGTC GCCGACGAAG TCGGAACGGC TGCGATGCTG
CTTGGAGCAG GACGGGCGAC AAAAGAATCG ACCATCGATC TCTCTGTCGG TCTCGTTTTG
CGTAAAAAAG TCGGCGATGC GGTGAAAAAA AGGGAATCGC TTGTTACGAT TTACAGCAAT
ACGGAAAATA TCGAAGAAGT GAAACAAAAA CTTGCTAAAA GCATCCGAAT CTCCCAAATT
CCTGTTGCCA AACCAACGCT CATATATGAA ACGATTTCAT AA
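
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below, which is an illustration and not the method used to produce this record, shows one way to strip the layout and compute a GC percentage such as the one listed under Gene Information. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, in its block layout.
first_line = "ATGAGAATGG TGGATTTAAT TGCCAAAAAA AGGGATGGCC ACGCTCTTAC AAAAGAGGAA"
print(len(clean_sequence(first_line)))  # 60 bases on this line
print(round(gc_percent(first_line), 1))
```

Passing the full 1302 bp sequence to `gc_percent` gives the value to compare against the rounded GC content in the record.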
 
Protein sequence
MRMVDLIAKK RDGHALTKEE IEFIIRGYTN GDIPDYQMSA FAMAVFFRGM TEEETAALTM 
AMVHSGDVID LSKIEGIKVD KHSTGGVGDT TTLVLGPLVA SVGVPVAKMS GRGLGHTGGT
IDKLESVPGF HVEIDNDQFI ELVNKNKIAI IGQTGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKD LEGAKQLAKA MVEIGKRVGR KTMAVISDMS
QPLGYAVGNA LEVKEAIDTL KGKGPEDLQE LCLTLGSYMV YLAEKASSLE EARALLETSI
QEGKALETFK VFLKAQGGDA SVVDDPSKLP QAKYQWELEA PEDGYVAEIV ADEVGTAAML
LGAGRATKES TIDLSVGLVL RKKVGDAVKK RESLVTIYSN TENIEEVKQK LAKSIRISQI
PVAKPTLIYE TIS
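
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal Python sketch under stated assumptions, not the pipeline that generated this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted. Because this gene starts with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order, shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so the block layout of the record can be pasted in.
    Alternative start codons (allowed by table 11) are not treated specially.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAGAATGG TGGATTTAAT TGCCAAAAAA"))  # MRMVDLIAKK
```

The output matches the first block of the protein sequence, and translating the final bases of the gene (`ACGATTTCAT AA`) yields `TIS` followed by the stop codon, matching the end of the protein.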