Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2254 |
Symbol | |
ID | 7978417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2305895 |
End bp | 2307196 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644799068 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_002950228 |
Protein GI | 239827604 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG TGGATTTAAT TGCCAAAAAA AGGGATGGCC ACGCTCTTAC AAAAGAGGAA ATTGAATTTA TCATTCGCGG CTATACAAAC GGGGATATTC CGGATTATCA AATGAGCGCA TTTGCGATGG CGGTGTTTTT CCGCGGCATG ACCGAAGAAG AAACGGCGGC GCTGACGATG GCGATGGTTC ATTCGGGAGA TGTCATCGAT TTATCGAAAA TCGAAGGAAT TAAAGTCGAT AAACACTCAA CCGGCGGCGT CGGCGATACG ACAACGCTAG TGTTAGGCCC GCTCGTTGCG TCTGTCGGTG TGCCGGTCGC GAAAATGTCG GGCCGCGGCC TCGGGCATAC AGGCGGGACG ATTGATAAAT TAGAATCGGT TCCTGGTTTT CATGTAGAAA TCGATAATGA CCAATTTATT GAGCTTGTTA ATAAAAATAA AATCGCGATT ATCGGCCAGA CAGGCAATTT AACACCGGCT GATAAAAAGC TGTATGCGCT CCGCGACGTC ACCGCGACGG TAAACAGCAT TCCGCTCATC GCTTCGTCGA TTATGAGCAA AAAAATCGCC GCTGGTGCGG ATGCGATTGT TTTAGATGTC AAAACAGGCG CTGGTGCATT TATGAAAGAT TTAGAAGGGG CAAAACAACT CGCAAAAGCG ATGGTTGAAA TCGGCAAGCG CGTCGGCCGG AAAACGATGG CGGTTATTTC CGATATGAGC CAGCCGCTCG GATACGCAGT TGGCAACGCG CTCGAAGTAA AAGAAGCGAT TGATACACTC AAAGGAAAAG GGCCGGAAGA TTTACAAGAG CTATGTTTGA CGCTTGGAAG CTATATGGTA TATTTAGCGG AAAAAGCATC TTCATTAGAG GAAGCGCGTG CGTTATTAGA AACATCGATT CAAGAAGGAA AAGCGCTAGA AACGTTCAAA GTGTTTCTAA AAGCGCAAGG CGGCGATGCA TCGGTTGTCG ATGACCCATC CAAACTGCCG CAAGCAAAAT ATCAATGGGA GTTAGAAGCG CCGGAAGATG GATATGTCGC CGAAATTGTC GCCGACGAAG TCGGAACGGC TGCGATGCTG CTTGGAGCAG GACGGGCGAC AAAAGAATCG ACCATCGATC TCTCTGTCGG TCTCGTTTTG CGTAAAAAAG TCGGCGATGC GGTGAAAAAA AGGGAATCGC TTGTTACGAT TTACAGCAAT ACGGAAAATA TCGAAGAAGT GAAACAAAAA CTTGCTAAAA GCATCCGAAT CTCCCAAATT CCTGTTGCCA AACCAACGCT CATATATGAA ACGATTTCAT AA
|
Protein sequence | MRMVDLIAKK RDGHALTKEE IEFIIRGYTN GDIPDYQMSA FAMAVFFRGM TEEETAALTM AMVHSGDVID LSKIEGIKVD KHSTGGVGDT TTLVLGPLVA SVGVPVAKMS GRGLGHTGGT IDKLESVPGF HVEIDNDQFI ELVNKNKIAI IGQTGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKD LEGAKQLAKA MVEIGKRVGR KTMAVISDMS QPLGYAVGNA LEVKEAIDTL KGKGPEDLQE LCLTLGSYMV YLAEKASSLE EARALLETSI QEGKALETFK VFLKAQGGDA SVVDDPSKLP QAKYQWELEA PEDGYVAEIV ADEVGTAAML LGAGRATKES TIDLSVGLVL RKKVGDAVKK RESLVTIYSN TENIEEVKQK LAKSIRISQI PVAKPTLIYE TIS
|
| |