Gene Information |
Locus tag | Dhaf_3447 |
Symbol | |
ID | 7260465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfitobacterium hafniense DCB-2 |
Kingdom | Bacteria |
Replicon accession | NC_011830 |
Strand | - |
Start bp | 3679161 |
End bp | 3680453 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643563370 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_002459901 |
Protein GI | 219669466 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
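The length fields above can be cross-checked from the coordinates. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database schema.

```python
# Values copied from the Gene Information table above.
start_bp = 3679161
end_bp = 3680453

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1293 430
```

Under these assumptions the result agrees with the Gene Length (1293 bp) and Protein Length (430 aa) fields.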
| |
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGTCG TTGATATAAT TAACAAAAAG AAAAAAGGCG AAGCATTAAC AAAAGAGGAA ATAGAGTTTT TCGTACAAGG TTATGTAAAG GGGGAGATTC CCGACTATCA AATAGCTGCT CTTTACATGG CCATTTACTT CCAAGGGATG AATGATGAGG AAATCGCCGA CTTAACCATG GCCTATGTGA ACTCAGGAGA GACCATCGAT TTATCCGGCA TCCCAGGAGT AAAGGTCGAT AAGCATTCAA CAGGGGGAGT GGGGGATAAG ATTAGCCTGA TCGTTATTCC CTTGGTTGCC TCATTGGGAA TCCCTGTGGC GAAAATGAGC GGCAGAGGAT TAGGTCATAC CGGCGGGACC ATCGATAAGC TTGAAGCGAT TCAAGGATTC AGGACCGCTT TAAGTACCGG GGAATTTATC GCCAATGTGA ATAACCACGG CATGGCGGTA GTAGGGCAAA CAGCTAATTT GACCCCTGCC GACAAGCTGA CCTATGCCTT AAGAGATGTG ACAGGAACCG TGGATAGTAT TCCTCTCATA GCAGGTTCCA TCATGAGCAA GAAGATTGCC TCCGGGGCGG ACGCCATAGT CCTCGATGTT AAGGTAGGCT CAGGGGCTTT CATGAAATCT CTCCCAGAAG CCAAAAAGCT TGCTGAGTGT ATGGTGCAAA TAGGTAAGTC ATTAAAGAGA AGGACTATCG CCATCATCAC CGATATGAAT CAGCCTCTTG GCCATGAAGT GGGGAACGCT AATGAAATTA AAGAGATTAT CGATGTCTTA AAGGGCAAGG GAGCGGAAGA CGAAACCAGA ATTGCTTTGA CGATAGCCTC CTATATGGCC ATTGCCAGTG GGAACTATCA TGATTTCCAG TCAACTTATG CAGAGCTTCA GCAAGTTATT GCCTCAGGTA AGGCCGTAGA AAAATTAAAA GAGCTGATCT CAATTCAAGG AGGAAATCCT CAAATAGTCC ATGAGCCCTC CCAATTACCC CAAGCGAAGC ATCATATCGA AGTCCTCGCC AATCATGCAG GGTATATAAG CTCTATTGAC GCCGAACAGA TTGGGCTGGT GGCTATGCTG TTGGGGGCAG GAAGAAAGAA GAAGGACGAT CCCATCGACT ATGCGGCCGG GGTAACGCTT TTGAAGAAGG TAGGGGATTA TGTTGACCTT GATGAACCTC TCTGTATTCT GCACACCAAC CTTGAATATG TAGAAGCAGA TGTTTTAAAG GCCTATGGAT TTCAGGAGAG CAAGCCCGAT CCGATAGAAT ATATTCATGA AGTGGTAAAG TAA
Protein sequence | MRVVDIINKK KKGEALTKEE IEFFVQGYVK GEIPDYQIAA LYMAIYFQGM NDEEIADLTM AYVNSGETID LSGIPGVKVD KHSTGGVGDK ISLIVIPLVA SLGIPVAKMS GRGLGHTGGT IDKLEAIQGF RTALSTGEFI ANVNNHGMAV VGQTANLTPA DKLTYALRDV TGTVDSIPLI AGSIMSKKIA SGADAIVLDV KVGSGAFMKS LPEAKKLAEC MVQIGKSLKR RTIAIITDMN QPLGHEVGNA NEIKEIIDVL KGKGAEDETR IALTIASYMA IASGNYHDFQ STYAELQQVI ASGKAVEKLK ELISIQGGNP QIVHEPSQLP QAKHHIEVLA NHAGYISSID AEQIGLVAML LGAGRKKKDD PIDYAAGVTL LKKVGDYVDL DEPLCILHTN LEYVEADVLK AYGFQESKPD PIEYIHEVVK
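The two sequence rows can be related to each other by translating the gene sequence. A minimal sketch in Python, assuming the gene sequence is shown in coding orientation (it begins with ATG although the gene lies on the minus strand) and that the spaces are display grouping only. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Translation table 11 assigns codons to amino acids exactly as the standard code does and differs only in its permitted start codons, so a standard codon table is built here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove display spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate up to the first stop codon; a trailing partial codon is ignored."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three 10-base groups of the gene sequence row above.
prefix = "ATGCGGGTCG TTGATATAAT TAACAAAAAG"
print(translate(prefix))  # prints: MRVVDIINKK
```

The printed residues match the first group of the protein sequence row. Applying `translate` to the full gene sequence row should, under the same assumptions, reproduce the whole protein sequence, and `gc_fraction` on that row can be compared with the GC content field.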