Gene Dhaf_3447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_3447 
Symbol 
ID7260465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp3679161 
End bp3680453 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content46% 
IMG OID643563370 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_002459901 
Protein GI219669466 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGTCG TTGATATAAT TAACAAAAAG AAAAAAGGCG AAGCATTAAC AAAAGAGGAA 
ATAGAGTTTT TCGTACAAGG TTATGTAAAG GGGGAGATTC CCGACTATCA AATAGCTGCT
CTTTACATGG CCATTTACTT CCAAGGGATG AATGATGAGG AAATCGCCGA CTTAACCATG
GCCTATGTGA ACTCAGGAGA GACCATCGAT TTATCCGGCA TCCCAGGAGT AAAGGTCGAT
AAGCATTCAA CAGGGGGAGT GGGGGATAAG ATTAGCCTGA TCGTTATTCC CTTGGTTGCC
TCATTGGGAA TCCCTGTGGC GAAAATGAGC GGCAGAGGAT TAGGTCATAC CGGCGGGACC
ATCGATAAGC TTGAAGCGAT TCAAGGATTC AGGACCGCTT TAAGTACCGG GGAATTTATC
GCCAATGTGA ATAACCACGG CATGGCGGTA GTAGGGCAAA CAGCTAATTT GACCCCTGCC
GACAAGCTGA CCTATGCCTT AAGAGATGTG ACAGGAACCG TGGATAGTAT TCCTCTCATA
GCAGGTTCCA TCATGAGCAA GAAGATTGCC TCCGGGGCGG ACGCCATAGT CCTCGATGTT
AAGGTAGGCT CAGGGGCTTT CATGAAATCT CTCCCAGAAG CCAAAAAGCT TGCTGAGTGT
ATGGTGCAAA TAGGTAAGTC ATTAAAGAGA AGGACTATCG CCATCATCAC CGATATGAAT
CAGCCTCTTG GCCATGAAGT GGGGAACGCT AATGAAATTA AAGAGATTAT CGATGTCTTA
AAGGGCAAGG GAGCGGAAGA CGAAACCAGA ATTGCTTTGA CGATAGCCTC CTATATGGCC
ATTGCCAGTG GGAACTATCA TGATTTCCAG TCAACTTATG CAGAGCTTCA GCAAGTTATT
GCCTCAGGTA AGGCCGTAGA AAAATTAAAA GAGCTGATCT CAATTCAAGG AGGAAATCCT
CAAATAGTCC ATGAGCCCTC CCAATTACCC CAAGCGAAGC ATCATATCGA AGTCCTCGCC
AATCATGCAG GGTATATAAG CTCTATTGAC GCCGAACAGA TTGGGCTGGT GGCTATGCTG
TTGGGGGCAG GAAGAAAGAA GAAGGACGAT CCCATCGACT ATGCGGCCGG GGTAACGCTT
TTGAAGAAGG TAGGGGATTA TGTTGACCTT GATGAACCTC TCTGTATTCT GCACACCAAC
CTTGAATATG TAGAAGCAGA TGTTTTAAAG GCCTATGGAT TTCAGGAGAG CAAGCCCGAT
CCGATAGAAT ATATTCATGA AGTGGTAAAG TAA
 
Protein sequence
MRVVDIINKK KKGEALTKEE IEFFVQGYVK GEIPDYQIAA LYMAIYFQGM NDEEIADLTM 
AYVNSGETID LSGIPGVKVD KHSTGGVGDK ISLIVIPLVA SLGIPVAKMS GRGLGHTGGT
IDKLEAIQGF RTALSTGEFI ANVNNHGMAV VGQTANLTPA DKLTYALRDV TGTVDSIPLI
AGSIMSKKIA SGADAIVLDV KVGSGAFMKS LPEAKKLAEC MVQIGKSLKR RTIAIITDMN
QPLGHEVGNA NEIKEIIDVL KGKGAEDETR IALTIASYMA IASGNYHDFQ STYAELQQVI
ASGKAVEKLK ELISIQGGNP QIVHEPSQLP QAKHHIEVLA NHAGYISSID AEQIGLVAML
LGAGRKKKDD PIDYAAGVTL LKKVGDYVDL DEPLCILHTN LEYVEADVLK AYGFQESKPD
PIEYIHEVVK