Gene Daud_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2046 
Symbol 
ID6025959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2156589 
End bp2157494 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content66% 
IMG OID641594867 
Productcytidine deaminase 
Protein accessionYP_001718168 
Protein GI169832186 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0295] Cytidine deaminase
[COG0319] Predicted metal-dependent hydrolase 
TIGRFAM ID[TIGR00043] metalloprotein, YbeY/UPF0054 family
[TIGR01354] cytidine deaminase, homotetrameric 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTCG TAATAAACAA CTTACAGGAC ACAGTTCCGG TTGACGAGCA CCTGATGGCC 
CTGATCACCA GAGCCGTGGA GTCGGCACTG GCCGGGGGGG ACAGGCGGCG GGTCGAAGTG
AGTGTCGCCC TGGTGGATGA TGAATACATT CACGACCTGA ACCGGCGGTT CCGGGGGCAG
GATCGCCCCA CCGACGTGTT ATCCTTCCCA ATGGGCGAAG AGGAACCCGG TGCCGGAGAT
GAGCCGGGTG TGCTGCTCCT GGGCGACGTG GTGATCTCCC TGCCGGCGGC GGCGCGTCAG
GCTGCCGAAT ACGGTCACGG GCTGGACCGG GAGGTCGCCC GGTTGGCCGT GCACGGTACC
CTCCACCTCC TGGGCTATGA CCATGAACAG GATGAGGACG CGAGCCGGAT GCAGGAACGC
GAAGACGCCG TTCTCGCCGT GCTGGGAAAA AGGGGAACAG TCCCCTGGGA AAAGGGGGAC
AGTCCTCCTT TTTCCGCCGA TACCGAACTT GTCGAACGGG CCCTGGCGGC CCGTCAAAAC
GCCTATGCGC CCTACTCCGG ATTCCGGGTG GGGGCGGCGC TCCAGGTCCG GGGTGACCGG
GTGTTCACCG GGTGCAACGT GGAAAACGCT TCGTACGGCC TCACGGTCTG TGCCGAGCGG
GTGGCCGTGG TTTCGGCCGT GGCCGCCGGG GCCCGGGAAT TCACGGCCCT GGCGGTGGCC
GGGGACGGAG AAGACCCCGC TCTTCCCTGC GGCGCCTGCC TCCAGGTGTT GAGTGAGTTC
GCGCCGCACC TCCGCCTGCT TCTGGCCAAC GCCCGCGGCG AGTTCGCAGT GCGGACCCTC
CCCGAACTCC TGCCGAAAGC GTTCCAACTC CACCGCCGGG GTGACACCCC GCAGCCGACA
TCATGA
 
Protein sequence
MTVVINNLQD TVPVDEHLMA LITRAVESAL AGGDRRRVEV SVALVDDEYI HDLNRRFRGQ 
DRPTDVLSFP MGEEEPGAGD EPGVLLLGDV VISLPAAARQ AAEYGHGLDR EVARLAVHGT
LHLLGYDHEQ DEDASRMQER EDAVLAVLGK RGTVPWEKGD SPPFSADTEL VERALAARQN
AYAPYSGFRV GAALQVRGDR VFTGCNVENA SYGLTVCAER VAVVSAVAAG AREFTALAVA
GDGEDPALPC GACLQVLSEF APHLRLLLAN ARGEFAVRTL PELLPKAFQL HRRGDTPQPT
S