Gene Dbac_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_0042 
Symbol 
ID8375675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp63104 
End bp64384 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content61% 
IMG OID644999272 
Productamidohydrolase 
Protein accessionYP_003156588 
Protein GI256827860 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTATCT TGATAAAAAA AGTGCGCCTG AATGGAGAGC TGGTCGATGT GCTCATCAAG 
GGCAATCGTT TCGACTCCAT CGGAACGGAC GTGGACTCGT CCGCCGACGT GGTGATCGAC
GGGTCGGGCA AGGCCATCCT GCCATCCTTC CACAATGCCC ACACCCATGC GGCAATGACG
CTTCTGCGCG GCTATGCCGA CGATATGGAT CTGCATACAT GGCTGGCCGA TCACATCTGG
CCCTTCGAGG CCCGGCTGAC GGAGGATGAC ATCTATTGGG GCGCGAAGCT CGCCTGCCTT
GAGATGATCA AGTCCGGAAC GACCTTTTTC GCCGATATGT ACTGGCATTG GAAGGGCACG
GCCCGGGCCG TGACGGACAT GGGCATGCGT GCGGCGCTGT CCGCGGCATT TTTTGATTTC
GACGATCCGG TCCGTGCCGA AACCATGAAG CGGCAGGTCA TGGATCTGCA CGCCGCCAGC
GTCGCGTTTC CGGACCGGAT TCAGTTCATT CTCGGGCCTC ACGCCATCTA CACCGTGTCT
TCGGACTCCC TGCGCTGGCT GGGGGAATAC GCGAACCGGC ACGGTCTTCT GGTGCATCTG
CACCTTTCCG AGACGCAAAA AGAGGTTGAG GACTGTTTGG CCAAACATGG CAAAAGGCCG
GTGGAGTATC TGCACGAGCT GGGTCTTTTG GCCCCGAACC TGATCCTGGC GCATGCCGTG
TGGATGACCG GGAAGGAGAT GGAGCTGCTG GCCGGGCACG GGGTGCAGGT CGTGCACTGC
CCGGTCTCGA ACATGAAGCT GTGTTCCGGG CAGTTCGACT ACGCCGCCAT GCAGGCTCAT
GGCGTCACCG TGGCCCTGGG TACGGACGGC TGTTCCTCGA ACAACAATCT GGACATGATC
GAGGAAATGA AGATCGCCTC TCTGCTGGCC AAGGTCACGT CCATGGACCC CACCGTCTTT
CCGGCCCAGG AAGCTCTCGA CGCGGCCACC GTGAACGGGG CGCGCATGTA CGGCCTGGAT
GCGGGGTGCA TTGCCTCGGG CAAGCTCGCG GATTGCATTC TGGTCGATCT GGAGCATGTG
CGCATGGTCC CGAACCATCA CCTTGTGTCC AACCTGGTCT ACAGCGCGAA CAGCTCCTGC
GTGGACACGA CCATCTGCGA CGGCCGGGTG CTCATGCTCG GCGGCAAGGT CGAAGGCGAG
GAAGAGATCC TGGCCCAGGT CCGCGCGACA TTGGCCCGTC TGAATGCTCC GCGTGAGCCT
GAGGGGGACG CATGCTCCTG A
 
Protein sequence
MSILIKKVRL NGELVDVLIK GNRFDSIGTD VDSSADVVID GSGKAILPSF HNAHTHAAMT 
LLRGYADDMD LHTWLADHIW PFEARLTEDD IYWGAKLACL EMIKSGTTFF ADMYWHWKGT
ARAVTDMGMR AALSAAFFDF DDPVRAETMK RQVMDLHAAS VAFPDRIQFI LGPHAIYTVS
SDSLRWLGEY ANRHGLLVHL HLSETQKEVE DCLAKHGKRP VEYLHELGLL APNLILAHAV
WMTGKEMELL AGHGVQVVHC PVSNMKLCSG QFDYAAMQAH GVTVALGTDG CSSNNNLDMI
EEMKIASLLA KVTSMDPTVF PAQEALDAAT VNGARMYGLD AGCIASGKLA DCILVDLEHV
RMVPNHHLVS NLVYSANSSC VDTTICDGRV LMLGGKVEGE EEILAQVRAT LARLNAPREP
EGDACS