Gene Dole_1275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1275 
Symbol 
ID5694110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1526463 
End bp1527662 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content63% 
IMG OID641263869 
Productamidohydrolase 
Protein accessionYP_001529158 
Protein GI158521288 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCTTA TGGCTTCCAT TATACATAAG GCCGGATGGG TAGTGGTCCA TGGACACCGG 
GTGATCCGTG ACGGGTTTGT GCGCGTGGCC GGCGGCGTGA TCACCGAAGT GGGGACCGGC
GCGGCGGGAA ACGGAACCGT TGTCGACCAC GGGGAGGGCG CCCTTGTGCC GGCCTTTGTC
AATGCCCATA CCCACCTGGA GCTTTCCGCC CTGGCCGGAC GGCTTTCCAC CGGCCAGGGG
TTTGAATCGT GGGTGCGGGA GCTCCTGGCC CTGCGCCAGG AACAGACCAG AGACCATCTG
CGCCGGGAGG CACGCGTCGC CGCTGATCGC ATGATAAAAG CCGGTACACT GGTGGCCGGT
GAGGTATCCA CCCTCGGCAT CACGGCGGAT CTGTTTCGGG ATGCCGGCCT GGCCGGCGTC
TGGTTTTCCG AGGTGCTGGG CCAGCACCTG CCTGAATCCA TGGACCTGCC GCCTGCCGAC
CAATGGCGCG CCTCTTCTTT TGCGGCCCAC GCGCCGCACA CCACGGCGCC GGAAGTGTTG
TGCCGCCTGA AACAGATGTG TGATGAACGG GGCCTGCCGT TTTCCATTCA TCTGGCTGAA
TCGCCCGAGG AGGCTGAGTT TATTCAAACC GGAAAGGGCA GGTGGGCCGA TTTTTTAAGC
GAGCGGGGCA TCGGTTTTTT CAAGTGGCCG GTCCCGTCAA AAAGTCCGGT GGGCTATCTG
GCCGATCTGG GCCTGCTGGG ACCGAACCTG CTGGCGGTTC ACCTGGTTTA CGCCGATGCA
GCAGATATAG AGATGCTGGC CCGGAACCGG GTCCATGGGT GCCTGTGTCT GCGGAGTAAC
ATGGCCCTGC ACGGCCGGAT GCCGGATGTA GCCCGAATGG TGGATGCCGG GTTTTACCTG
TGCCTGGGCA CCGACAGCCT GGCCTGCGTG GATTCCCTGA GCATGGTTGA CGAGATGGCC
TTTGTGGCAT ATAAGTGTCC TGCCCTCCGG CCGGAAGACC TGCTGAACAT GGCGACAATC
AACGGTGCAG CGGCGCTTGG CGTGGCCGAC CGGTTCGGCT CGCTGGAACC GGGAAAAAAA
GGCGCCCTGG TGTATCTGCC GGTAAAGGCG GAAAACCCAA AGGCCCTGCT TGAGCGGATC
GTCTCCGGCG AGGGCGGGCC GGTCTCAACC TGGTGGCCGG AAGAAAGAAA ACGGGAGTAA
 
Protein sequence
MPLMASIIHK AGWVVVHGHR VIRDGFVRVA GGVITEVGTG AAGNGTVVDH GEGALVPAFV 
NAHTHLELSA LAGRLSTGQG FESWVRELLA LRQEQTRDHL RREARVAADR MIKAGTLVAG
EVSTLGITAD LFRDAGLAGV WFSEVLGQHL PESMDLPPAD QWRASSFAAH APHTTAPEVL
CRLKQMCDER GLPFSIHLAE SPEEAEFIQT GKGRWADFLS ERGIGFFKWP VPSKSPVGYL
ADLGLLGPNL LAVHLVYADA ADIEMLARNR VHGCLCLRSN MALHGRMPDV ARMVDAGFYL
CLGTDSLACV DSLSMVDEMA FVAYKCPALR PEDLLNMATI NGAAALGVAD RFGSLEPGKK
GALVYLPVKA ENPKALLERI VSGEGGPVST WWPEERKRE