Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1275 |
Symbol | |
ID | 5694110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1526463 |
End bp | 1527662 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641263869 |
Product | amidohydrolase |
Protein accession | YP_001529158 |
Protein GI | 158521288 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTTA TGGCTTCCAT TATACATAAG GCCGGATGGG TAGTGGTCCA TGGACACCGG GTGATCCGTG ACGGGTTTGT GCGCGTGGCC GGCGGCGTGA TCACCGAAGT GGGGACCGGC GCGGCGGGAA ACGGAACCGT TGTCGACCAC GGGGAGGGCG CCCTTGTGCC GGCCTTTGTC AATGCCCATA CCCACCTGGA GCTTTCCGCC CTGGCCGGAC GGCTTTCCAC CGGCCAGGGG TTTGAATCGT GGGTGCGGGA GCTCCTGGCC CTGCGCCAGG AACAGACCAG AGACCATCTG CGCCGGGAGG CACGCGTCGC CGCTGATCGC ATGATAAAAG CCGGTACACT GGTGGCCGGT GAGGTATCCA CCCTCGGCAT CACGGCGGAT CTGTTTCGGG ATGCCGGCCT GGCCGGCGTC TGGTTTTCCG AGGTGCTGGG CCAGCACCTG CCTGAATCCA TGGACCTGCC GCCTGCCGAC CAATGGCGCG CCTCTTCTTT TGCGGCCCAC GCGCCGCACA CCACGGCGCC GGAAGTGTTG TGCCGCCTGA AACAGATGTG TGATGAACGG GGCCTGCCGT TTTCCATTCA TCTGGCTGAA TCGCCCGAGG AGGCTGAGTT TATTCAAACC GGAAAGGGCA GGTGGGCCGA TTTTTTAAGC GAGCGGGGCA TCGGTTTTTT CAAGTGGCCG GTCCCGTCAA AAAGTCCGGT GGGCTATCTG GCCGATCTGG GCCTGCTGGG ACCGAACCTG CTGGCGGTTC ACCTGGTTTA CGCCGATGCA GCAGATATAG AGATGCTGGC CCGGAACCGG GTCCATGGGT GCCTGTGTCT GCGGAGTAAC ATGGCCCTGC ACGGCCGGAT GCCGGATGTA GCCCGAATGG TGGATGCCGG GTTTTACCTG TGCCTGGGCA CCGACAGCCT GGCCTGCGTG GATTCCCTGA GCATGGTTGA CGAGATGGCC TTTGTGGCAT ATAAGTGTCC TGCCCTCCGG CCGGAAGACC TGCTGAACAT GGCGACAATC AACGGTGCAG CGGCGCTTGG CGTGGCCGAC CGGTTCGGCT CGCTGGAACC GGGAAAAAAA GGCGCCCTGG TGTATCTGCC GGTAAAGGCG GAAAACCCAA AGGCCCTGCT TGAGCGGATC GTCTCCGGCG AGGGCGGGCC GGTCTCAACC TGGTGGCCGG AAGAAAGAAA ACGGGAGTAA
|
Protein sequence | MPLMASIIHK AGWVVVHGHR VIRDGFVRVA GGVITEVGTG AAGNGTVVDH GEGALVPAFV NAHTHLELSA LAGRLSTGQG FESWVRELLA LRQEQTRDHL RREARVAADR MIKAGTLVAG EVSTLGITAD LFRDAGLAGV WFSEVLGQHL PESMDLPPAD QWRASSFAAH APHTTAPEVL CRLKQMCDER GLPFSIHLAE SPEEAEFIQT GKGRWADFLS ERGIGFFKWP VPSKSPVGYL ADLGLLGPNL LAVHLVYADA ADIEMLARNR VHGCLCLRSN MALHGRMPDV ARMVDAGFYL CLGTDSLACV DSLSMVDEMA FVAYKCPALR PEDLLNMATI NGAAALGVAD RFGSLEPGKK GALVYLPVKA ENPKALLERI VSGEGGPVST WWPEERKRE
|
| |