Gene Information |
Locus tag | Dshi_1381 |
Symbol | |
ID | 5712557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1434352 |
End bp | 1435626 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267293 |
Product | cytosine deaminase-like protein |
Protein accession | YP_001532724 |
Protein GI | 159043930 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
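
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1434352, 1435626)
print(length)                     # 1275
print(protein_length_aa(length))  # 424
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields listed in the record.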
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.283067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACC GCGCATTGCC CAGCGGCGCC TTTACGATGA CGAACGTGCA TGTGCCCGCC TGCCTTTTGG GGCAGGACGG GGACCTGGTC AGGACCGAGA TTTCCATCGA TGCCCAAGGC GACCTGTCGG AGCCGCAGCC CATCGCGGTG GACATGGAAG GCGCGCTCGT GCTGCCGTGC TTCACCGACA TGCACACCCA TCTGGACAAG GGCCATATCT GGGGGCGCAG TCCGAACCCC GACGGTACGT TCATGGGCGC GCTCAGCACC GTGGCCGAAG ACCGCGCGGC GCGCTGGTCG GCTGAGGATG TGCGGCGCCG GATGAGCTTT GCCCTGCGCT GCGCCTATGC CCACGGGACG CGGGCGATCC GCACGCATCT CGACAGCATT CCGCCGCAGG ACGGGATCTC CTTTCCGCTC TTTCGCGACA TGCAGGCCGA GTGGGCCGGC CGGATCGAGT TGCAGGCGGT CTGCCTGATC GGCTGCGATC ACTTCTCGAC CGACGGGCCG TTCAAGGCCA CCGCCGATCG GGTGGCCGAG ACCCCCGGCG GGGTTCTGGG CATGGTGACC TACCCGGTGC CGGACCTGAT CGACCGGCTG CGCGGCTTCT TCGCCATGGC CGCCGAGCGG GGCCTTGCCG CCGATTTCCA CGTGGACGAA ACCATGGATC CGTCGTCCGA GACCCTGCGC GCGATTGCCG AGACCGCCCA TGAGGTGGGG TTCGACGCGC CGATCACCGT TGGCCATTGC TGCTCGCTCG GCACGCAAGA CGAGGCCCGG GCGCTGGACA CGCTGGACCT GGTCGCGCAG GTGGGGATCA ACGTGGTGTC GCTGCCCTTG TGCAACCTCT ACTTGCAGGA CCGTCATGCG GGCCGGACCC CGCGCGGGCG CGGCATCACG CTCGTGCACG AGATGATGGC GCGGGACATT CCCGTGGCCT TTGCCTCGGA CAACACCCGC GATCCGTTCT ACGCCTATGG CGACATGGAC ATGGTCGAGG TGATGCGGGA GGCGACGCGG ATCGGGCATC TCGATCACGG GCGCTTCGAC TGGGTGCGGG CCTTCACCGC GACCCCGGCG GCGATCTGCG GCTTCGACGC GCCGAGCCTT GCGCCCGGCG CGCCTGCGGA CCTGGTGATC ACCCGCGCGC GGAGCTGGAA CGAGTTCTTC TCCCGCCCGC AAAGCGACCG GATCGTGCTG CGCGGTGGCG AGCCCATCGA CCGCACCCTG CCCGATTACG CAGAACTGGA CGACCTGATG GAGACGCCCC AATGA
Protein sequence | MDYRALPSGA FTMTNVHVPA CLLGQDGDLV RTEISIDAQG DLSEPQPIAV DMEGALVLPC FTDMHTHLDK GHIWGRSPNP DGTFMGALST VAEDRAARWS AEDVRRRMSF ALRCAYAHGT RAIRTHLDSI PPQDGISFPL FRDMQAEWAG RIELQAVCLI GCDHFSTDGP FKATADRVAE TPGGVLGMVT YPVPDLIDRL RGFFAMAAER GLAADFHVDE TMDPSSETLR AIAETAHEVG FDAPITVGHC CSLGTQDEAR ALDTLDLVAQ VGINVVSLPL CNLYLQDRHA GRTPRGRGIT LVHEMMARDI PVAFASDNTR DPFYAYGDMD MVEVMREATR IGHLDHGRFD WVRAFTATPA AICGFDAPSL APGAPADLVI TRARSWNEFF SRPQSDRIVL RGGEPIDRTL PDYAELDDLM ETPQ
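
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below, using hypothetical helper names, shows one way to strip the spacing, compute a GC fraction, and split a coding sequence into codons. It is demonstrated only on the first two and last two blocks of the gene sequence rather than on the full record. The stop codon set is the standard one for translation table 11 (TAA, TAG, TGA).

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split an in-frame coding sequence into consecutive triplets."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


# First two and last two blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGGACTACC GCGCATTGCC")
last_blocks = clean_sequence("GAGACGCCCC AATGA")

print(len(first_blocks), gc_fraction(first_blocks))
print(codons(last_blocks))
print(codons(last_blocks)[-1] in STOP_CODONS_TABLE_11)
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.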