Gene Dshi_1381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1381 
Symbol 
ID5712557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1434352 
End bp1435626 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content68% 
IMG OID641267293 
Productcytosine deaminase-like protein 
Protein accessionYP_001532724 
Protein GI159043930 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.283067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTACC GCGCATTGCC CAGCGGCGCC TTTACGATGA CGAACGTGCA TGTGCCCGCC 
TGCCTTTTGG GGCAGGACGG GGACCTGGTC AGGACCGAGA TTTCCATCGA TGCCCAAGGC
GACCTGTCGG AGCCGCAGCC CATCGCGGTG GACATGGAAG GCGCGCTCGT GCTGCCGTGC
TTCACCGACA TGCACACCCA TCTGGACAAG GGCCATATCT GGGGGCGCAG TCCGAACCCC
GACGGTACGT TCATGGGCGC GCTCAGCACC GTGGCCGAAG ACCGCGCGGC GCGCTGGTCG
GCTGAGGATG TGCGGCGCCG GATGAGCTTT GCCCTGCGCT GCGCCTATGC CCACGGGACG
CGGGCGATCC GCACGCATCT CGACAGCATT CCGCCGCAGG ACGGGATCTC CTTTCCGCTC
TTTCGCGACA TGCAGGCCGA GTGGGCCGGC CGGATCGAGT TGCAGGCGGT CTGCCTGATC
GGCTGCGATC ACTTCTCGAC CGACGGGCCG TTCAAGGCCA CCGCCGATCG GGTGGCCGAG
ACCCCCGGCG GGGTTCTGGG CATGGTGACC TACCCGGTGC CGGACCTGAT CGACCGGCTG
CGCGGCTTCT TCGCCATGGC CGCCGAGCGG GGCCTTGCCG CCGATTTCCA CGTGGACGAA
ACCATGGATC CGTCGTCCGA GACCCTGCGC GCGATTGCCG AGACCGCCCA TGAGGTGGGG
TTCGACGCGC CGATCACCGT TGGCCATTGC TGCTCGCTCG GCACGCAAGA CGAGGCCCGG
GCGCTGGACA CGCTGGACCT GGTCGCGCAG GTGGGGATCA ACGTGGTGTC GCTGCCCTTG
TGCAACCTCT ACTTGCAGGA CCGTCATGCG GGCCGGACCC CGCGCGGGCG CGGCATCACG
CTCGTGCACG AGATGATGGC GCGGGACATT CCCGTGGCCT TTGCCTCGGA CAACACCCGC
GATCCGTTCT ACGCCTATGG CGACATGGAC ATGGTCGAGG TGATGCGGGA GGCGACGCGG
ATCGGGCATC TCGATCACGG GCGCTTCGAC TGGGTGCGGG CCTTCACCGC GACCCCGGCG
GCGATCTGCG GCTTCGACGC GCCGAGCCTT GCGCCCGGCG CGCCTGCGGA CCTGGTGATC
ACCCGCGCGC GGAGCTGGAA CGAGTTCTTC TCCCGCCCGC AAAGCGACCG GATCGTGCTG
CGCGGTGGCG AGCCCATCGA CCGCACCCTG CCCGATTACG CAGAACTGGA CGACCTGATG
GAGACGCCCC AATGA
 
Protein sequence
MDYRALPSGA FTMTNVHVPA CLLGQDGDLV RTEISIDAQG DLSEPQPIAV DMEGALVLPC 
FTDMHTHLDK GHIWGRSPNP DGTFMGALST VAEDRAARWS AEDVRRRMSF ALRCAYAHGT
RAIRTHLDSI PPQDGISFPL FRDMQAEWAG RIELQAVCLI GCDHFSTDGP FKATADRVAE
TPGGVLGMVT YPVPDLIDRL RGFFAMAAER GLAADFHVDE TMDPSSETLR AIAETAHEVG
FDAPITVGHC CSLGTQDEAR ALDTLDLVAQ VGINVVSLPL CNLYLQDRHA GRTPRGRGIT
LVHEMMARDI PVAFASDNTR DPFYAYGDMD MVEVMREATR IGHLDHGRFD WVRAFTATPA
AICGFDAPSL APGAPADLVI TRARSWNEFF SRPQSDRIVL RGGEPIDRTL PDYAELDDLM
ETPQ