Gene Dshi_4171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4171 
SymbolmdoD 
ID5714686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009959 
Strand
Start bp19852 
End bp21336 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content72% 
IMG OID641277066 
Productglucan biosynthesis protein D 
Protein accessionYP_001542362 
Protein GI159046694 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.13351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGGC GGACGTTTCT GGCCGGGGCC ACCTGCCTCG CGACCCTGCC GCGGCTGGCC 
CCGGCGCAAT CCGTGCCCAC GCTGCTGGAC GCGGCCCGCG CTTTGGCCGC CCAGCCCTTC
GCGCGCGATA CCGCGCCGCT GCCCCCGCCC TTCGCGGGGC TGAGCTATGA CGCCTATCGC
GGCATCCGCC CGATCCCGGG CCGGGCCGCC CTGCTGCCCC ATGGCCCGGA TTTCGCGGTC
GATTTGCTGC CGCCGGGACT GTTTTTCCCC GACCCGGTGC GAATCGACCT TGACCGGGGC
GACGGGCCGC AAGAGGTCAC ACTCACCCCT GACCTGTTCG ACTTCGCGCC CCGGTATTTC
GACACCATCC CCGCCACCGC CCCCGGCGCG GGGTTCTCGG GCCTGCGCCT GCGCCACCCG
CTGAACACCC CGGACGTGCT GGACGAGGTG CTGGTGGTTC AGGGGGCCAG CTATTTCCGC
GCCATCGGGC AGGCGATGGT CTACGGCCTC TCGGCCCGCG CGGTCGCCCT CGGCACCGGC
GGGGCAGGGC CGGAGGAATT CCCCCGCTTC ACCCATCTGC GTCTGCACCC GGCCAGGGAC
GGCACCGTTC GGCTGGAGGC GGTGATCGAC AGCCCGTCGC TGGCCGGGCA TCTCGACATG
GTCCTGCGCC CCGGTGAAGA CACGGTTTGC GACGTGGCCG TGACCCTGCT GCCGCGCCGC
GAGATCGCCG ATATCGGCAT CGCCCCGCTG ACCTCGATGT ATCTCAAGGG CCCGCTGCGC
GCCGCAGTCA GCGACGATTT CCGTCCCCGC GTCCATGACA GCGACGTGCT GCGCATCGAG
AACGGCGCGG GCGAGACGCT CTGGCGTCCC ATCGCCAACC CCGCCCGTCT GGAAACCTCC
GCCTTTCTCG ATGACGGGCC GGTCAGCTTC GGCCTGTTCC AGAGCCGGCG GCGGTTCACC
GATTTCGAGG ATACCGAGGC GCGGTACCAC GACCGCCCCG CGGCGGTGGT GCGCCCGTCA
GGCGACTGGG GCCGCGGCGC GGTCATGCTG GTGGAGATCC CCACGAGCGA CGAGTTCATG
GACAATATCG TGGCCTTCTG GCGCCCCGAG GCCCCCCTGA CCGCAGGCAG CGAGCACCGC
TTCGCCTACA GTCTGACCTG GACGCGCGCG GCCCCGGGCA CAGGCCTGCC CCACGCCATC
GCCCAAAGCC GCAGTGGCCG CGAACACGAC CGCCCGGGCA CCCGGCGCTA TGTCATCGAC
ATCGCGGGCG ACGCCACCGG TCTTGCCCCC GAAATCACGG GGCCGGAGGC GGTTGCGATC
ACCGGCGTCA GCCTCTTTGC CCTGCCCGAG GGACGCGGCA GCCGCCTGAC CTTCCTGCTG
ATCCCGGGGG AGGCCCGCGC CGCCGACCTG CGCGTGACTT TGGGCGTAAA TGGCGCGCCG
GTCTGGCTCC ACCGCTGGAC CCGGGCGCGC GACGGCGGGG TGTAG
 
Protein sequence
MRRRTFLAGA TCLATLPRLA PAQSVPTLLD AARALAAQPF ARDTAPLPPP FAGLSYDAYR 
GIRPIPGRAA LLPHGPDFAV DLLPPGLFFP DPVRIDLDRG DGPQEVTLTP DLFDFAPRYF
DTIPATAPGA GFSGLRLRHP LNTPDVLDEV LVVQGASYFR AIGQAMVYGL SARAVALGTG
GAGPEEFPRF THLRLHPARD GTVRLEAVID SPSLAGHLDM VLRPGEDTVC DVAVTLLPRR
EIADIGIAPL TSMYLKGPLR AAVSDDFRPR VHDSDVLRIE NGAGETLWRP IANPARLETS
AFLDDGPVSF GLFQSRRRFT DFEDTEARYH DRPAAVVRPS GDWGRGAVML VEIPTSDEFM
DNIVAFWRPE APLTAGSEHR FAYSLTWTRA APGTGLPHAI AQSRSGREHD RPGTRRYVID
IAGDATGLAP EITGPEAVAI TGVSLFALPE GRGSRLTFLL IPGEARAADL RVTLGVNGAP
VWLHRWTRAR DGGV