Gene Dshi_4170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4170 
Symbol 
ID5714685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009959 
Strand
Start bp18287 
End bp19852 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content75% 
IMG OID641277065 
Productglucosyltransferase MdoH 
Protein accessionYP_001542361 
Protein GI159046693 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0929009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGG TCGAACCGAT TTCCCGTGCA TGGCGTGTCG ATGCCGGCTG GCCCGGCCCG 
CGCGCCGCTG GCCTTGGGCT GGCCCTGGGT CTGACGCTGG CCATCGTGGC GGGCTTTGCC
ACCAGTGTCA CCGACTGGAC CCCCGGGGCG CTGCTGGCGC TGCCGTTGGT GATGCTCGGC
GCGGTCTGGA TCGCGGGCGG GGCGGCGACA GCCCTTCTGG GGCTTGCCCT GCGCCCGGAC
CCGGAACCGC CCGTGCCCGC GGGCTGGCGC CCGGCCAGCC GCACCGCCCT CCTGGTGACC
CTGTGCAAGG AAGACCCCGC GCCGCTGGCC GCCCATCTCG TCGCCCTGCG CGCGGGCCTC
GACCGGGTGG GGCTGGACGC AGGCGCACAT ATCTTCGTGC TGTCGGACAC CTCCGGCGCC
GCCGCAATCG CCGCGGAGGA GGCCGCCTTC GCCCCGCTGA TCGAAGCGGG GACCGTCACC
TACCGCAGGA GGGCCGAGAA TACCGGGCGT AAACCGGGCA ATATCGCCGA CTGGCTGGCG
GTTCATGGGG ACCGGTTCGA GCATATGATG GTGCTCGACG CCGACAGCCG GATGAGCCCC
GACCGCATCC GCCGCATGAT CCACCGGATG GACCGGACCC CCGCTCTGGG CCTTTTGCAG
GCAGGCCTCG CACTGGTGCC GGGCCGCACC CGGTTCGGCC GCCACCAGCG GACGGGCGTG
CGCCTTCTGT CCCGGGGCTT CGGGCGCGGG TTCGCCGCCT GGACCGGCGA CAGCGGCAAT
TACTGGGGCC ATAACGCGAT CATGCGCGTC GCGGCCTTCC GCAGCGCCGC CGCCCTGCCG
GTCCTGCCCG GGCGCGCGCC CTTCGGCGGC GCGCTGCTGA GCCATGATTT CATCGAAGCC
GCCTGGATCC GCCGCGCGGG CTGGGCCGTG GCGCTGGACC CGGACATGAC CGGCAGCGCC
GAGGACGCGC CCCAGACCCT GGCCGCCTTC CACGCGCGCG ACCGCCGCTG GTGCCAGGGC
AACCTGCAAC ACCTGCGCCT GCTGGCTGCG CCCGGGCTGG ACCCGGTCAG CCGCCTGCAC
CTGCTCATGG GGGTCCTGAG CTACCTCGTG GCCCCGGTCT GGCTGGTCCT GATCGCGCTG
ATCGCCCTGG GGCTGGTGCC CGTGGCCGGG GCGCTGCCCC TGCTGGTCGC GGCGCTGGTG
CTGCTGATCC CCAAGCTCTG CGCGCTGGTC GAAGGCCTCT GCCGCAGTCG CAGCTGGGCG
CGCCGGGCGG TGATCCTGCG GGCCTGGGTG GGCGAGCTTG CGACCTCCAC CCTGATCGCG
CCGCTGGTGA TGCTGCGCCA GGCGGGGGCT GTCCTGGCGG TCTGCCTGGG CCGCGATTGC
GGCTGGAAGA CCGCGCGCCG GGCCGGGCCC ACCCTGCCGT GCGGCACGGT GGAGGCGGTG
GCGGGCGCGG CCCTCGTGAC CCTCGCCGTG GCCACCTCCG GCAGCGCGGC CCTGTGGCTC
GCCCCCGTGG CGCTGCCGCT CTGCTGCGCG CCGCTGATCG TGCCGGTCCT CGACCGGGCG
GCGTGA
 
Protein sequence
MSLVEPISRA WRVDAGWPGP RAAGLGLALG LTLAIVAGFA TSVTDWTPGA LLALPLVMLG 
AVWIAGGAAT ALLGLALRPD PEPPVPAGWR PASRTALLVT LCKEDPAPLA AHLVALRAGL
DRVGLDAGAH IFVLSDTSGA AAIAAEEAAF APLIEAGTVT YRRRAENTGR KPGNIADWLA
VHGDRFEHMM VLDADSRMSP DRIRRMIHRM DRTPALGLLQ AGLALVPGRT RFGRHQRTGV
RLLSRGFGRG FAAWTGDSGN YWGHNAIMRV AAFRSAAALP VLPGRAPFGG ALLSHDFIEA
AWIRRAGWAV ALDPDMTGSA EDAPQTLAAF HARDRRWCQG NLQHLRLLAA PGLDPVSRLH
LLMGVLSYLV APVWLVLIAL IALGLVPVAG ALPLLVAALV LLIPKLCALV EGLCRSRSWA
RRAVILRAWV GELATSTLIA PLVMLRQAGA VLAVCLGRDC GWKTARRAGP TLPCGTVEAV
AGAALVTLAV ATSGSAALWL APVALPLCCA PLIVPVLDRA A