Gene Information |
Locus tag | Dshi_4170 |
Symbol | |
ID | 5714685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009959 |
Strand | + |
Start bp | 18287 |
End bp | 19852 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641277065 |
Product | glucosyltransferase MdoH |
Protein accession | YP_001542361 |
Protein GI | 159046693 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2943] Membrane glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0929009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGG TCGAACCGAT TTCCCGTGCA TGGCGTGTCG ATGCCGGCTG GCCCGGCCCG CGCGCCGCTG GCCTTGGGCT GGCCCTGGGT CTGACGCTGG CCATCGTGGC GGGCTTTGCC ACCAGTGTCA CCGACTGGAC CCCCGGGGCG CTGCTGGCGC TGCCGTTGGT GATGCTCGGC GCGGTCTGGA TCGCGGGCGG GGCGGCGACA GCCCTTCTGG GGCTTGCCCT GCGCCCGGAC CCGGAACCGC CCGTGCCCGC GGGCTGGCGC CCGGCCAGCC GCACCGCCCT CCTGGTGACC CTGTGCAAGG AAGACCCCGC GCCGCTGGCC GCCCATCTCG TCGCCCTGCG CGCGGGCCTC GACCGGGTGG GGCTGGACGC AGGCGCACAT ATCTTCGTGC TGTCGGACAC CTCCGGCGCC GCCGCAATCG CCGCGGAGGA GGCCGCCTTC GCCCCGCTGA TCGAAGCGGG GACCGTCACC TACCGCAGGA GGGCCGAGAA TACCGGGCGT AAACCGGGCA ATATCGCCGA CTGGCTGGCG GTTCATGGGG ACCGGTTCGA GCATATGATG GTGCTCGACG CCGACAGCCG GATGAGCCCC GACCGCATCC GCCGCATGAT CCACCGGATG GACCGGACCC CCGCTCTGGG CCTTTTGCAG GCAGGCCTCG CACTGGTGCC GGGCCGCACC CGGTTCGGCC GCCACCAGCG GACGGGCGTG CGCCTTCTGT CCCGGGGCTT CGGGCGCGGG TTCGCCGCCT GGACCGGCGA CAGCGGCAAT TACTGGGGCC ATAACGCGAT CATGCGCGTC GCGGCCTTCC GCAGCGCCGC CGCCCTGCCG GTCCTGCCCG GGCGCGCGCC CTTCGGCGGC GCGCTGCTGA GCCATGATTT CATCGAAGCC GCCTGGATCC GCCGCGCGGG CTGGGCCGTG GCGCTGGACC CGGACATGAC CGGCAGCGCC GAGGACGCGC CCCAGACCCT GGCCGCCTTC CACGCGCGCG ACCGCCGCTG GTGCCAGGGC AACCTGCAAC ACCTGCGCCT GCTGGCTGCG CCCGGGCTGG ACCCGGTCAG CCGCCTGCAC CTGCTCATGG GGGTCCTGAG CTACCTCGTG GCCCCGGTCT GGCTGGTCCT GATCGCGCTG ATCGCCCTGG GGCTGGTGCC CGTGGCCGGG GCGCTGCCCC TGCTGGTCGC GGCGCTGGTG CTGCTGATCC CCAAGCTCTG CGCGCTGGTC GAAGGCCTCT GCCGCAGTCG CAGCTGGGCG CGCCGGGCGG TGATCCTGCG GGCCTGGGTG GGCGAGCTTG CGACCTCCAC CCTGATCGCG CCGCTGGTGA TGCTGCGCCA GGCGGGGGCT GTCCTGGCGG TCTGCCTGGG CCGCGATTGC GGCTGGAAGA CCGCGCGCCG GGCCGGGCCC ACCCTGCCGT GCGGCACGGT GGAGGCGGTG GCGGGCGCGG CCCTCGTGAC CCTCGCCGTG GCCACCTCCG GCAGCGCGGC CCTGTGGCTC GCCCCCGTGG CGCTGCCGCT CTGCTGCGCG CCGCTGATCG TGCCGGTCCT CGACCGGGCG GCGTGA
|
Protein sequence | MSLVEPISRA WRVDAGWPGP RAAGLGLALG LTLAIVAGFA TSVTDWTPGA LLALPLVMLG AVWIAGGAAT ALLGLALRPD PEPPVPAGWR PASRTALLVT LCKEDPAPLA AHLVALRAGL DRVGLDAGAH IFVLSDTSGA AAIAAEEAAF APLIEAGTVT YRRRAENTGR KPGNIADWLA VHGDRFEHMM VLDADSRMSP DRIRRMIHRM DRTPALGLLQ AGLALVPGRT RFGRHQRTGV RLLSRGFGRG FAAWTGDSGN YWGHNAIMRV AAFRSAAALP VLPGRAPFGG ALLSHDFIEA AWIRRAGWAV ALDPDMTGSA EDAPQTLAAF HARDRRWCQG NLQHLRLLAA PGLDPVSRLH LLMGVLSYLV APVWLVLIAL IALGLVPVAG ALPLLVAALV LLIPKLCALV EGLCRSRSWA RRAVILRAWV GELATSTLIA PLVMLRQAGA VLAVCLGRDC GWKTARRAGP TLPCGTVEAV AGAALVTLAV ATSGSAALWL APVALPLCCA PLIVPVLDRA A
|
| |
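The length fields in the Gene Information table can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon. A small helper for GC fraction is included for use on the sequence above; it ignores the spaces between 10-base blocks.

```python
# Values copied from the Gene Information table above.
START_BP = 18287
END_BP = 19852
GENE_LENGTH_BP = 1566
PROTEIN_LENGTH_AA = 521


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks agree with the table: 19852 - 18287 + 1 = 1566 bp, and 1566 / 3 - 1 = 521 aa. Passing the full Gene sequence string to `gc_fraction` gives a value to compare with the reported GC content.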