Gene Dshi_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3866 
Symbol 
ID5714395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp75905 
End bp79813 
Gene Length3909 bp 
Protein Length1302 aa 
Translation table11 
GC content69% 
IMG OID641276779 
Productglycosyl transferase group 1 
Protein accessionYP_001542075 
Protein GI159046404 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.806044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCA ACGCCGACAG CATGCAGCGC GACCGCACGG ACCTGGAGGC CTCCGGGCTC 
TTCGACCCGG CATGGTACCT GGAAACATAC CCGGATGTGG CGCAACTGGG CATGGACCCG
CTGGAGCATT TCCTGTGGCT GGGCGCCCGG CTCAATCGCA GCCCCGGACC GACTTTCGAT
GCGGCCGCCT ACCGGGCCGA TTACGCGGAC GTGGCCCGGG CCGATTACAA TCCGGTGCTG
CATTACATCC GCTACGGCCG CCCCGAGGGC CGCAAGGCCC GCGCGCCGGG GCTCACCATA
CCGCCCCTGG CCGAGAGCGC CGGGGCAAGC GCCCCCTTCA GACGGCTGAC CGGCCACCGG
GCCCGCCGCC CGGGACACCC GACCGTGCTG CTGGTGGCAC ATATCGTGGG CCACCAGCTC
TACGGCTCCG AACGCAGCCT GCTCGACATG CTCGACGGGT TGGCGGCGAT GGATGCCAAC
GTGATCGTCG CCGTGCCCAG CACCCGCAAC AAGGCCTATA TCGAGCTGCT GCGCGCCCGC
GCCTGTGCGG TCAGCGTGCT GTCCTATGGC TGGTGGCGGG CGGGCACGGC GCCCGACGAA
ACCGTCATCG CCACCTTCGC CCGGGTGATC ACGGAAGAAC GTATCGACAT CGTCCATGCC
AACACGATCA TGCTGCGCGA GCCACTGATC GCGGCCCGGC GCCTGGGGCT GCCGGCCGTG
GTGCAGGCGC GCGAGCTGAT CCGTCACGAT GCCAAGCTGC TGGAGCTCAT CGGGCTCAGC
GCCGACGAGA TCATCGCGGC GCTCTGGGAG AATTGCGATG TCATGATCGC CAATTCCCGG
GCGACCGCGG CCTGTTTCGA CACCGAAACC CGCCGCTCCG CCGTGGTCTA CAACACCGCC
GACATGACCG CGCTGCAGGC GCTGCCCCCG CCCCCGGCCG AAGCCCCCCT GCGGGTCGGC
ATGGTCAGCT CCAATGTTCC CAAGAAGGGG CTGGCGGATT TTGCCGAGGT CGCCCGTCTG
GTGGCCGCCG AACTGCCCGA CGCGGAATTC CTGCTGATCG GCCCGCAACA CGAACATACC
GAGGCGATCG AGGCCCGGAT CGCCGACGGC TCCCTGCCGC GCAGCCTCAG GGTCGCGGGC
TATCGCGACA CCCCGGCCGA GGCCATGGCC GAGCTCGACC TGGTGCTCAG CCTGTCGCAT
TTCCAGGAAA GCTTCGGGCG CACCGTGCTG GAGGCCATGG CCGCAGGCCG GCCGGTCATC
GTCTATGACC ATGGCGCCCC GCCGGAACTG GTGGTGTCGA GCAAGACCGG CCAAGTGGTC
CCCTTCGGCG ACATCCAGGC CGTGGCCGAC CAGGTCCTGG CCTATGGCCG CGACCGCAAA
CGGCTCTTGG TGGACGGGCT GCGGGCGGCC GACCACGCCG AGACCACCTT TGGGCGCACC
GCCTATAACG CGGCCATGGC GGCGGCCTAT GCGCCGCTGC TGCAGGAGCT GGCCCGGGAT
GCCCACCGCC CCGAGCAGAT GGTGCTGCGC GCGCGCGACC TGCCGGTGAA GATCCCACGC
GACACCCTCA AGGTCGCCTA TTTCTGCTGG CACTTCCCGG TGCCCTCGGA AACCTTCGTG
CTCAACGAGT TGCGCATCCT CAAGGCGCAG GGCATCGACG TGACGGTGTT TTGCCGGGAC
TCCCCCTACC CCGATTTCAC GCCGGATTTC GACATCACCT GGGAGCAGGT CCACGATGCC
GACCACCTGG CCCGGCGGCT CACCGAGACC GGGCGCGACG TGGTCCACGG CCATTTCGTC
TATCCCACGG TCACCGAGAT GGTCTGGCCC GCGGCACGCA TCGCGAACAT CCCCTTCACC
TGCATCGCCC ATGCCCAGGA CATCTTCCGC TACCGCAACG CGGTGGCCAA CCGGATCGAC
GAGATCAGCG CCGATCCCCT CTGCCGGCAG ATCTTCACCC TGTCGCGCTT CCACCGGCAA
TACCTGGTGG ACCGCGGCGT GCGCCCCGAG AAGGTGACCA TCAATTCCAA CTGCATCGAC
CCGGAGCTGT TCTCGGGCGG CAAGATCCCC GACCGGCCGG CCCGCCGCAC CCGATCCGTT
GCCGCGGTCT CGCGCTTTGC CGACAAGAAG GGGCTGGAGG TGCTGGTGCG CGCCGGCAAG
CTGCTGGAAG ACGACGGGAT CACCATCAAT ATCCACGGCT ACGGCCCGCT GGAGGATCTC
TACCGCCAGA TCATCGCCGA GCAGGAGATC ACCAATGTCA CCATCCACGG CCCCGTGGAG
GGTCGCGCGG CGCTGCTGGA GGTGTTCCGC ACCCATGATC TGTTCGCCGT GCCCTCGGTG
CGCGCGCTGG ACGGGGACAT GGACGGCATC CCCACCACCC TGATGGAGGC GATGGCCGCC
GGCCTGCCGG TTCTGACCAC GCCCGTCGCG GGCATTCCCG ACCTGGTGCG CGACGGGATC
ACCGGCATGC TGTCGGAAGA TGCCACCCCC GAGGCGCTGG CCGCCAAGAT CCGCGAATTC
TACGCCCTGC CCGAGATCGC CGTGCAGGTG ATGATCGAGG ATGCCGAGGC CCTGCTGCGG
CGCAACCATA ACGGCCCGGA TCTCGTGAAC ACGCTCCTGC GCTTCTGGGC GGGCGAGACC
ATCGACCTGA TGATCGTGTC CTGGAACAAC CTCGCCCAGA CCCGCGAGGT GATCCGGCGG
CTCTACGAAT ATACCGACCT GCCCTTCCAC CTCATTGTCT GCGACAATGG CTCGGACCCG
CCGGCGCTGG CGCACCTGCT GTCGGTCTAT GCGGCGCGCA CGAATTTCAC CCTGATCCTG
AACCGCGAGA ACGCCTTCGT CGGCCCCGGC ACCAACAAGT GCATCGCCCA AGGCGACTCC
GACTACATGA TCTATGTCTG CGGCAAGGAA GGCATGACCA CCCGCCACGG CTGGGAGAAA
TCCTTCGTCA CCTACATGGA CGCCCACCCC CGGGTGGGCC AGGCCGGCAC CCTGTGCTAC
TCGCCCAGCT ATCTCTTCGG GCGCGACTAC CCCGAGGGGG TGGCGCTGTT CCCGGATTTC
CGCAACCCGG GCTTTGCCGC CGACAATCCC GACCGGCCGT TCTCTCACGT CCAGGGCGGG
TTCTTCGTCA TCCGGCGCGC CACCTATGAC GAGATCGGCG GGTTTTCCGA CGCGGTCCCC
CACAGCTACA CGGATGTGGA GTTTTCCTAT TACGTGGAAA GCTGCGGCTG GGAGCTTGGC
ACGGTGCCCG GGCTGATGGC GCTGTTCAAC AAGACCCGGC CCGGGCTGGA GGCGCGGGTG
GACGAACATC ACGGCGCGTT GCATCCGCCC AATCTCGACG ATCTGCCCTG GCTCGACCGG
ATCGCCCGGC GCGAGGTGCG CCACTGCAAC ATGTGCGGCC ACCAGGCGCC CGCCTTCGAG
GGCGGCGATG CCGAGGCACG CTGCGCCGGG TGCGGCTCGG ACCGCCGCGC GCGCAGCCTG
CACCGGGTGC TGGCCGAGAC CATCCTGCTC TATCGCCGCC TGCCGGGGCT GGGGGTGAAC
CTGCCCGCGC CGCTGCAGGG TTTCTGGTCG GACCAGTTCC AGGGCCCGAT GCTGCCCCTG
GAGGCGTTCA CGGACCCCCT GAGCCGCGGC CAGACCCTAC CCAACCGCGC GGGCGCGCTG
CAGCTGGCCT GCCTGAACGA CGTGCTGGAT GACGTGGCCC TGCGCGGGGC GGCCCTGGCC
GAGACCGCGC GCCTGCTGGC GCCGGGGGCC ACGCTTTTCG TGGCCGGCGC CACCCCGCTC
GACACGCTGG AGGCCGAGAT CACCGCGGCA GGCTTCACGC CCGCGGGGCG CAAACGCCCC
TGCTCGGCGG TGCTGCGCTT CGACTGGATC GAGATCGGCC TCTATACCCG CGCCGGAGAC
ACGCAATGA
 
Protein sequence
MNANADSMQR DRTDLEASGL FDPAWYLETY PDVAQLGMDP LEHFLWLGAR LNRSPGPTFD 
AAAYRADYAD VARADYNPVL HYIRYGRPEG RKARAPGLTI PPLAESAGAS APFRRLTGHR
ARRPGHPTVL LVAHIVGHQL YGSERSLLDM LDGLAAMDAN VIVAVPSTRN KAYIELLRAR
ACAVSVLSYG WWRAGTAPDE TVIATFARVI TEERIDIVHA NTIMLREPLI AARRLGLPAV
VQARELIRHD AKLLELIGLS ADEIIAALWE NCDVMIANSR ATAACFDTET RRSAVVYNTA
DMTALQALPP PPAEAPLRVG MVSSNVPKKG LADFAEVARL VAAELPDAEF LLIGPQHEHT
EAIEARIADG SLPRSLRVAG YRDTPAEAMA ELDLVLSLSH FQESFGRTVL EAMAAGRPVI
VYDHGAPPEL VVSSKTGQVV PFGDIQAVAD QVLAYGRDRK RLLVDGLRAA DHAETTFGRT
AYNAAMAAAY APLLQELARD AHRPEQMVLR ARDLPVKIPR DTLKVAYFCW HFPVPSETFV
LNELRILKAQ GIDVTVFCRD SPYPDFTPDF DITWEQVHDA DHLARRLTET GRDVVHGHFV
YPTVTEMVWP AARIANIPFT CIAHAQDIFR YRNAVANRID EISADPLCRQ IFTLSRFHRQ
YLVDRGVRPE KVTINSNCID PELFSGGKIP DRPARRTRSV AAVSRFADKK GLEVLVRAGK
LLEDDGITIN IHGYGPLEDL YRQIIAEQEI TNVTIHGPVE GRAALLEVFR THDLFAVPSV
RALDGDMDGI PTTLMEAMAA GLPVLTTPVA GIPDLVRDGI TGMLSEDATP EALAAKIREF
YALPEIAVQV MIEDAEALLR RNHNGPDLVN TLLRFWAGET IDLMIVSWNN LAQTREVIRR
LYEYTDLPFH LIVCDNGSDP PALAHLLSVY AARTNFTLIL NRENAFVGPG TNKCIAQGDS
DYMIYVCGKE GMTTRHGWEK SFVTYMDAHP RVGQAGTLCY SPSYLFGRDY PEGVALFPDF
RNPGFAADNP DRPFSHVQGG FFVIRRATYD EIGGFSDAVP HSYTDVEFSY YVESCGWELG
TVPGLMALFN KTRPGLEARV DEHHGALHPP NLDDLPWLDR IARREVRHCN MCGHQAPAFE
GGDAEARCAG CGSDRRARSL HRVLAETILL YRRLPGLGVN LPAPLQGFWS DQFQGPMLPL
EAFTDPLSRG QTLPNRAGAL QLACLNDVLD DVALRGAALA ETARLLAPGA TLFVAGATPL
DTLEAEITAA GFTPAGRKRP CSAVLRFDWI EIGLYTRAGD TQ