Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3578 |
Symbol | |
ID | 5713809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3768197 |
End bp | 3769435 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641269507 |
Product | glycosyl transferase |
Protein accession | YP_001534912 |
Protein GI | 159046118 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.643047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGG AGGGCATGCG CCTGAACCAG GATCCCTTGC CCCTGTTGCT CGACGTGACC CGGCTGGCCT CGCGGATCGG GGACAGGCGC TTGTCCGGCG TCGACCGGGT GGAGGCGGCC TATCTACGCT TTGTTCTGTC GCAACCGGAC ATGCCCTATG GGCTGGTGCG GTCCGGCTTC GGCTACCTGC TGCTCGATCG CGCGGGGTTG CAGCCGCTCT GCGATGCCGT GGCCGATGAA ACGATCTCCT GGGGGCGTCC GGACCTGTTG TCCCGGCTGG CGCGCCACCG CTCCAAGGCG CGCGCGGGGG TCGAGACGAC CTTGCGCGCG CTCTCCATCG CGCGGGCGCC GCGTTCGGGG CTGGGCCGGA TGCTGCGCAA ATGCCTGCCC GACGGGGCCC ATTACCTGAA CGTGGGCGAG ACGAATTTCG ATGGCCCGGT CGCGATGGCC CTGCGCGGTC TGCGCCGCAG CTGGATCGAC ATGGTGCTGC ATGACACCAT TCCCTGCGAC TTTCCCGATC TCGTGACCCC GGCCTCCGCC GCGCGGTTCG ACAAGCGCCT GCGCGCGATG CGCACACATG CGGATCGGAT CATCACCGCC ACCTGGGCCG TGCAACAGGC CGGGTGCCGC CACCTGGGGC TTGGATCGGA GGACGCGCGC TGGTGCGTGG CGCCCTTCGG GCTCGACCTG CCGGAGCCCG ACGCCAAGGC CCCCGCCCGG CACGGGCTGA CCCGGCCCTA CATGCTGGCG CTGGGGACGT TGGAGCCGCG CAAGAACATC ACCTTCCTGC TGGAGATCTG GCGGCAGGCA ACCGCCACGG GCAGCGCCAT GCCGGACCTC GTTCTCTGCG GTGCGCGCGG CTGGTATCCG GCGAAGGTCT TCGCCGCGCT GGACGCCGAC CCGCTGCGCG GTGTTCATAT CCACGAGATC AACGATGCCC TGGATCCGGA AGTCGCGGGC CTGATCGAAG GGGCCGAGGC GCTGCTGTTT CCCACCGTGG CCGAAGGCTT CGGCTTCCCG CCGCTGGAGG CCATTGCCCT GGGCGTGCCG GTGATCTGCA GCGATCTGCC CGTGCTGCGC GAGACCCTCG GGGCCTTGCC CGTTTACGTG GCCCCCGGGG ACAGCTACGC TTGGACATCA AAAATTAAGC AAGGTTGTGC AAAACCGGAT CCGACCGCTG TGGAGGCCCT GCTGGACCGG TTCACCTGGG AGCAGCACTT CGCCCGGGTG TTCGGCTGA
|
Protein sequence | MAAEGMRLNQ DPLPLLLDVT RLASRIGDRR LSGVDRVEAA YLRFVLSQPD MPYGLVRSGF GYLLLDRAGL QPLCDAVADE TISWGRPDLL SRLARHRSKA RAGVETTLRA LSIARAPRSG LGRMLRKCLP DGAHYLNVGE TNFDGPVAMA LRGLRRSWID MVLHDTIPCD FPDLVTPASA ARFDKRLRAM RTHADRIITA TWAVQQAGCR HLGLGSEDAR WCVAPFGLDL PEPDAKAPAR HGLTRPYMLA LGTLEPRKNI TFLLEIWRQA TATGSAMPDL VLCGARGWYP AKVFAALDAD PLRGVHIHEI NDALDPEVAG LIEGAEALLF PTVAEGFGFP PLEAIALGVP VICSDLPVLR ETLGALPVYV APGDSYAWTS KIKQGCAKPD PTAVEALLDR FTWEQHFARV FG
|
| |