Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1993 |
Symbol | |
ID | 5712988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2109965 |
End bp | 2111209 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267917 |
Product | putative glycosyltransferase |
Protein accession | YP_001533333 |
Protein GI | 159044539 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00985671 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.577803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCATA TCCGTCCACC GAAACAGGGA AAGATCGGAT ATATCTGCAA GCGCTACCCG CGCTTTTCCG AGACCTTCAT CGTCCACGAG ATCCTCGCCC ATGAGCGCGC GGGCCAGCAG GTGGAGATAT TCGCGCTGCG CCCGGTCATG GACACGCATT TCCAGGACAT CCTGTCGAAA GTGCGCGCGC CCGTGCACCG GATCCCCGAG AAGACCCGCT CTGTCAGCGT GTTCCGGGAC CTTCTGGAGA AGGCCGAGGC GCTTTATCCC GGTGCGCCGC AACGGGCGCT GGCCACGGGC GCTGCGACCG ATGCCATTGC GCAGGGGCTT GCGCTGGCCA TAGACGCCAA ACGCCTTGGC GTGACGCATT TCCACGCCCA TTTCGGCACG GTCGCCACGA CGGTCGCGCG CGTGGCCTCG CAGGTCTCCG GCATTCCGTA TACTTTCACG GCCCATGCCA AGGATATCTA TTACCGCTAC GACCCGCCGA TCGAGCTGGA CGTGAAGCTG CGCGATGCGG CGGCGGCGGT GACGGTTTCG GATTTCAACC TGGCCTACAT GACCGAGACG TTCGGCAAGG ACGCGGCCGG GCTCGTGCGG CTTTACAACG GGCTCGATCT GTCGGGCTTT GCGTGGTCCG AGCCGACGGC GCGGCAGACG GATATCCTCG CCGTGGGCCG CCTGATCGAG AAGAAGGGGT TCCATATCCT TGTGGAGGCC CTGTGGCAGT TGGCGCGCAA GGGGCAGACC CCGCGCTGCC GGATCATCGG CATGGGGGAG GACGAGGACA ACCTGCGCAG CCAGATCGCG GCGGCGGGCC TGGAGGGTCA GGTGACCATC GAAGGACCGC GCCCGCAATC CGAGGTCATC GCCGCCATGC GCGACGCCGC CGTTCTGGTC TGCCCCTGTA TCGTGGCCCG CGACGGCAAC CGTGACGGGT TGCCCACCGT GTTGCTGGAG GCGATGGCGC TTGGAACGCC CTGCATCGGG ACGGATGTGG TCGGCCTGCC GGAAATCCTG CGCCCGGGGG ACACCGGGCT GCTGGCCAGC GAGGGCGACC CCGACACCTT GTCCGCCGCG ATTTCGCAGA TGCTTGGCGA CATCGACCTG CGCCGGCGCG TGTCGCGCAA TGCGCGCCGG TTGATCGAGG AAGAGTTCGA CATCGACCGC AACGCGGCCC GGTTGCGCGA GCTCTTTGCG TCCTGCTCGG GCCCTGTGCC CGCCGGCCTG AAGGGAGCAG CCTGA
|
Protein sequence | MIHIRPPKQG KIGYICKRYP RFSETFIVHE ILAHERAGQQ VEIFALRPVM DTHFQDILSK VRAPVHRIPE KTRSVSVFRD LLEKAEALYP GAPQRALATG AATDAIAQGL ALAIDAKRLG VTHFHAHFGT VATTVARVAS QVSGIPYTFT AHAKDIYYRY DPPIELDVKL RDAAAAVTVS DFNLAYMTET FGKDAAGLVR LYNGLDLSGF AWSEPTARQT DILAVGRLIE KKGFHILVEA LWQLARKGQT PRCRIIGMGE DEDNLRSQIA AAGLEGQVTI EGPRPQSEVI AAMRDAAVLV CPCIVARDGN RDGLPTVLLE AMALGTPCIG TDVVGLPEIL RPGDTGLLAS EGDPDTLSAA ISQMLGDIDL RRRVSRNARR LIEEEFDIDR NAARLRELFA SCSGPVPAGL KGAA
|
| |