Gene Dshi_1993 details

Gene Information

Locus tag: Dshi_1993
Symbol:
ID: 5712988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2109965
End bp: 2111209
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 67%
IMG OID: 641267917
Product: putative glycosyltransferase
Protein accession: YP_001533333
Protein GI: 159044539
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
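
The reported gene and protein lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2109965
end_bp = 2111209

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1245
print(protein_length)  # 414
```

Under these assumptions the arithmetic agrees with the Gene Length (1245 bp) and Protein Length (414 aa) fields.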


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00985671
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.577803
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCATA TCCGTCCACC GAAACAGGGA AAGATCGGAT ATATCTGCAA GCGCTACCCG 
CGCTTTTCCG AGACCTTCAT CGTCCACGAG ATCCTCGCCC ATGAGCGCGC GGGCCAGCAG
GTGGAGATAT TCGCGCTGCG CCCGGTCATG GACACGCATT TCCAGGACAT CCTGTCGAAA
GTGCGCGCGC CCGTGCACCG GATCCCCGAG AAGACCCGCT CTGTCAGCGT GTTCCGGGAC
CTTCTGGAGA AGGCCGAGGC GCTTTATCCC GGTGCGCCGC AACGGGCGCT GGCCACGGGC
GCTGCGACCG ATGCCATTGC GCAGGGGCTT GCGCTGGCCA TAGACGCCAA ACGCCTTGGC
GTGACGCATT TCCACGCCCA TTTCGGCACG GTCGCCACGA CGGTCGCGCG CGTGGCCTCG
CAGGTCTCCG GCATTCCGTA TACTTTCACG GCCCATGCCA AGGATATCTA TTACCGCTAC
GACCCGCCGA TCGAGCTGGA CGTGAAGCTG CGCGATGCGG CGGCGGCGGT GACGGTTTCG
GATTTCAACC TGGCCTACAT GACCGAGACG TTCGGCAAGG ACGCGGCCGG GCTCGTGCGG
CTTTACAACG GGCTCGATCT GTCGGGCTTT GCGTGGTCCG AGCCGACGGC GCGGCAGACG
GATATCCTCG CCGTGGGCCG CCTGATCGAG AAGAAGGGGT TCCATATCCT TGTGGAGGCC
CTGTGGCAGT TGGCGCGCAA GGGGCAGACC CCGCGCTGCC GGATCATCGG CATGGGGGAG
GACGAGGACA ACCTGCGCAG CCAGATCGCG GCGGCGGGCC TGGAGGGTCA GGTGACCATC
GAAGGACCGC GCCCGCAATC CGAGGTCATC GCCGCCATGC GCGACGCCGC CGTTCTGGTC
TGCCCCTGTA TCGTGGCCCG CGACGGCAAC CGTGACGGGT TGCCCACCGT GTTGCTGGAG
GCGATGGCGC TTGGAACGCC CTGCATCGGG ACGGATGTGG TCGGCCTGCC GGAAATCCTG
CGCCCGGGGG ACACCGGGCT GCTGGCCAGC GAGGGCGACC CCGACACCTT GTCCGCCGCG
ATTTCGCAGA TGCTTGGCGA CATCGACCTG CGCCGGCGCG TGTCGCGCAA TGCGCGCCGG
TTGATCGAGG AAGAGTTCGA CATCGACCGC AACGCGGCCC GGTTGCGCGA GCTCTTTGCG
TCCTGCTCGG GCCCTGTGCC CGCCGGCCTG AAGGGAGCAG CCTGA
 
Protein sequence
MIHIRPPKQG KIGYICKRYP RFSETFIVHE ILAHERAGQQ VEIFALRPVM DTHFQDILSK 
VRAPVHRIPE KTRSVSVFRD LLEKAEALYP GAPQRALATG AATDAIAQGL ALAIDAKRLG
VTHFHAHFGT VATTVARVAS QVSGIPYTFT AHAKDIYYRY DPPIELDVKL RDAAAAVTVS
DFNLAYMTET FGKDAAGLVR LYNGLDLSGF AWSEPTARQT DILAVGRLIE KKGFHILVEA
LWQLARKGQT PRCRIIGMGE DEDNLRSQIA AAGLEGQVTI EGPRPQSEVI AAMRDAAVLV
CPCIVARDGN RDGLPTVLLE AMALGTPCIG TDVVGLPEIL RPGDTGLLAS EGDPDTLSAA
ISQMLGDIDL RRRVSRNARR LIEEEFDIDR NAARLRELFA SCSGPVPAGL KGAA
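
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the space-separated 10-base groups are purely cosmetic and uses the amino acid assignments of translation table 11, which match the standard code for internal codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not taken from the source database.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Table 11 assigns the same amino acids as the standard code; it differs
# in which codons may act as start codons (this gene starts with ATG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
fragment = "ATGATCCATA TCCGTCCACC GAAACAGGGA AAGATCGGAT ATATCTGCAA GCGCTACCCG"
print(translate(fragment))  # MIHIRPPKQGKIGYICKRYP
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would be one way to compare against the GC content field.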