Gene Dshi_3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3045 
Symbol 
ID5710897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3213310 
End bp3214326 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID641268972 
Productputative glycosyl transferase 
Protein accessionYP_001534379 
Protein GI159045585 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.109272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.871174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGT CACCCTCCGC CCCCCTGCCC GCCGAGATCC GCGCCACGCG GCCCCTGACC 
GTGGTGGTCG CCGCCTGCAC AAGGCGCAGG CCGAAGATGC TGGAGCGGTT GCTTGGCTCC
TACGCCGCGC TGGAGGTGCC GGAGAACGTC ACGCCGATCT TCCTCGTGGT CGAGAATGAC
GAGACCGCCC GGAGTACCGA GGTGATCGCG GCCTTCGAGG ACAAGCTGCC CGGGCCTCTG
CACGCCGTTC TGGAAACCGT GCCGGGTATC CCCATGGCAC GCAATCGCGG GCTGGTGGAG
GCCGCCGCCC TCGGCGCGGA CCTGGTGCTC TATGTCGATG ATGACGAGAC CGTGGCGCCC
GACTGGCTGA CCGAGATCGT GGCCGCCTGG CGCGGCGGCA CGGCCGAGCT GATCGGCGGC
CCCGTGCGGC TGACCGAGCC GCAGGCACCT CTCAGCGGAC CCCAGAAAAC CGTCTTCGAT
GGCATGGTCA AACGCTTCGC CACCAAGGAG GCCCGCGCGG TCGACCGCAT GAAGGCCGGG
CAGGCCGACC GGGTGACCGT GGTCACGAAC AACTGGCTTT GCGACATGCG GCTGGTGCGC
GACCTCGGCC TGCGCTTCGA CGAGGCGCTG CAATTCACCG GCGGGTCCGA CACCAAGTTC
TTCCGCGACG CCCGCGCCAA GGGGGTCGAG ACCGGCTGGG CCCCTGCCGC CATCGTTTAT
GAAACCGTCC CGCCCGAACG GCTCACCTTG CCCTATCAGT ACACGCGCGG CCGGGACCAG
TCGGCCACCT CCTTCGGCCA GAAAGTTGCC GAAGGCAAAT GGGCCAGTGC GGCCACCAGC
ATCCTGATTT TGCTGCCGCT CAAGGCGCTG TCGCTGGTCC TGATCGCCCT GTCGCTGCCG
GTGACGCGCA GCTACGGGCT GGTATCGCTC TTCCGGCAGG CAGGCTGGAT CGCCGGACGC
CTGACCCGGC TCTTCGGGCG CGCCTCGAAG CTCTACGTCA AGACGACGGG AAACTGA
 
Protein sequence
MSQSPSAPLP AEIRATRPLT VVVAACTRRR PKMLERLLGS YAALEVPENV TPIFLVVEND 
ETARSTEVIA AFEDKLPGPL HAVLETVPGI PMARNRGLVE AAALGADLVL YVDDDETVAP
DWLTEIVAAW RGGTAELIGG PVRLTEPQAP LSGPQKTVFD GMVKRFATKE ARAVDRMKAG
QADRVTVVTN NWLCDMRLVR DLGLRFDEAL QFTGGSDTKF FRDARAKGVE TGWAPAAIVY
ETVPPERLTL PYQYTRGRDQ SATSFGQKVA EGKWASAATS ILILLPLKAL SLVLIALSLP
VTRSYGLVSL FRQAGWIAGR LTRLFGRASK LYVKTTGN