Gene Dshi_1649 details

Gene Information

Locus tag: Dshi_1649
Symbol:
ID: 5713214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1713282
End bp: 1714940
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 61%
IMG OID: 641267565
Product: alpha-glucosidase
Protein accession: YP_001532992
Protein GI: 159044198
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
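
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is an illustration only: it assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, and the variable names are invented for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1713282
end_bp = 1714940

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1659 0 552
```

Under these assumptions the computed values agree with the listed gene length (1659 bp) and protein length (552 aa).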


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.856905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.000000195123
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGCAG AGGCGCAAAT GCGCGAGGTC AAGAGCCTCG CTGCCGATCC GGATTGGTGG 
CGTGGGGCGG TGATCTATCA GATCTACCCC CGTAGCTTCC AGGACAGCAA TGGCGACGGG
ATCGGCGATC TTCTGGGGAT CGTGGAGCGG ATGCCCTATA TCGCCTCGCT CGGGGTGGAT
GCGATCTGGA TCAGCCCGTT TTTCACGTCG CCGATGAAGG ATTTCGGCTA CGACATCAGC
GATTATTTCG ACGTTGATCC TATGTTTGGC AGCTTGGCGG ATTTCGATGC CCTGATCGAG
ACCGCGCACA TGTATGGGCT GCGGGTGATG ATCGACCTTG TTCTCAGCCA CACCAGCGAC
CAGCACCCCT GGTTCGAGGA AAGTCGATCC AGTCGCGACA ACCCCAAGGC TGACTGGTAT
GTCTGGGCCG ATGCCAAGCC AGACGGGACG CCGCCGAACA ACTGGTTGTC GATCTTTGGC
GGCTCAGGCT GGCATTGGGA CGCGCGGCGT TGCCAGTACT ACCTGCACAA CTTCCTGACT
TCGCAGCCGG ATCTGAACTT TCACTGCGCC GACGTTCAGG ATGCGCTGCT AGGTGTGGGG
CGGTTCTGGC TGGACCGGGG CGTGGATGGC TTCCGGCTCG ACACCATCAA CTTCTATGTC
CACGACGCAG AGTTGCGCGA CAATCCTCCG TTGCCGCCGG AAGAACGCAA CTCCAACATC
GCGCCAGAAG TGAACCCCTA TAATCACCAG CGGCATCTTT ACTCCAAGAA CCAGCCGGAG
AACCTGGAAT TCCTGGCAAA ATTCCGCGCG ATGATGGAGG AATACCCAGC AATCGCGGCT
GTGGGCGAGG TCGGTGACGC GCAGTACGGG TTGGAGATCC TCGGCCAGTA CACCCGGGGA
GAGACCGGGG TGCATATGTG CTACGCCTTC GAATTTCTTG CGCAGGAGAA GCTGACCGCC
AAGCGGGTGG CGGAGGTTCT CAACAAGGTC GACGAGGTTG CGTCCGACGG TTGGGCGTGC
TGGGCGTTTT CCAACCATGA CGTTATGCGC CATGTCTCCC GCTGGGACCT GACCCCGGGC
GCACAAAGAG GGATGCTGAC CCTCCTGATG TGTCTGCGCG GGTCCGTATG CCTGTATCAA
GGCGAAGAGT TGGGCCTGCC GGAGGCGGAA GTGGCCTTCG ACGACCTGCA GGACCCCTAC
GGCATCGAGT TCTGGCCGGA ATACAAGGGG CGGGACGGCT GCCGGACGCC GATGGTCTGG
CAATCGGACA ACATGTCGGG CGGCTTCTCT ATTCACCGGC CCTGGTTGCC GGTCTCGACC
GAGCATCTCG GCCTAGCGGT CGCGGTCCAG GAAGAAGCCC CGGACGCGCT GTTGCACCAT
TACCGCCGGG CGCTGGCCTT CCGCCGCGCC CATCCGGCAT TGGTGAAAGG CGATATTTCG
GATGTGACCG TTGTCGGCGA CGTCATTAGC TTTCTGCGCA AGGACCCCGA AGAGACGGTA
TTCGTCGCCA TCAACATGAG CGATGCGCCC GGTGCGGTCG ATCTGCCTCC GGGCAACTGG
ATGCAGATCG GGGCGGAATT GAATTCAGGC GGGACAAGCC CGGACGGGCG CGTGCATCTC
GGGCCTTGGC AGCCCTGCAT TGCGCTGAAG GCACCGTGA
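
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation such as the GC content reported above. The following is a minimal sketch of that clean-up step; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool used to produce this record, and only the first sequence line is used as a demo.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First line of the gene sequence above, used here only as a short demo.
first_line = "ATGAACGCAG AGGCGCAAAT GCGCGAGGTC AAGAGCCTCG CTGCCGATCC GGATTGGTGG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this one line only
```

Passing the full listing through `clean_sequence` and then `gc_percent` would give a value to compare with the GC content field; the result for a single line is not expected to match the whole-gene figure.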
 
Protein sequence
MNAEAQMREV KSLAADPDWW RGAVIYQIYP RSFQDSNGDG IGDLLGIVER MPYIASLGVD 
AIWISPFFTS PMKDFGYDIS DYFDVDPMFG SLADFDALIE TAHMYGLRVM IDLVLSHTSD
QHPWFEESRS SRDNPKADWY VWADAKPDGT PPNNWLSIFG GSGWHWDARR CQYYLHNFLT
SQPDLNFHCA DVQDALLGVG RFWLDRGVDG FRLDTINFYV HDAELRDNPP LPPEERNSNI
APEVNPYNHQ RHLYSKNQPE NLEFLAKFRA MMEEYPAIAA VGEVGDAQYG LEILGQYTRG
ETGVHMCYAF EFLAQEKLTA KRVAEVLNKV DEVASDGWAC WAFSNHDVMR HVSRWDLTPG
AQRGMLTLLM CLRGSVCLYQ GEELGLPEAE VAFDDLQDPY GIEFWPEYKG RDGCRTPMVW
QSDNMSGGFS IHRPWLPVST EHLGLAVAVQ EEAPDALLHH YRRALAFRRA HPALVKGDIS
DVTVVGDVIS FLRKDPEETV FVAINMSDAP GAVDLPPGNW MQIGAELNSG GTSPDGRVHL
GPWQPCIALK AP
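
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a rough sketch of that relationship, the code below builds the codon assignments from the usual TCAG-ordered listing and translates up to the first stop codon. The `translate` function is a hypothetical helper written for this example; it is a simplification that does not apply the alternative start codons permitted by table 11, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAACGCAG AGGCGCAAAT GCGCGAGGTC"))  # MNAEAQMREV
print(translate("GCACCGTGA"))                         # AP
```

The two outputs match the first ten and last two residues of the protein sequence listed above.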