Gene Dshi_3007 details


Gene Information

Locus tag: Dshi_3007
Symbol: ybgG
ID: 5710859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3172670
End bp: 3175837
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 67%
IMG OID: 641268934
Product: alpha-mannosidase
Protein accession: YP_001534341
Protein GI: 159045547
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0383] Alpha-mannosidase
TIGRFAM ID:
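
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3172670
end_bp = 3175837

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3168
print(protein_length_aa)  # 1055
```

Under these assumptions the computed values agree with the Gene Length (3168 bp) and Protein Length (1055 aa) fields.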


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.799127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCACC AGATCTTCTG GACCGAGCAG AAAATCCTCA GCCGCCTGCC GCTGATCGCG 
CCGCTGGTCC ACCGCCGTGC TGCCCCGATC CCGCCCTTCC GGTACCTGCC GCTGGACAGC
CCGCGGGCCG AGGCCCCGGT GGGCCGCGAT GTGGATACCG CCGACTGGGC CGAGATCCCG
TTCGACAGCT ATTGGGGCGA CTGGAGCCAG GATTTCGTCC TCCGCTCCGA GTTCGCGGTG
CCCGAAGGCT TCGGCATCCA CGGCCCCGTC GCCCTGCACC TGCCGCTCGG GGTCGCGGGC
GACATCTTCT GCCACCCCGA GGCGCTGGCC TATATCGACG GGACCGGCTT TGCCTCCGCC
GACCGTCAGC ACCACGAGAT CTACCTCGAC CCCGACCTCT GCGACGGCGC GCGCCACAGC
CTCGCGCTCC ATGGCTGGAC CGGGCTCAGC GGCTGGCCGC CCGACCGCAA CGCCAGGACC
AAGCTCTTCA TGAAGCCCTG CGCCGTGGTC GAGATCGACA CGCCCACACG CGAGCTGGTG
GCGCTCGTGA GCCGTGCGCT GGACGTGGCC AGCCACATGG AGGACGGCAA CACCGCCCGG
AGCCGGATCC TCCGCGCGCT CGATCTGTGC TTCAAGGCGC TCGACACCCG CGACCCCATC
CGCTCCGACG CCTTCTACGA CAGCGTCCCC ACGGCCCTGG CCACGCTGCG CGAAGCGCTG
GAAGGCTACA AGGGCCGGAT GGAGGTCGAC ATCCTCGCCA TCGGCCACGC CCATATCGAC
ATCGCCTACC TGTGGCAGGT CGACCAGACC CGGCGCAAAT GCGGGCGCAG CTTCTCCAAC
GTGCTCAAGC TGATGGAGGA ATTTCCCGAC TACCATTTCA GCGCCTCGCA ACCGGTCCTC
TACGAGATGA CCGAGGCCGA CTACCCGCAG GTCTTCGACG GCATCCGCGC CCGCGTTGCC
GAAGGCCGGT GGGAGCCGAT GGGCGGCATG TGGGTCGAGC CGGATTGCAA CCTCGCGGGC
GGCGAGAGCC TCGTGCGGCA GCTGATGTAC GGGCGGCGCT ATTTCATCGA CAAGTTCGGC
GAAACCGCCG AGACCCCCGT GCTCTGGCTG CCCGACACGT TCGGCTTCAC CGCCGCCCTG
CCGCAGCTGA TGCAACAGGC CGGGCTGAAG TGGTTCGTCG CCAACAAGCT GAACTGGAAC
CAGTACAACC AGATGCCCAA CCAGTTGTTC TGGTGGGAAG GCATCGACGG CTCGCGCGTG
CTCAGCCACT TCCTTACGAC CCCCTCCACG GTGCAATACC TGCCCCATCC CACCACCTAC
AAGGCCGAGA TGACCGCGCG CGAGGTCTTC GGCACCTGGG ACAATTTCCG CCAGAAACAC
CTGCACCAGG AATTGATCAC CTGTTTCGGC TACGGCGATG GCGGTGGCGG GCCGACCCGC
GAGTTGATCG AGGCGGCCCG CGCCTATGCC GAATTGCCCG GCACGCCCTC GGTGCGCATG
GGCACCGTGC GGGAGTTTTT CGAGCGCGTC GAGGAAAACG CCGCCCATGA CCTGCCCACC
TGGTCCGGCG AGTTCTACCT GGAACTGCAC CGCGGCACCT TGACCAGCCA GGCCCGGATC
AAGCGCGCCA ATCGCAAGGC GGAGGTGCTG CTGCACGACG CCGAGTTCCT CGCGAGCCTC
GCGGGGGTCA TCACCGACCA CCCCTATCCG GGCGACGCGC TCGAAGAGGC GTGGAAACTC
GTCCTTCTGA ACCAGTTCCA CGACATCTTG CCCGGCACCT CGATCACCCA GGTGTTCGAG
GATGCGACCC GCGACTACGC CAAAGTGACC GAGATCGGCA CCGCCGCGCG CGACACGGCC
CTCGCCGCCC TCACCACCCG GATGCCGCCC GAGGCGCAGG TGATGGCCGT GAACCCCACC
GGGTTTTCCG TGGACCGGAT CGGCTGGATC GCCGAACCGG TGGCGGGGCT GTGCGACCTG
CGCACCAACC GCCCCCTGCG CACCCAACCC GTGGCCGATG GCACCCTGAT CGACCTGCCC
GCCCTGCCGG GCTATGCCGC GCTCGGCCTC GGACCCTGCG CCGAGGCGCC CGAGCCCACG
TCGCTGCGCA TCACCGACTT CCCAGATGGC GCCGTTCTGG AAAACGACCT GATCCGGGTC
CATGTCGCGC CCAACGGGCA GCTCGTGTCG GTCTTCGACA AGACCGCCCT GCGCGAGGTG
CTCGCCGCAG ACGCCTTCGG CAACCAGCTC CTTGCCTTCG AGGACCGACC CATGGTCTGG
GATGCCTGGG ATATCGACAT CTTCTACGAA GACCGCTGCG AGGTGATCGA CGCGCCCGTG
CGGTTCGAGA TCACCGAACG CGGTCCCCTG CGCGCATCAC TGGAGGTCGA ACACCATTGG
CGCGGCTCCA CCATCACCCA GCGCATCAGC CTGCATCACA ACTCCAAACG GATCGAGTTC
AAGACCGACG TGGACTGGCA AACCTCCCAT ATCCTGCTCA AATGCGCCTT CCCGGTGGAG
GTCTTCTCCC CCCGTGCCAC CTACGACATC CAGTTCGGAA ATACCGAACG CACCACCCAC
CGCAACACCA GCTGGGACTG GGCCCGGTTC GAAAGCGTCG GGCACAAATG GGCGGACCTG
AGCGAAGGCA ACTATGGCGT CGCCCTGCTC AATGATTGCA AATACGGCTA CGACATCCAC
CACAACGTGA TGCGCCTGAG CCTGCTGAAA TCCGCCACCA TGCCCGACCC GGTGCAGGAC
CGCGGCCGCC ACGAGATGAC CTATGCGCTC CTGCCGCATG AAGGCAACTG GCGCTCCGAC
GTGACCGAGA GCGGTTATAT CCTCAACAAC CCGGTGCTTT GCCGACCCGT CGCGGCCGGG
CAGGGCGACG CAGCCCTGGT GCAGATGGTC GGCGTCTCGG GCTTTTCGAC CGTGATCGAC
ACGGTCAAAC GGGCCGAGGA CGGGCAGGGC TATATCGTGC GTCTTTTCGA GAACGAGCGC
ACGCGCGGCC CCGTCACGCT ACGTTTCGGG TTCGACCTGG GCGAGGTGCG CGCCGTCACC
ATCCTGGAGG ACGACATCGG CCCGGTGGCC ATGCGCGGCC GGGAGGTCAC GGTGGAACTG
ACGCCCTACA AGATCGTGTC GCTCCGTGTG ATCCCCGCCG GGGCCTGA
 
Protein sequence
MKHQIFWTEQ KILSRLPLIA PLVHRRAAPI PPFRYLPLDS PRAEAPVGRD VDTADWAEIP 
FDSYWGDWSQ DFVLRSEFAV PEGFGIHGPV ALHLPLGVAG DIFCHPEALA YIDGTGFASA
DRQHHEIYLD PDLCDGARHS LALHGWTGLS GWPPDRNART KLFMKPCAVV EIDTPTRELV
ALVSRALDVA SHMEDGNTAR SRILRALDLC FKALDTRDPI RSDAFYDSVP TALATLREAL
EGYKGRMEVD ILAIGHAHID IAYLWQVDQT RRKCGRSFSN VLKLMEEFPD YHFSASQPVL
YEMTEADYPQ VFDGIRARVA EGRWEPMGGM WVEPDCNLAG GESLVRQLMY GRRYFIDKFG
ETAETPVLWL PDTFGFTAAL PQLMQQAGLK WFVANKLNWN QYNQMPNQLF WWEGIDGSRV
LSHFLTTPST VQYLPHPTTY KAEMTAREVF GTWDNFRQKH LHQELITCFG YGDGGGGPTR
ELIEAARAYA ELPGTPSVRM GTVREFFERV EENAAHDLPT WSGEFYLELH RGTLTSQARI
KRANRKAEVL LHDAEFLASL AGVITDHPYP GDALEEAWKL VLLNQFHDIL PGTSITQVFE
DATRDYAKVT EIGTAARDTA LAALTTRMPP EAQVMAVNPT GFSVDRIGWI AEPVAGLCDL
RTNRPLRTQP VADGTLIDLP ALPGYAALGL GPCAEAPEPT SLRITDFPDG AVLENDLIRV
HVAPNGQLVS VFDKTALREV LAADAFGNQL LAFEDRPMVW DAWDIDIFYE DRCEVIDAPV
RFEITERGPL RASLEVEHHW RGSTITQRIS LHHNSKRIEF KTDVDWQTSH ILLKCAFPVE
VFSPRATYDI QFGNTERTTH RNTSWDWARF ESVGHKWADL SEGNYGVALL NDCKYGYDIH
HNVMRLSLLK SATMPDPVQD RGRHEMTYAL LPHEGNWRSD VTESGYILNN PVLCRPVAAG
QGDAALVQMV GVSGFSTVID TVKRAEDGQG YIVRLFENER TRGPVTLRFG FDLGEVRAVT
ILEDDIGPVA MRGREVTVEL TPYKIVSLRV IPAGA
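
The relationship between the two sequences above can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons).

```python
# Standard codon-to-amino-acid assignments, which translation table 11 shares.
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence (first three blocks of the first line).
print(translate("ATGAAGCACC AGATCTTCTG GACCGAGCAG"))  # MKHQIFWTEQ

# Last 18 bases of the gene sequence, which end in the TGA stop codon.
print(translate("ATCCCCGCCG GGGCCTGA"))  # IPAGA
```

Both results match the corresponding ends of the protein sequence listed above (`MKHQIFWTEQ` at the start, `IPAGA` at the end).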