Gene Dshi_1245 details

Gene Information

Locus tag: Dshi_1245
Symbol: melA
ID: 5711803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1292045
End bp: 1293385
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 66%
IMG OID: 641267157
Product: glycoside hydrolase family 4
Protein accession: YP_001532588
Protein GI: 159043794
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
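
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1292045
end_bp = 1293385

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the codon count.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1341 446
```

Both values agree with the Gene Length and Protein Length fields listed in the record.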


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.778162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGGA TCGCGTTCAT CGGCGCAGGA TCGACCATTT TCATGAAGAA CATCCTCGGC 
GATGCCCTGC ATTTCGAGGC CCTGCGCGAC GGCCATTTCG CCCTGATGGA TATCGACCCC
GACCGGCTGG CCGAAAGCGC CGCCGTCGCC CGCGCCATGA TCGCCACCAT GGGCACCGGG
GCCACGATCA GCACCCATGG AGAGCGGCGC GCTGCGCTAG AGGGGGCGGA TTTCGTGGTC
ACCGCCTTCC AGATCGGCGG CTACCAGCCC TGCACCGTGA CGGATTTCGA GATCCCCAAG
GTCTACGGCT TGCGCCAGAC CATCGGCGAC ACGCTCGGCG TTGGCGGCAT CATGCGCGGC
CTGCGCACGG TCCCGCATCT CTGGGCCGTG GCCGAGGATA TGGCGCAGCT CTGCCCGGAC
GCGACGCTGC TGCAATACGT CAACCCCATG GCGATCAACA CCTGGGCGCT GGCCGAACGG
TTCCCGACCC TGCGCCAGGT CGGCCTGTGC CACTCGGTGC AAAACACCGT GCAGGAACTG
GCCCATGACC TCGACCTGCC GCCACATGAG ATCCGCTACC GGGTCGCGGG GGTCAACCAC
GTGGCCTTTT TCCTCGACCT GACCCACCGG GGCCGCGACC TCTATCCGGC GCTGCGGGTG
GGATACGCAG AGGGGCGCCT GCCCAAGCCG CCGCTGCTGA TGCCGCGCTG CGCCAACAAG
GTGCGGTACG AGGTGATGAA TCACCTCGGC TATTTCTGCA CCGAAAGCTC CGAGCATCTG
GCCGAATACG TCCCCTGGTT CATCAAGAAC GGGCGCATGG ATCTGATCGA AACCTACGCC
ATCCCGCTCG ACGAATACCC CACCCGCTGC CTCGAGCAGA TCGCAGACTG GCGCGCCCAG
GCCGAGGCGC TGACCAATGC CGCCCGGATC GACGTGCCCA AGAGCCACGA GTTTGCCGCC
GAGATCATGA ACGCCGTGGT CACCAATACG CCCTACCGGA TCTACGGCAA CTTGGCGAAT
ACCGGCCAGA CCCCGCAACT GCCCCCGGGG GCCGCGGTGG AAACCCCCTG CCTCGTGGAT
GCGAACGGCG TGCAGCCCAC CACCGTCGCC GACATCCCGC CGCAACTGGT CGCACTCATG
CGCAGCCAGA TCAACGTGCA GGAACTTGTG GTCCGCGCGC TGATCGACGA GAATCCAGCG
CATCTCTATC ACGCCGCGAT GATGGACCCC CACACGGCCG CCGAGCTTGA CCTGCGCCAG
ATCCGCAGCC TGGTCACCGA CCTGCTCAAC GCCCATGGCG ACTGGATCCC GGCCTGGGCC
CGCCCCGCCA AGGCCGCCTG A
 
Protein sequence
MTRIAFIGAG STIFMKNILG DALHFEALRD GHFALMDIDP DRLAESAAVA RAMIATMGTG 
ATISTHGERR AALEGADFVV TAFQIGGYQP CTVTDFEIPK VYGLRQTIGD TLGVGGIMRG
LRTVPHLWAV AEDMAQLCPD ATLLQYVNPM AINTWALAER FPTLRQVGLC HSVQNTVQEL
AHDLDLPPHE IRYRVAGVNH VAFFLDLTHR GRDLYPALRV GYAEGRLPKP PLLMPRCANK
VRYEVMNHLG YFCTESSEHL AEYVPWFIKN GRMDLIETYA IPLDEYPTRC LEQIADWRAQ
AEALTNAARI DVPKSHEFAA EIMNAVVTNT PYRIYGNLAN TGQTPQLPPG AAVETPCLVD
ANGVQPTTVA DIPPQLVALM RSQINVQELV VRALIDENPA HLYHAAMMDP HTAAELDLRQ
IRSLVTDLLN AHGDWIPAWA RPAKAA
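
The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of how both could be recomputed, assuming that the sense-codon assignments of table 11 match the standard code (table 11 differs mainly in its permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical and written for this illustration only.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence, as displayed above.
print(translate("ATGACCCGGA TCGCGTTCAT"))  # MTRIAF
print(gc_content("ATGACCCGGA"))            # 0.6
```

Passing the complete Gene sequence block to `translate` and `gc_content` allows the result to be compared with the Protein sequence and GC content fields of the record.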