Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1245 |
Symbol | melA |
ID | 5711803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1292045 |
End bp | 1293385 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267157 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_001532588 |
Protein GI | 159043794 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.778162 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGA TCGCGTTCAT CGGCGCAGGA TCGACCATTT TCATGAAGAA CATCCTCGGC GATGCCCTGC ATTTCGAGGC CCTGCGCGAC GGCCATTTCG CCCTGATGGA TATCGACCCC GACCGGCTGG CCGAAAGCGC CGCCGTCGCC CGCGCCATGA TCGCCACCAT GGGCACCGGG GCCACGATCA GCACCCATGG AGAGCGGCGC GCTGCGCTAG AGGGGGCGGA TTTCGTGGTC ACCGCCTTCC AGATCGGCGG CTACCAGCCC TGCACCGTGA CGGATTTCGA GATCCCCAAG GTCTACGGCT TGCGCCAGAC CATCGGCGAC ACGCTCGGCG TTGGCGGCAT CATGCGCGGC CTGCGCACGG TCCCGCATCT CTGGGCCGTG GCCGAGGATA TGGCGCAGCT CTGCCCGGAC GCGACGCTGC TGCAATACGT CAACCCCATG GCGATCAACA CCTGGGCGCT GGCCGAACGG TTCCCGACCC TGCGCCAGGT CGGCCTGTGC CACTCGGTGC AAAACACCGT GCAGGAACTG GCCCATGACC TCGACCTGCC GCCACATGAG ATCCGCTACC GGGTCGCGGG GGTCAACCAC GTGGCCTTTT TCCTCGACCT GACCCACCGG GGCCGCGACC TCTATCCGGC GCTGCGGGTG GGATACGCAG AGGGGCGCCT GCCCAAGCCG CCGCTGCTGA TGCCGCGCTG CGCCAACAAG GTGCGGTACG AGGTGATGAA TCACCTCGGC TATTTCTGCA CCGAAAGCTC CGAGCATCTG GCCGAATACG TCCCCTGGTT CATCAAGAAC GGGCGCATGG ATCTGATCGA AACCTACGCC ATCCCGCTCG ACGAATACCC CACCCGCTGC CTCGAGCAGA TCGCAGACTG GCGCGCCCAG GCCGAGGCGC TGACCAATGC CGCCCGGATC GACGTGCCCA AGAGCCACGA GTTTGCCGCC GAGATCATGA ACGCCGTGGT CACCAATACG CCCTACCGGA TCTACGGCAA CTTGGCGAAT ACCGGCCAGA CCCCGCAACT GCCCCCGGGG GCCGCGGTGG AAACCCCCTG CCTCGTGGAT GCGAACGGCG TGCAGCCCAC CACCGTCGCC GACATCCCGC CGCAACTGGT CGCACTCATG CGCAGCCAGA TCAACGTGCA GGAACTTGTG GTCCGCGCGC TGATCGACGA GAATCCAGCG CATCTCTATC ACGCCGCGAT GATGGACCCC CACACGGCCG CCGAGCTTGA CCTGCGCCAG ATCCGCAGCC TGGTCACCGA CCTGCTCAAC GCCCATGGCG ACTGGATCCC GGCCTGGGCC CGCCCCGCCA AGGCCGCCTG A
|
Protein sequence | MTRIAFIGAG STIFMKNILG DALHFEALRD GHFALMDIDP DRLAESAAVA RAMIATMGTG ATISTHGERR AALEGADFVV TAFQIGGYQP CTVTDFEIPK VYGLRQTIGD TLGVGGIMRG LRTVPHLWAV AEDMAQLCPD ATLLQYVNPM AINTWALAER FPTLRQVGLC HSVQNTVQEL AHDLDLPPHE IRYRVAGVNH VAFFLDLTHR GRDLYPALRV GYAEGRLPKP PLLMPRCANK VRYEVMNHLG YFCTESSEHL AEYVPWFIKN GRMDLIETYA IPLDEYPTRC LEQIADWRAQ AEALTNAARI DVPKSHEFAA EIMNAVVTNT PYRIYGNLAN TGQTPQLPPG AAVETPCLVD ANGVQPTTVA DIPPQLVALM RSQINVQELV VRALIDENPA HLYHAAMMDP HTAAELDLRQ IRSLVTDLLN AHGDWIPAWA RPAKAA
|
| |