Gene Information |
Locus tag | Dshi_1541 |
Symbol | |
ID | 5713198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1602173 |
End bp | 1604119 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267456 |
Product | putative amidohydrolase 3 |
Protein accession | YP_001532884 |
Protein GI | 159044090 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
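The length fields above follow from the coordinates. The span from Start bp to End bp, counted inclusively, gives the gene length; dividing by three and dropping the stop codon gives the protein length. A minimal Python sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1602173, 1604119)
print(length, protein_length_aa(length))  # prints: 1947 648
```

Both values match the Gene Length and Protein Length fields reported for Dshi_1541.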
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.260198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.229672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGCT TTCTGCCCCT GATCGCCGCC CTTCTCGCGG CCCTCGCCGC CCCCCTCGCA CTGGCCGACA CCGACACCGA CACCGCCTTC GCCGACCGGA TCTGGACCGG CGGGCCGATC CTGACCATGG AAGACAACGC CATGCGCGCC GAGGCCCTGG CCGAAAAGGA CGGCGTGATC CTCGGCGTCG GACCTCTCGA CGAGGTCACC GCCTTCCAGG GACCCCAGAC GCAGATGATC GACCTGGCCG GGCGCACCAT GATCCCGGGC TTCGTGGACG CCCACGGCCA TGTCTTCATG ATCGGCCTGC AGGCGCTCTC GGCCAACCTG TTGCCCGCAC CCGATGGCAC CGTGAACGAC ATCCCGACCC TGCAAGCGGT GCTGCGCGCC TTCGCCGAGG CCCAGCCCGA GCGTGTCGCG GCCGCCGGTC TCATCCTCGG CTTCGGCTAT GACGACGCCC AGCTGGCCGA GCAGCGGCAT CCGACACGGG ACGAGCTCGA CGCCGTCTCC ACCGAGATCC CGGTCTATGC CATCCACCAA TCCGGCCATC TCGGCGTGGC CAACTCACTG GCACTCGAAC AGGCTGGCAT CACCGCCGAC ACCCCCGACC CGGCGGGGGG CGTCATCCGC CGCGGCCCGG ATGGCGCACC CAACGGCGTG CTCGAGGAAA ACGCCGCCAA CATGGTGATC GGCGGTCTGC TCGGCGGGCT GGACGAAGCC GCCAACCGCG CGATCTTTCG CGCGGGCACC GAGTTGATCG CCTCTTTCGG CTACACCACC GCGCAGGAAG GCCGCGCGCT CCCGCCCGTG GCCGAGCTGA TGCAATCCGT CGCCGCCGAA GAAGGGCTGG ACATCGACGT GGTGGTCTAC CCCGATGTGC TGCTGGCCCG CGACTACATC CTGGAACACC ACGCGCGGGT CTACGAGAAC CGGATCCGCG TCGGCGGCGG CAAGCTGACC ATCGACGGAT CGCCCCAGGG CTTCACCGCC CTGCGCGACC GGCCCTATTA CGACCCGCCC GAAGGGGTTC GCGCCGATTA CGCGGGCTAT GCCTCGGCCA GCGGCGACCA GGTGTTCGAC GCGGTCGACT GGGCCTTCGC CAACGGGGTG CAGATCCAGA CCCATGCCAA TGGGGAAGGG GCCTCCGACA TGCTGATCGC GGCCATTCAG ACCGCGACAG AGACTCACGG GGCCGCCGAC CGCCGCCCGG TCCTGATCCA CGGGCAGTTT TTGCGCGAGG ACCAGGTGGA CGCCTATAAC CGGCTCGATG TGTTCCCCTC GCTCTTTCCG ATGCACACCT TCTATTGGGG CGACTGGCAC CGTGACCGCA CCGTGGGGCC CGTGGCCGCC GACAACATCT CGCCCACGGG TTGGGTACGG GCGCGTGGCA TGATGTTCTC CAGCCACCAT GACGCGCCCG TGGCCTTCCC CGATAGCATG CGCATCCTGG ACGCCACCGT CACCCGGCGC TCGCGGTCCG GCGACATCAT CGGCCCCGAC CACCGGGTCG ACGTGCTCAC CGCGCTCAAG GCGATGACGA TCTGGCCCGC CTACCAGCAT TTCGAGGAAG ACAGCAAAGG CTCCCTCGCG CCGGGCAAAC TGGCCGATCT CGTCATATTG TCAGACGATC CGACCGTGAT CGATCCCGAA CAACTCGACA CGATCACCGT GGTCGAGACC ATCAAGGAGG GACAGACGAT CTACGCCGCG GGCTTGCGCG AGGGGCGGCT GCATTACCGC CCGCGCAGGG ACGGCACCGA CCCCTATGCC GGGTTCCTGC GCACCGTGGC CATCGCCCAC 
GAGATGCAAG GCAATCCCGG CATGCTGAGC CGCATCCGCC CCTCGACCCT GGCACGCGCG CCCCATTCCA GCGCCTGCGT CGCCCGCACC CTGAGCGACC TGATCACCGC CAACCTGTCC CTGCCGGACA CGCCTTTGCT TCCCTGA
|
Protein sequence | MPRFLPLIAA LLAALAAPLA LADTDTDTAF ADRIWTGGPI LTMEDNAMRA EALAEKDGVI LGVGPLDEVT AFQGPQTQMI DLAGRTMIPG FVDAHGHVFM IGLQALSANL LPAPDGTVND IPTLQAVLRA FAEAQPERVA AAGLILGFGY DDAQLAEQRH PTRDELDAVS TEIPVYAIHQ SGHLGVANSL ALEQAGITAD TPDPAGGVIR RGPDGAPNGV LEENAANMVI GGLLGGLDEA ANRAIFRAGT ELIASFGYTT AQEGRALPPV AELMQSVAAE EGLDIDVVVY PDVLLARDYI LEHHARVYEN RIRVGGGKLT IDGSPQGFTA LRDRPYYDPP EGVRADYAGY ASASGDQVFD AVDWAFANGV QIQTHANGEG ASDMLIAAIQ TATETHGAAD RRPVLIHGQF LREDQVDAYN RLDVFPSLFP MHTFYWGDWH RDRTVGPVAA DNISPTGWVR ARGMMFSSHH DAPVAFPDSM RILDATVTRR SRSGDIIGPD HRVDVLTALK AMTIWPAYQH FEEDSKGSLA PGKLADLVIL SDDPTVIDPE QLDTITVVET IKEGQTIYAA GLREGRLHYR PRRDGTDPYA GFLRTVAIAH EMQGNPGMLS RIRPSTLARA PHSSACVART LSDLITANLS LPDTPLLP
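The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below does that and computes GC content as a fraction. `strip_blocks` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def strip_blocks(display_seq: str) -> str:
    """Remove the spaces and line breaks used for display and normalise to upper case."""
    return "".join(display_seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungapped nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used here only as a short example.
fragment = strip_blocks("ATGCCGCGCT TTCTGCCCCT")
print(len(fragment), gc_fraction(fragment))
```

Run on the complete gene sequence, the result can be compared against the GC content field in the Gene Information table.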