Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1955 |
Symbol | |
ID | 5712949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2048152 |
End bp | 2049948 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641267880 |
Product | putative short-chain dehydrogenase |
Protein accession | YP_001533297 |
Protein GI | 159044503 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [COG2910] Putative NADH-flavin reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00955158 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00174251 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATACC TTGTGACGGA TGCGGCGGGC GCGGTCGGGC ATCGCCTCGC CGCGACCCTT GTGGCCGCGG GGCACGAGGT GCTGGCCTTC TCGGACAGCC CCGTCAATGG GGTGGCCAGC CGGGCCTGGC CCACCCGTGC GGACGTGCTG GCCAGGGCGC TGGCGGGGGT GGACGTGATC TGCGCGGTGG ACCTGGCGAC GGTCGCGCCG GACGCGGCCC CGGCGCTGCT CAAACGTCTG GACAAGCTGC GGGTTGCGGC GGGCAAGGCC GGGTGCGGCC GGTTCGTCCT GCTGGGGCGC GCCGATATCT ACGACCCCGC GCTTCTGGCC GACAAGACGC TGCTGGAACG CTCGGCCACC GCCGCGCCCG CCGACCTCGC CCCACCTGCG GCGGCGGCCC TCGCGCTGGA GGAGAGCCTG CGCGCCGGCG CGGGGCTCGA CTGGACAATC CTGCGCGCGC CGATGATCCT CGCCCCCGAC AGCGCCGAGG CCAAGGCCCT CTTCCGCCGC ATCCTGATCG ACGACGCCCC GTCGCCCGCC CGGTTCCACC CGGTCGATGC CCGGGACGTG GCCACGGCCC TGGCCACCCT CGGCACGCAC GACCGCGCAG GGGGGCAGGT GTTCAACATC GCCGCCCCCG CGGCCCTCGC CGCCCGGACC CTGGACCAGG AGCTGGGCCG GCTCGCGCGG CTGCTGAACG ACGAGGCCAC GCCGCAGGAC AAGCTCCGCC CGCCCCACCA GGTCGCGGAC CCGGTGCTGG CCACGGACAA GCTCGCGCGC CTGACCGGCG TCACCCCGAC GCGGCCGATC TGGGTCAGCC TCGCCGAGAC GATGCAGCAG GTGATGAAGG ACCTGCGCGC CGACGGCACC CTCCCGCCGC TGCCCGAGAA GATCAACCCG GTCGTCAAGG CAGTGGAGTT GGGGCAAAAA CTGCTCGCCG GCAAGACCGC GGTGATCACC GGCGTGACCT CCGGCATCGG CCGCGCGACC AGCCTGCTGC TGTCGCGGCT CGGGGCCACG GTGGTGGGCA TCGCCCGCAA TGCCGAGGCC GGGGAAGCCT GGGAAGCGGC CCTGGCCGAA GGCCGCCACA CCGTGCCCGG CCATTTCATC CAGGCCGACC TGATGTCCTT TGCCCGGATC CGCACCCTCG CCGCCGACCT CGCCACGCGC TTTCCGCGGA TCGACATCCT GATCAACAAT GCGGGCGCGA ACTTCCCCAC CCGGCGCCTG ACCGAGGACG GGGTCGAGGC CACCCTGGCG ATCAACCATT TCGCGCCCTT CTTGCTGACC AACCTGCTGG CGGGCCCGCT GAAGGCGGCC CCGGCGGCGC GGATCATCAT GGTGAATTCC GACTGGCACC GCCGCTCCCT GCCGGACATG CACGACCTGC AGATGGAGAG CGGCTATCGC ACGGGCGAGG CCTATTGCCG GGCCAAGTTC ATGAATTTGC AGATCACCTA CGCCATGGCC GCCCTGCTGG ACGGCACCAA CGTGACCGTG AACGCGATCC ATCCCGGGGT GGTCCGCACG GGCATCTCGA CCCGCAACAC CGACGGCGCG CCCCAGGTCC CCGCCCAGGC CCGCGAACGC GCGCAACAGC GGATGATCTC GCCGGAGTTT TCCGCCGTCT ACCTCGCCAG CCTCGCCACC GCACCGGACT ACGAAGGCCA GAGCGGGCTC TACCTCGATA CCGACGAGAT CAAGCAGTCT CACGAGGCCA CCTATGACGA GGATTGGGCC TGGCAGATCT GGGAATGGAG CGCGCAGATC ACCGGGCTGA CCCCGGCCTC CGCCTGA
|
Protein sequence | MKYLVTDAAG AVGHRLAATL VAAGHEVLAF SDSPVNGVAS RAWPTRADVL ARALAGVDVI CAVDLATVAP DAAPALLKRL DKLRVAAGKA GCGRFVLLGR ADIYDPALLA DKTLLERSAT AAPADLAPPA AAALALEESL RAGAGLDWTI LRAPMILAPD SAEAKALFRR ILIDDAPSPA RFHPVDARDV ATALATLGTH DRAGGQVFNI AAPAALAART LDQELGRLAR LLNDEATPQD KLRPPHQVAD PVLATDKLAR LTGVTPTRPI WVSLAETMQQ VMKDLRADGT LPPLPEKINP VVKAVELGQK LLAGKTAVIT GVTSGIGRAT SLLLSRLGAT VVGIARNAEA GEAWEAALAE GRHTVPGHFI QADLMSFARI RTLAADLATR FPRIDILINN AGANFPTRRL TEDGVEATLA INHFAPFLLT NLLAGPLKAA PAARIIMVNS DWHRRSLPDM HDLQMESGYR TGEAYCRAKF MNLQITYAMA ALLDGTNVTV NAIHPGVVRT GISTRNTDGA PQVPAQARER AQQRMISPEF SAVYLASLAT APDYEGQSGL YLDTDEIKQS HEATYDEDWA WQIWEWSAQI TGLTPASA
|
| |