Gene Information |
Locus tag | Dshi_3817 |
Symbol | |
ID | 5714346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | + |
Start bp | 25016 |
End bp | 26218 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641276732 |
Product | beta-ketoadipyl CoA thiolase |
Protein accession | YP_001542028 |
Protein GI | 159046357 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases; [TIGR02430] beta-ketoadipyl CoA thiolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.342628 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.265145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGCAG TGATTTGCGA TGGGGTCCGA ACCCCGATCG GGCGCTATGG CGGGGCGCTG TCCTCGGTGC GGGCCGATGA CCTCGCCGCC CTGCCCATCG CAGCCCTGAT GGCGCGCAAT CCGGGTGTGG ACTGGGCCCG GGTCGACGAG GTGATCTATG GCGCCGCCAA CCAGGCCGGG GAAGACAACC GCAACGTCGC GCGCATGGCC GCCCTGCTCG CCGGTCTGCC CGAGGAGGTG CCCGGCCTCA CGGTGAACCG GCTCTGTGCC AGTGGCATGG ACGCGGTCGG CGCTGCCGCG CGCGGGATCA AGGCCGGGGA ATATGACCTG GCCATCGCCG GCGGGATCGA GAGCATGAGC CGCGCGCCCT TCGTCATGCC CAAGGCCGAG AGCGCGTTTA CCCGCGCCGC CACGGTCCAC GACACCACCA TCGGCTGGCG CTTCGTCAAC CCGAAGATTG CGGCAATGCA TGGCATCGAT ACGATGCCGC AAACCGCCGA CACCGTCGCC GCCGCCTACG AGATCAGCCG CGCCGACCAG GACGCCTTCG CCGCGCGGTC CCAGGCCCGC TGGGCCGCCG CCGACGCAGC CGGGCTCTTT GCCGACGAGA TCGTGCCGGT CCCGGTGCCC CAGCGCGGGA GTGCCCCGAT CCTCGTGGAC CGGGACGAAC ACCCCCGCCC GGGCACCGAT GCCGCCCGGC TGGCCGGGCT GAAGGGCATC AACGGGCCCG GTCTGTCGGT CACGGCGGGC AATGCCAGCG GCGTGAACGA CGGCGCCGCG GCGCTGCTGA TCGCGTCGGC CGCCGCGGCG CGGGCCCATG GGCTGACCCC GATGGCGCGG GTGGTCGGCA TGGCCTCCGC CGGGGTGGCG CCGCGTGTCA TGGGAATTGG CCCCGTGCCC GCCAGCCGCA AGCTGCTGGA CCGCGCGGGC CTGACCCTCG ACCAGATGGA CGTGATCGAG CTGAACGAGG CCTTCGCGAG CCAGAGCCTC GCGACACTGC GCCAGCTTGG CCTGGCCGAT GACGATGTCA GGGTGAACCC CAATGGCGGC GCCATCGCCA TGGGCCATCC GCTGGGCATG TCCGGCGCGC GGCTGGTGCT GACGGCGGCG CATCAGCTCA GGCGCACGGG CGGGCGCTAT GCGCTCTGCA CCATGTGCGT CGGCGTGGGC CAGGGCACGG CCCTGATACT CGAACGCGTC TGA
|
Protein sequence | MDAVICDGVR TPIGRYGGAL SSVRADDLAA LPIAALMARN PGVDWARVDE VIYGAANQAG EDNRNVARMA ALLAGLPEEV PGLTVNRLCA SGMDAVGAAA RGIKAGEYDL AIAGGIESMS RAPFVMPKAE SAFTRAATVH DTTIGWRFVN PKIAAMHGID TMPQTADTVA AAYEISRADQ DAFAARSQAR WAAADAAGLF ADEIVPVPVP QRGSAPILVD RDEHPRPGTD AARLAGLKGI NGPGLSVTAG NASGVNDGAA ALLIASAAAA RAHGLTPMAR VVGMASAGVA PRVMGIGPVP ASRKLLDRAG LTLDQMDVIE LNEAFASQSL ATLRQLGLAD DDVRVNPNGG AIAMGHPLGM SGARLVLTAA HQLRRTGGRY ALCTMCVGVG QGTALILERV
|
| |
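The Start bp, End bp, Gene Length, and Protein Length fields in this record can be cross-checked against one another. The sketch below is a minimal illustration and not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon (the gene sequence above ends in TGA). A small GC-content helper is included for use with the gene sequence; it ignores the spaces that separate the 10-base blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are skipped, so the
    space-separated blocks used in this record can be passed as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Values taken from the Start bp and End bp fields of this record.
    length = gene_length(25016, 26218)
    print(length, protein_length(length))  # prints: 1203 400
```

Both results match the Gene Length (1203 bp) and Protein Length (400 aa) fields. Passing the full Gene sequence value to `gc_percent` gives a figure to compare with the GC content field.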