Gene Dshi_3817 details

Gene Information

Locus tag: Dshi_3817
Symbol:
ID: 5714346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009956
Strand:
Start bp: 25016
End bp: 26218
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 72%
IMG OID: 641276732
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001542028
Protein GI: 159046357
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
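
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 25016
end_bp = 26218

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1203 0 400
```

Under these assumptions the computed values agree with the listed gene length (1203 bp) and protein length (400 aa).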


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.342628
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.265145
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGACGCAG TGATTTGCGA TGGGGTCCGA ACCCCGATCG GGCGCTATGG CGGGGCGCTG 
TCCTCGGTGC GGGCCGATGA CCTCGCCGCC CTGCCCATCG CAGCCCTGAT GGCGCGCAAT
CCGGGTGTGG ACTGGGCCCG GGTCGACGAG GTGATCTATG GCGCCGCCAA CCAGGCCGGG
GAAGACAACC GCAACGTCGC GCGCATGGCC GCCCTGCTCG CCGGTCTGCC CGAGGAGGTG
CCCGGCCTCA CGGTGAACCG GCTCTGTGCC AGTGGCATGG ACGCGGTCGG CGCTGCCGCG
CGCGGGATCA AGGCCGGGGA ATATGACCTG GCCATCGCCG GCGGGATCGA GAGCATGAGC
CGCGCGCCCT TCGTCATGCC CAAGGCCGAG AGCGCGTTTA CCCGCGCCGC CACGGTCCAC
GACACCACCA TCGGCTGGCG CTTCGTCAAC CCGAAGATTG CGGCAATGCA TGGCATCGAT
ACGATGCCGC AAACCGCCGA CACCGTCGCC GCCGCCTACG AGATCAGCCG CGCCGACCAG
GACGCCTTCG CCGCGCGGTC CCAGGCCCGC TGGGCCGCCG CCGACGCAGC CGGGCTCTTT
GCCGACGAGA TCGTGCCGGT CCCGGTGCCC CAGCGCGGGA GTGCCCCGAT CCTCGTGGAC
CGGGACGAAC ACCCCCGCCC GGGCACCGAT GCCGCCCGGC TGGCCGGGCT GAAGGGCATC
AACGGGCCCG GTCTGTCGGT CACGGCGGGC AATGCCAGCG GCGTGAACGA CGGCGCCGCG
GCGCTGCTGA TCGCGTCGGC CGCCGCGGCG CGGGCCCATG GGCTGACCCC GATGGCGCGG
GTGGTCGGCA TGGCCTCCGC CGGGGTGGCG CCGCGTGTCA TGGGAATTGG CCCCGTGCCC
GCCAGCCGCA AGCTGCTGGA CCGCGCGGGC CTGACCCTCG ACCAGATGGA CGTGATCGAG
CTGAACGAGG CCTTCGCGAG CCAGAGCCTC GCGACACTGC GCCAGCTTGG CCTGGCCGAT
GACGATGTCA GGGTGAACCC CAATGGCGGC GCCATCGCCA TGGGCCATCC GCTGGGCATG
TCCGGCGCGC GGCTGGTGCT GACGGCGGCG CATCAGCTCA GGCGCACGGG CGGGCGCTAT
GCGCTCTGCA CCATGTGCGT CGGCGTGGGC CAGGGCACGG CCCTGATACT CGAACGCGTC
TGA
 
Protein sequence
MDAVICDGVR TPIGRYGGAL SSVRADDLAA LPIAALMARN PGVDWARVDE VIYGAANQAG 
EDNRNVARMA ALLAGLPEEV PGLTVNRLCA SGMDAVGAAA RGIKAGEYDL AIAGGIESMS
RAPFVMPKAE SAFTRAATVH DTTIGWRFVN PKIAAMHGID TMPQTADTVA AAYEISRADQ
DAFAARSQAR WAAADAAGLF ADEIVPVPVP QRGSAPILVD RDEHPRPGTD AARLAGLKGI
NGPGLSVTAG NASGVNDGAA ALLIASAAAA RAHGLTPMAR VVGMASAGVA PRVMGIGPVP
ASRKLLDRAG LTLDQMDVIE LNEAFASQSL ATLRQLGLAD DDVRVNPNGG AIAMGHPLGM
SGARLVLTAA HQLRRTGGRY ALCTMCVGVG QGTALILERV
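
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares, and reads an alternative start codon in the first position as methionine. The helper name translate_cds and the start codon set (ATG, GTG, TTG) are assumptions made for illustration; table 11 permits further start codons.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna, start_codons=("ATG", "GTG", "TTG")):
    """Translate a coding sequence; hypothetical helper, not a library function."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        if i == 0 and codon in start_codons:
            aa = "M"  # assumed: an alternative start codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases, 20 codons).
first_line = "GTGGACGCAG TGATTTGCGA TGGGGTCCGA ACCCCGATCG GGCGCTATGG CGGGGCGCTG"
print(translate_cds(first_line))  # prints: MDAVICDGVRTPIGRYGGAL
```

The output matches the first 20 residues of the protein sequence (MDAVICDGVR TPIGRYGGAL), with the leading GTG codon read as M.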