Gene Daro_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2059 
Symbol 
ID3570181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2216451 
End bp2217482 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID637680533 
Productphospholipase/carboxylesterase 
Protein accessionYP_285273 
Protein GI71907686 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3509] Poly(3-hydroxybutyrate) depolymerase 
TIGRFAM ID[TIGR01840] esterase, PHB depolymerase family 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value0.550035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGGC GAATACGAAA AACGGTCTGG AGCCGATCAT TCCAGAGTGC ACTGAAGGCA 
ATGACGCGAA CGGCAATGCG TGTCGGCACC AAAGCAATCA AGCAAAGTTT GCGGGCCGCG
CCGCTGACGA GCCAACGCAA ACCCGTCAAA AAGGCGACGG CGCACTCGGC CAACTGGACG
AAGGGGATCG CCATCGGCGC GGCGGGGCCT CGTGGCTATC GACTCTATAA GCCATCCGGC
ATGCGGCGCA ATGAGACGCG GCCCTTGCTC GTCATGCTGC ACGGCTGCGG GCAGGATGCC
GAAGCGCTGG CTGCCAGTAG CCGGATGAAC ACGATTGCAG CCCGTGAGCG ATTTTTTGTG
CTCTACCCCG AGCAGGATCG CCTGTCGAAC GTGCAAGGTT GCTGGAACTG GTACGACACC
CGGACAGGCA GGGCGCAGGC CGAAGCCAAT TCGATCAGTG CGGCCATCGA GCAGATTTGC
CTGTCGCAAG CGGTAGACCG CAGTAAAGTT GCCCTGGCCG GAATTTCAGC CGGGGCGGGG
ATGGCGGTGT TGCTGGCGAC GCATCATCCG GAACGTTTTC GGGCCATTGC CATGCATTCG
GGAATCGCAC CGGGGGTTGC GCATTCGTCG GCCAGCGCCA TCAAGGCCAT GTTCGGGCAA
AGCGTGACAA GCTCGCCGCT CCCGGCCATT CCGCCCGACG TGAGGCTGCC GGCCTTATTG
GTCATCCACG GTGCAGCCGA CCACGTGGTG GCGCCGGGGA ATGGTGCCGA AGCGGCCATG
CGGTGGGGGG AACGGGTTGG CGCCAAGACG AGCAAGCCGC GCCTGGTGCA GCGCGGTGCG
CGTTATGCGG CGACCATTAC CGATTATCGA AAAAGCGGCC GCTTGGTGGC GACGCTCTGT
GCAGTCGACC GGCTTGGGCA CGCCTGGAGC GGTGGCGCCG CAGGGCATTC TTACAGCGAT
CCCAAAGGCC CGGACGCATC TCGAATGATC TGGTCTTTCG TCGCCAGGCA ATTTGCCCGT
TCGGCAGACT AG
 
Protein sequence
MARRIRKTVW SRSFQSALKA MTRTAMRVGT KAIKQSLRAA PLTSQRKPVK KATAHSANWT 
KGIAIGAAGP RGYRLYKPSG MRRNETRPLL VMLHGCGQDA EALAASSRMN TIAARERFFV
LYPEQDRLSN VQGCWNWYDT RTGRAQAEAN SISAAIEQIC LSQAVDRSKV ALAGISAGAG
MAVLLATHHP ERFRAIAMHS GIAPGVAHSS ASAIKAMFGQ SVTSSPLPAI PPDVRLPALL
VIHGAADHVV APGNGAEAAM RWGERVGAKT SKPRLVQRGA RYAATITDYR KSGRLVATLC
AVDRLGHAWS GGAAGHSYSD PKGPDASRMI WSFVARQFAR SAD