Gene Daro_3128 details


Gene Information

Locus tag: Daro_3128
Symbol:
ID: 3568179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 3371713
End bp: 3373434
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 62%
IMG OID: 637681599
Product: sulfate thiol esterase SoxB
Protein accession: YP_286328
Protein GI: 71908741
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
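
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3371713, 3373434)
print(length)                     # 1722
print(protein_length_aa(length))  # 573
```

Both values agree with the Gene Length and Protein Length fields of this record.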


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value: 0.304524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATGA ATCGTCGTGA ATTCCTTCAG GTCATGGCCG TTGCCGCAGC CGGCGGCATG 
TCCCTGCACA GCGAGCTGGC CATGGCCGAA AAAGGCGCCG CCAAGCTGTA CGACCTGCCC
AAATTCGGCA ATGTCAGCCT GTTGCATATC ACCGACTGTC ATGCCCAGTT GCTGCCCATC
TACTTCCGCG AACCGAACGT CAATCTCGGT TTCGGCGACC AGTTCGGCAA GGTGCCGCAC
CTGGTGGGTG ACAACCTGCT CAAGCATTTC GGTTTCAAAC CGAACACCAT CGAGGCGCAC
GCCTATACCT ATCTGAATTT CGAGCAGGCC GCCAAGACCT ACGGCAAGGT CGGCGGCTTT
GCCCACCTCG CCACGCTGGT CAAGCGCATG AAAGCCAATC GGCCCGGCGC GCTGCTGCTC
GACGGCGGCG ACACCTGGCA AGGCTCCGGC ACGGCGCTGT GGTCGAACGC GCAGGACATG
GTCGACGCCT GCAAGGCGCT CGGCGTCAAT GTCATGACCC TGCACTGGGA ATCGACCTAC
GGCGAGGCCC GGGTCAAGGA AATCGAGGAA AAGGATTTCG CCGGCCAGAT CGACATCGTC
GCCCAGAACG TCAAGACCAC CGATTTCGGC GATGCCGTCT TCAAGCCCTT CGTGATGAAG
AACATGAACG GCGTGCCGGT CGCCATCATC GGCCAGGCCT TCCCCTACAC GCCGATTGCC
AACCCGCGCT GGCAGACGCC GAACTGGAGC TTCGGCATCC AGGAAGAGAA CATGCAGAAG
ACCGTCGACG AAGCCCGCGC CGCCGGTGCG CAGGTCGTTG TCGTGCTGTC GCACAACGGC
ATGGACGTGG ACCTCAAGAT GGCTTCGCGC ATCAAGGGTA TCGACGCCAT CCTCGGCGGC
CACACCCACG ACGGCATGCC GGCACCGGTC GTTGTCAAGA ATGCCGGTGG CCAGACTCTG
GTCACCAATG CCGGCTCCAA CGGCAAGTAC CTCGGTGTGC TCGATTTCGA CGTCAAGAAT
GGCAAGATCG CCGACTTCCG CTACAAACTG CTGCCGGTCT TCGCCAACCT GCTGCCGGCC
GACAAGGACA TGCAGACGTT GATCGACAAG GCGCGTGCAC CGTACCTGTC CAAGCTCAAC
GAAAAGTTGG CCATCTCCGA AGGCACGCTG TATCGCCGCG GCAATTTCAA CGGCACCTTC
GATCAGGTCA TTCTCGATGC GCTGATGAAG GTCAAGGACG CCGAAATCGC CTTCTCGCCC
GGCTTCCGCT GGGGCACCTC GCTGCTGCCC GGCCAGCCCA TCCTGATGGA ACACGTGCTC
GACCAGACCG CGATCACTTA CCCATGGACG ACGGTGACCA ACATGAGCGG CGAGATGATC
AAAACGGTGC TCGAAGATGT CTGCGACAAC CTGTTCAATC CGGACCCGTA CTACCAGCAA
GGGGGCGACA TGGTGCGCGT TGGCGGCCTG CAATGGACCT GCGAGCCGAC CGCCAGGATG
GGCCAGCGCA TCCAGAACAT GATGCTCAAG GGCAAGCCCA TTGACCCAGC CAAGACCTAC
AAGGTCGCGG GCTGGGCGCC GGTATCCGAG GAGGCCAAGG GGTCTGGCGC GCCGATTTGG
GACGTTGTGG CCGAATACCT CCGCGACATC AAGACCGTCA AACCGGTGAG TCTCAACCTG
CCGACCCTGA AGGGCGCGGC GAACAACCCG GGGATTGCAT GA
 
Protein sequence
MSMNRREFLQ VMAVAAAGGM SLHSELAMAE KGAAKLYDLP KFGNVSLLHI TDCHAQLLPI 
YFREPNVNLG FGDQFGKVPH LVGDNLLKHF GFKPNTIEAH AYTYLNFEQA AKTYGKVGGF
AHLATLVKRM KANRPGALLL DGGDTWQGSG TALWSNAQDM VDACKALGVN VMTLHWESTY
GEARVKEIEE KDFAGQIDIV AQNVKTTDFG DAVFKPFVMK NMNGVPVAII GQAFPYTPIA
NPRWQTPNWS FGIQEENMQK TVDEARAAGA QVVVVLSHNG MDVDLKMASR IKGIDAILGG
HTHDGMPAPV VVKNAGGQTL VTNAGSNGKY LGVLDFDVKN GKIADFRYKL LPVFANLLPA
DKDMQTLIDK ARAPYLSKLN EKLAISEGTL YRRGNFNGTF DQVILDALMK VKDAEIAFSP
GFRWGTSLLP GQPILMEHVL DQTAITYPWT TVTNMSGEMI KTVLEDVCDN LFNPDPYYQQ
GGDMVRVGGL QWTCEPTARM GQRIQNMMLK GKPIDPAKTY KVAGWAPVSE EAKGSGAPIW
DVVAEYLRDI KTVKPVSLNL PTLKGAANNP GIA
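
The sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such a block layout could be joined into a continuous string, split into codons, and summarized by GC content. The helper names are hypothetical, and only the first line of the gene sequence is used as input here; pasting the complete Gene sequence would allow a comparison against the Gene Length and GC content fields of the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def codons(seq: str) -> list[str]:
    """Split a nucleotide string into consecutive triplets."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


# First line of the gene sequence from this record.
first_line = "ATGAGCATGA ATCGTCGTGA ATTCCTTCAG GTCATGGCCG TTGCCGCAGC CGGCGGCATG"
seq = join_blocks(first_line)

print(len(seq))                   # 60
print(codons(seq)[0])             # ATG
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```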