Gene Daro_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2034 
Symbol 
ID3566752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2184790 
End bp2185809 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content57% 
IMG OID637680505 
Productbeta-hexosaminidase 
Protein accessionYP_285249 
Protein GI71907662 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value0.362016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCATT TACCGCTCGG CCCGCTGATG ATCGACATTA CTGGCACCGA ACTGACAGAT 
CTTGATCGCG AACGCCTTTG CCATTCATTG GTTGGTGGGA TCATCCTGTT TTCTCGAAAC
TATGCCAACC AGGATCAATT GCTAGAACTC TGTTCGGCAA TCCATACCTT GCGCTCGCCA
TCGCTGCTGA TTGCCGTTGA TCACGAGGGG GGCAGGGTTC AGCGTTTTCG CGACGGTTTT
ACCCGTCTGC CACCCATGGC CACGTTGGGG AAACTTTGGG ATAGGGATCC GCAGGCGGCA
CTTGTCGCCA CCCGCCAGAC TGGCTACGTG CTGGCCGCCG AACTTCGCGC CCGTGGCGTC
GATTATTCCT TTACACCGGT GCTTGATCTC GATTATGGCC CCTCACGCGT CATCGGCGAT
CGTGCTTTCC ACCGCCAACC GGACGCGGTA ATCGCGCTCG CCGCTGCGCT AGGTGAAGGT
CTGCGCCAGG CAGGCATGGG CAGTTGTGGC AAGCATTATC CGGGACACGG TTATGTCATC
CCCGATTCGC ATGTCGAACT GCCGGTCGAT GATCGTGCTT TCGAAGCAAT GCAGGAAGAT
ATCGCTCCCT ACCGGAATCT TCCGCTGGAT GGCGTGATGG CTGCCCATGT GATTTACAAC
TGCATGGACT GCAATACGGC TGTATTTTCA AATAAATGGA TAAGTTATTT GAGAAATGAC
ATTAAATTTA ACGGGGCGGT TTTCACCGAC GATTTATCGA TGGCCGGCGC CGGTGTGGTC
GGCGGCATGC TGTCTCGGGT CGAGACAGCT TACGCAGCCG GCTGCGACAT GCTGCTCGTG
TGTAATGCCC CTGATGTTGT CGGCGATGTG CTGGAAAACT GGAAGCCGGA AATCGATTTG
CGGCGCGGCA AACGGGTCGA GGCGCTGATC CCCAAGACGC CCGCCGTGCC TTGGCAAGTG
CTTCAGGCAG ACCCGGCTTA TCAGGCGGCC CAAAAGACCA TCGCCGAATT GATGGCCTGA
 
Protein sequence
MMHLPLGPLM IDITGTELTD LDRERLCHSL VGGIILFSRN YANQDQLLEL CSAIHTLRSP 
SLLIAVDHEG GRVQRFRDGF TRLPPMATLG KLWDRDPQAA LVATRQTGYV LAAELRARGV
DYSFTPVLDL DYGPSRVIGD RAFHRQPDAV IALAAALGEG LRQAGMGSCG KHYPGHGYVI
PDSHVELPVD DRAFEAMQED IAPYRNLPLD GVMAAHVIYN CMDCNTAVFS NKWISYLRND
IKFNGAVFTD DLSMAGAGVV GGMLSRVETA YAAGCDMLLV CNAPDVVGDV LENWKPEIDL
RRGKRVEALI PKTPAVPWQV LQADPAYQAA QKTIAELMA