Gene Daro_0557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0557 
Symbol 
ID3568882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp617654 
End bp618781 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID637679000 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_283784 
Protein GI71906197 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4692] Predicted neuraminidase (sialidase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones63 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.560584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCT TGTCGCCAGC AGTGATCCAC CATGGCAGCG CCGGGCTGTT TCTTGCTGCA 
CTGGCAGCCG CCTTCCTGCG CCTGCCAGCC ACCGAAGCAC CGGCCTTTGT GCCGCCCCCC
GTCGTCGCCA GCCCGCTGCC GGCCGACATG ACACGAAGCA ACCTGCCTTC AGCCGGTGCC
ACGGCCGGAC CGGCAAGCCT GACCCTGCTG GCCGACGGCA AGGTAGCCGC GGCCTGGCTG
GCCGGACCGG GTAACGATAA CTCGGCGGCG ACGATCTGGT TGTCGATCCT CGGTCGCAGC
GGCTGGAGCC AACCCTTGCC AGCCGCCACC CGCGAAAGTA CGGCCGCCGG CACTTTTGCC
CACATGAGCA GCCTGGGACG CCCGCTTTTG CTGGCCGAAG GCAGCTGGCT ACACCTCTGG
TACGAAAGCC TGCCGCTCGG CAGCGGGGCA GGGGCGGCCA TTGTCCATAG CCTTTCCACG
GATGGCGGCA AGACGTGGAG CAAAGCGGAA CGACTGCAAA CCTCGCCGCT CGGCACACTG
GGCAACGGAC TGGGCGGACC GCCTCTGATG CTTGCTGACG GCGGCCTTGG CCTGCCGCTC
GACCAGCGAT TTCCGAAGCA AGGCAGCGAG TGGCTGCGCC TGTCGGCGAC CGGCCGGATA
GTGGACAAGA GACGGCTGGC CCACGCGGCA CCAACGCTGC AACCGGCGGT TGTCGCCCTC
GACGACCACA GGGGGCTGGC GGTGCTCCGC GACAACCGCG CCGGCACCAG CCGAGCCACG
CTCAGCACGA CCAACGGCGG CCAGACATGG GAAACGGCCA GCGAACTCGC CCTGCCCGCC
CCGGACGCAC CTGTTGCGCT GCTCCGCCTG GCCAGCGGCC GCCTGCTGCT GGCCGGCAAC
CCGCAACAGG GCAAGGAAGC GCTGCAGCTG TGGCTCTCGG CCGACGATGG GCAAACCTGG
GCGATGAAAC GCATCGTCGA AGCGGCCAGC GATGGCGGGG CCGAATTCGC TGATCCGGCC
TTGCTGCAGG GGCGCGATGG CCGCATTCAT CTGACCTACA CCTGGCGCCA GCAGCAGATC
AGGTATGTCG CATTTACCGA AGCATGGCTG GCGGGAGGCG CACCATGA
 
Protein sequence
MQPLSPAVIH HGSAGLFLAA LAAAFLRLPA TEAPAFVPPP VVASPLPADM TRSNLPSAGA 
TAGPASLTLL ADGKVAAAWL AGPGNDNSAA TIWLSILGRS GWSQPLPAAT RESTAAGTFA
HMSSLGRPLL LAEGSWLHLW YESLPLGSGA GAAIVHSLST DGGKTWSKAE RLQTSPLGTL
GNGLGGPPLM LADGGLGLPL DQRFPKQGSE WLRLSATGRI VDKRRLAHAA PTLQPAVVAL
DDHRGLAVLR DNRAGTSRAT LSTTNGGQTW ETASELALPA PDAPVALLRL ASGRLLLAGN
PQQGKEALQL WLSADDGQTW AMKRIVEAAS DGGAEFADPA LLQGRDGRIH LTYTWRQQQI
RYVAFTEAWL AGGAP