Gene SeD_A3993 details


Gene Information

Locus tag: SeD_A3993
Symbol: bcsZ
ID: 6871194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3837193
End bp: 3838302
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 56%
IMG OID: 642786949
Product: endo-1,4-D-glucanase
Protein accession: YP_002217577
Protein GI: 198244736
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
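
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3837193
end_bp = 3838302

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1110 369
```

Both results match the reported Gene Length (1110 bp) and Protein Length (369 aa).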


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.985148
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.495711
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGACTA TGCTGCGCGG ATGGATAACG ATGATCGTCA TGCTGACGGC AATAAATGCG 
CAGGCGGCCT GTAGCTGGCC TGCGTGGGAA CAGTTCAAGA AAGATTACAT TAGCCAGCAG
GGACGCGTTA TCGATCCGGG CGATGCGCGA AAAATTACCA CCTCCGAAGG GCAAAGCTAC
GCCATGTTCT TTGCCCTGGC GGCGAACGAT CGACCGGCGT TCGCGCAACT GTTTAACTGG
ACGCAAAACA ATCTGGCGCA GGGATCGCTG CGTGAACATC TGCCCGCCTG GCTGTGGGGA
CAAAAAGATC CCGACACCTG GTCGGTGCTG GACAGCAACT CCGCGTCCGA CGGCGATATC
TGGATGGCAT GGTCGCTGCT GGAGGCCGGT CGCCTGTGGA AAGAGACGCG TTATACCGAG
GTGGGCACGG CGTTGCTAAA ACGCATCGCC CGCGAAGAGG TCGTGAATGT GCCGGGGCTG
GGCTCAATGC TGCTACCTGG CAAAATCGGC TTTGCCGAGG CGAATAGCTG GCGTTTTAAC
CCAAGCTATC TGCCGCCGCA GTTGGCGCAA TACTTTAGCC GTTTTGGCGC GCCGTGGTCG
ACGCTACGGG AAACCAATTT GCGGCTTTTG CTGGAAACCG CGCCGAAAGG TTTCTCGCCG
GACTGGGTGC GTTATGAAAG CAAGCAAGGC TGGCAGTTGA AAGCGGAAAA GACGCTGATC
AGTAGCTACG ATGCGATTCG CGTCTATTTA TGGACGGGAA TGATGCATGA TGGCGATCCG
CAAAAAGCGC GTTTACTGGC GCGATTTAAA CCGATGGCGA CGTTAACGAT GAAAAACGGC
GTTCCACCGG AGAAAGTGGA TGTCGTCAGC GGGAATGCGC AGGGGACGGG GCCGGTCGGG
TTTTCCGCCG CTTTACTGCC TTTCCTGCAA AATCGCGACG CCCAGGCCGT GCAGCGACAG
CGGGTCGCAG ACCATTTTCC TGGCAGCGAT GCCTATTACA ACTATGTGCT GACTCTCTTT
GGACAAGGCT GGGATCAGCA CCGTTTTCGC TTCACCGTCA AAGGTGAATT ATTACCTGAC
TGGGGCCAGG AATGCGTAAG TTCACGTTAA
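
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be cleaned before any analysis. The following is a minimal sketch, not part of the original record: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used to keep the example short. The GC content of 56% reported in the record refers to the whole gene, so a single line need not match it.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGATGACTA TGCTGCGCGG ATGGATAACG ATGATCGTCA TGCTGACGGC AATAAATGCG"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence in place of `first_line` would give the length and GC fraction for the whole gene.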
 
Protein sequence
MMTMLRGWIT MIVMLTAINA QAACSWPAWE QFKKDYISQQ GRVIDPGDAR KITTSEGQSY 
AMFFALAAND RPAFAQLFNW TQNNLAQGSL REHLPAWLWG QKDPDTWSVL DSNSASDGDI
WMAWSLLEAG RLWKETRYTE VGTALLKRIA REEVVNVPGL GSMLLPGKIG FAEANSWRFN
PSYLPPQLAQ YFSRFGAPWS TLRETNLRLL LETAPKGFSP DWVRYESKQG WQLKAEKTLI
SSYDAIRVYL WTGMMHDGDP QKARLLARFK PMATLTMKNG VPPEKVDVVS GNAQGTGPVG
FSAALLPFLQ NRDAQAVQRQ RVADHFPGSD AYYNYVLTLF GQGWDQHRFR FTVKGELLPD
WGQECVSSR