Gene YpsIP31758_4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4070 
SymbolbcsZ 
ID5385679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4586251 
End bp4587366 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content50% 
IMG OID640867098 
Productendo-1,4-D-glucanase 
Protein accessionYP_001403014 
Protein GI153950370 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTCA TGTTTAAACA CTTAGCCAGC ATGTTCCTGT TGCTGGCCAG TTTTAGTCTC 
GCAGCCGCCA GTAACTGGCC CGCGTGGCAA CAGTTCAAAC AAGATTACAT CAGTGAAGGG
GGGCGGATCA TTGACCCCGG TAGCCCCTCG AAAATAACCA CCTCGGAAGG CCAGAGTTAT
GGGCTTTTTT TTGCATTGGT TGCGGATGAT CAGCCGATGT TTGAGCGTTT ATTGGCTTGG
ACAGAAAACA ATCTGGCAGC CGGTGATCTC ACTTCCCGCC TTCCCGCTTG GTTATGGGGG
CAAAACTCAC AAAATAACTG GGATATTCTG GACCCTAATT CGGCCTCCGA TGCGGATATC
TTGATTGCCT ACAACTTGCT GGAGGCTGGC AGGTTATGGG GTAACCGCCG TTACCTGATT
ATGGGTACCT TATTACTGCA ACGTATTGCG CAAGAAGAAG TCATGGATAT TCCCGGCCTT
GGCCAGATGC TATTACCGGG AAAAATTGGT TTTAACGATG AGGATACCTG GCGTCTCAAC
CCAAGTTATT TACCGCCACA ACTACTGGCA CGATTTTCCT CCATAGACGG GCCTTGGGAA
GCGATGGTAG AAGTGAATCA GCGTATGTGG CTGGAAACCG CACCAAACGG TTTTTCGCCG
GACTGGGTGG TCTGGCAGAA AGGTAAAGGC TGGCAGCCCG ATACCATAAA ACCGGATGTC
GGCAGTAACG ATGCCATTCT GGTTTATCTG TGGGCGGGGA TGCTGGCAAT GGACAGCCCA
CAAAAAGCTG AATTGATTGC GCGTTTTCAG CCAATGGCGG TAATCACTCA GCAGCAAGGC
CTGCCACCGT TTACGACCAA CAGCGACAAT GGTAAAACTA ACGGGGATGG GTCAGTGGGT
TTTTCTGCGG CATTATTGCC CTTTTTAGCC AGCAGCCCAG AGCCATTTAA TCAGCAAACA
CTGAATCTCC AACAGCGACG GGTACAAAAT TCACCGCCTG GCGCTGATGA TTATTACAGT
GCTATTCTGA CCCTGTTTGG TCAGGGGTGG TTACAGCATC GTTATCATTT TACCCATCAG
GGAGAGCTAC AGCCCTCATG GCACCGTCAA CGTTAA
 
Protein sequence
MVVMFKHLAS MFLLLASFSL AAASNWPAWQ QFKQDYISEG GRIIDPGSPS KITTSEGQSY 
GLFFALVADD QPMFERLLAW TENNLAAGDL TSRLPAWLWG QNSQNNWDIL DPNSASDADI
LIAYNLLEAG RLWGNRRYLI MGTLLLQRIA QEEVMDIPGL GQMLLPGKIG FNDEDTWRLN
PSYLPPQLLA RFSSIDGPWE AMVEVNQRMW LETAPNGFSP DWVVWQKGKG WQPDTIKPDV
GSNDAILVYL WAGMLAMDSP QKAELIARFQ PMAVITQQQG LPPFTTNSDN GKTNGDGSVG
FSAALLPFLA SSPEPFNQQT LNLQQRRVQN SPPGADDYYS AILTLFGQGW LQHRYHFTHQ
GELQPSWHRQ R