Gene Information |
Locus tag | ECD_04171 |
Symbol | sgcX |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 4439033 |
End bp | 4440154 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | predicted endoglucanase with Zn-dependent exopeptidase domain |
Protein accession | ACT45954 |
Protein GI | 253980284 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
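The length fields in the record above can be cross-checked against each other. The sketch below is illustrative and not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Under those assumptions the coordinates reproduce the listed 1122 bp and 373 aa. A small GC-content helper is included for the same kind of check against the sequence field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of this length, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between sequence blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record: Start bp 4439033, End bp 4440154.
length_bp = gene_length(4439033, 4440154)
print(length_bp)                           # 1122
print(expected_protein_length(length_bp))  # 373
```

Passing the Gene sequence field to `gc_percent` should give a value that can be compared with the rounded GC content listed in the record.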
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTTT CTGTGCAGGA AACCCTGTTC TCGCTTTTGC AGCACAATGC GATTTCAGGA CACGAAAACG CTGTCGCTGA CGTCATGCTG TGCGAATTCA GGCGTCAGGC AAAAGAGGTC TGGCGAGACA GGCTTGGGAA TGTCGTCGCG CGCTACGGTA GTGATAAACC CGATGCGCTG CGACTGATGA TTTTTGCGCA TATGGATGAA GTCGGTTTTA TGGTGCGCAA AATAGAGCCG TCTGGATTTT TACGCTTTGA ACGCGTAGGC GGTCCTGCGC AGGTCACTAT GGCTGGTTCC ATCGTCACCC TCACCGGGGA CAAAGGGCCA GTCATGGGGT GTATCGGCAT TAAGTCCTAC CACTTTGCCA AAGGCGACGA GCGCACGCAG TCACCTTCTG TCGACAAACT GTGGATTGAT ATTGGTGCCA AAGACAAAGA CGACGCTATA CGGATGGGCA TTCAGGTCGG TACGCCTGTC ACTCTGTATA ACCCGCCGCA ACTCCTGGCA AACGATCTGG TGTGCAGTAA AGCACTAGAC GATCGTCTAG GCTGTACTGC CCTGCTCGGT GTAGCGGATG CTATCAGTAC TATGGAGCTT GATATCGCCG TTTATCTGGT GGCTTCGGTA CAGGAAGAAT TTAATATCCG CGGCATTGTT CCCGTATTAC GCCGTGTAAA ACCTGACCTG GCGATTGGTA TTGATATCAC TCCATCGTGT GACACCCCTG ATTTACACGA TTATTCCGAG GTCAGGATTA ATCAGGGCGT TGGGATCACC TGCCTGAACT ACCATGGTCG GGGAACGCTG GCCGGATTAA TCACGCCTCC TCGCCTGATA CGGATGTTGG AACAGACGGC TCTTGAACAC AACATTCCGG TGCAGCGAGA AGTGGCTCCC GGCGTGATAA CAGAAACCGG TTATATCCAG GTTGAGCAGG ATGGTATTCC CTGCGCCAGT CTCTCTATTC CTTGTCGCTA TACCCATTCT CCGGCGGAGG TTGCCAGCCT GCGTGATTTG ACTGATTGCA TCCGCCTATT GACCGCTCTG GCAGGTATGT CAGCAGCACA TTTCCCCGTT GAGCCTGATT CAGGCACTAC ACAAGAGGCA CATCCATTAT GA
|
Protein sequence | MSFSVQETLF SLLQHNAISG HENAVADVML CEFRRQAKEV WRDRLGNVVA RYGSDKPDAL RLMIFAHMDE VGFMVRKIEP SGFLRFERVG GPAQVTMAGS IVTLTGDKGP VMGCIGIKSY HFAKGDERTQ SPSVDKLWID IGAKDKDDAI RMGIQVGTPV TLYNPPQLLA NDLVCSKALD DRLGCTALLG VADAISTMEL DIAVYLVASV QEEFNIRGIV PVLRRVKPDL AIGIDITPSC DTPDLHDYSE VRINQGVGIT CLNYHGRGTL AGLITPPRLI RMLEQTALEH NIPVQREVAP GVITETGYIQ VEQDGIPCAS LSIPCRYTHS PAEVASLRDL TDCIRLLTAL AGMSAAHFPV EPDSGTTQEA HPL
|
| |
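The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: `translate_cds` and the table-building code are hypothetical names. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in the permitted start codons), and it assumes the gene sequence is listed in coding orientation, which matches the ATG start and TGA end shown above even though the gene lies on the minus strand.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each codon position).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first because the record prints the sequence
    in space-separated blocks of 10 bases.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
print(translate_cds("ATGTCATTTT CTGTGCAGGA AACCCTGTTC"))  # MSFSVQETLF
```

Applied to the full Gene sequence field, the result can be compared with the Protein sequence field after removing the spaces between its 10-residue blocks.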