Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_02934 |
Symbol | ygjD |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 3075612 |
End bp | 3076625 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | ACT44738 |
Protein GI | 253979068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000155968 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACTTCCTGC GATGAAACCG GCATCGCCAT TTACGACGAT GAAAAAGGTT TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTGCACGC TGACTACGGC GGCGTCGTGC CTGAACTGGC CTCCCGCGAT CATGTGCGTA AAACCGTACC GTTGATCCAG GCGGCGCTAA AGGAGTCTGG TTTAACGGCA AAAGACATTG ATGCTGTGGC CTATACCGCA GGCCCTGGAT TAGTCGGCGC GCTACTGGTT GGCGCGACCG TGGGGCGTTC TCTGGCGTTT GCCTGGGACG TTCCGGCGAT CCCTGTACAC CATATGGAAG GGCATCTGTT AGCGCCGATG CTGGAAGATA ACCCGCCGGA ATTTCCGTTT GTTGCGCTGC TTGTTTCCGG CGGTCATACG CAGTTAATCA GCGTGACTGG CATTGGTCAG TACGAGCTGC TCGGCGAGTC TATCGATGAT GCCGCCGGGG AAGCGTTTGA TAAAACCGCG AAGCTGCTGG GGCTGGATTA TCCTGGCGGG CCGTTACTGT CGAAAATGGC GGCTCAGGGT ACTGCCGGGC GCTTTGTCTT CCCGCGTCCG ATGACCGACC GTCCGGGGCT GGATTTCAGC TTCTCCGGCC TGAAAACCTT CGCGGCAAAT ACCATTCGTG ACAACGGCAC CGACGACCAG ACGCGTGCTG ATATCGCCCG CGCCTTTGAA GATGCGGTGG TCGATACGCT GATGATTAAG TGCAAGCGGG CGCTGGATCA GACGGGCTTT AAGCGACTGG TCATGGCGGG CGGCGTGAGT GCTAACCGTA CGTTACGGGC GAAGCTGGCT GAAATGATGA AAAAACGCCG CGGCGAAGTG TTCTACGCGC GTCCGGAATT TTGTACTGAT AACGGCGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AAGCAGGCGC GACGGCGGAT CTCGGCGTTA GCGTGCGTCC GCGCTGGCCG CTGGCGGAGT TACCGGCTGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD EKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ AALKESGLTA KDIDAVAYTA GPGLVGALLV GATVGRSLAF AWDVPAIPVH HMEGHLLAPM LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PLLSKMAAQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRDNGTDDQ TRADIARAFE DAVVDTLMIK CKRALDQTGF KRLVMAGGVS ANRTLRAKLA EMMKKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGATAD LGVSVRPRWP LAELPAA
|
| |