Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECD_01706 |
Symbol | celB |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | - |
Start bp | 1764269 |
End bp | 1765627 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | N,N'-diacetylchitobiose-specific enzyme IIC component of PTS |
Protein accession | ACT43561 |
Protein GI | 253977891 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG TTATTGCATC GCTTGAAAAG GTACTCCTCC CTTTTGCAGT TAAAATAGGA AAGCAGCCAC ACGTTAATGC AATCAAAAAT GGCTTTATTC GCTTAATGCC GTTAACCCTT GCGGGGGCCA TGTTTGTATT AATTAACAAC GTTTTTCTAA GCTTTGGGGA GGGGTCGTTT TTTTATTCCT TAGGTATTCG CCTGGATGCC TCAACCATTG AAACACTTAA TGGTCTGAAA GGTATTGGCG GCAACGTATA TAACGGAACA TTAGGGATAA TGTCTTTAAT GGCACCGTTC TTTATTGGCA TGGCGCTGGC AGAAGAGCGT AAAGTCGATG CGCTGGCGGC TGGGTTGTTA TCCGTTGCAG CATTTATGAC CGTCACCCCA TATAGTGTCG GTGAGGCCTA TGCGGTTGGC GCAAACTGGT TAGGTGGGGC GAATATCATC TCCGGGATTA TTATTGGCCT GGTGGTGGCA GAAATGTTTA CCTTTATTGT CCGCCGCAAT TGGGTCATTA AACTGCCCGA CAGCGTACCT GCTTCAGTAT CGCGTTCCTT CTCGGCATTA ATTCCCGGCT TTATTATTCT TTCCGTGATG GGGATTATTG CCTGGGCGTT GAATACCTGG GGCACCAACT TCCATCAGAT CATTATGGAT ACCATCTCAA CTCCACTGGC ATCGTTGGGT AGCGTGGTGG GCTGGGCCTA TGTGATCTTT GTTCCACTGC TCTGGTTCTT CGGTATTCAT GGCGCGCTGG CGCTGACCGC ACTGGACAAC GGCATTATGA CGCCGTGGGC ACTGGAAAAT ATCGCGACCT ATCAGCAATA TGGTTCCGTC GAAGCGGCGC TGGCAGCCGG TAAGACCTTC CATATCTGGG CCAAGCCGAT GCTGGACTCC TTTATTTTCC TTGGGGGCAG TGGTGCGACT TTAGGCCTGA TCCTGGCTAT CTTTATCGCC TCTCGCCGTG CTGATTATCG TCAGGTGGCA AAACTGGCGC TGCCGTCCGG CATCTTCCAG ATTAACGAAC CGATTCTGTT TGGTCTGCCA ATTATCATGA ACCCGGTGAT GTTTATCCCG TTTGTACTGG TACAACCGAT TCTGGCGGCA ATCACCCTCG CAGCGTACTA CATGGGCATT ATTCCTCCGG TGACCAATAT TGCACCGTGG ACCATGCCAA CCGGTCTGGG AGCCTTCTTT AACACCAACG GTAGCGTCGC CGCATTGCTG GTCGCACTCT TCAACCTTGG CATCGCAACG TTAATTTATC TGCCCTTTGT TGTGGTGGCT AACAAAGCAC AAAATGCGAT TGATAAAGAA GAGAGCGAAG AAGATATCGC TAACGCCCTG AAATTTTAA
|
Protein sequence | MSNVIASLEK VLLPFAVKIG KQPHVNAIKN GFIRLMPLTL AGAMFVLINN VFLSFGEGSF FYSLGIRLDA STIETLNGLK GIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDALAAGLL SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIIIGLVVA EMFTFIVRRN WVIKLPDSVP ASVSRSFSAL IPGFIILSVM GIIAWALNTW GTNFHQIIMD TISTPLASLG SVVGWAYVIF VPLLWFFGIH GALALTALDN GIMTPWALEN IATYQQYGSV EAALAAGKTF HIWAKPMLDS FIFLGGSGAT LGLILAIFIA SRRADYRQVA KLALPSGIFQ INEPILFGLP IIMNPVMFIP FVLVQPILAA ITLAAYYMGI IPPVTNIAPW TMPTGLGAFF NTNGSVAALL VALFNLGIAT LIYLPFVVVA NKAQNAIDKE ESEEDIANAL KF
|
| |