Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1454 |
Symbol | chbC |
ID | 6145980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1437885 |
End bp | 1439243 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616332 |
Product | PTS system N,N'-diacetylchitobiose-specific transporter subunit IIC |
Protein accession | YP_001743512 |
Protein GI | 170681270 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.262654 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATG TTATTGCATC GCTTGAAAAG GTACTCCTCC CTTTTGCAGT TAAAATAGGA AAGCAGCCAC ACGTTAATGC AATCAAAAAT GGCTTTATTC GCTTAATGCC GTTAACCCTT GCGGGGGCCA TGTTTGTATT AATTAACAAC GTTTTTCTAA GCTTTGGGGA GGGGTCGTTT TTTTATTCCT TAGGTATTCG CCTGGATGCC TCAACCATTG AAACACTTAA TGGTCTGAAA GGTATTGGCG GCAACGTATA TAACGGAACA TTAGGGATAA TGTCTTTAAT GGCACCGTTC TTTATTGGCA TGGCGCTGGC AGAAGAGCGT AAAGTCGATG CGCTGGCGGC CGGGTTGTTA TCCGTTGCAG CATTTATGAC CGTCACCCCA TATAGTGTCG GTGAGGCCTA TGCGGTTGGC GCAAACTGGT TAGGTGGGGC GAATATCATC TCCGGGATTA TTATTGGCCT GGTGGTGGCA GAAATGTTTA CCTTTATTGT CCGCCGCAAT TGGGTCATTA AATTGCCCGA TAGCGTACCT GCTTCAGTAT CGCGTTCCTT CTCGGCTTTA ATTCCCGGCT TTATTATTCT TTCCGTGATG GGGATTATTG CCTGGGCGTT GAATACCTGG GGCACCAACT TCCATCAGAT CATTATGGAT ACCATCTCAA CCCCACTGGC ATCGTTGGGT AGTGTGGTGG GCTGGGCCTA TGTGATCTTT GTTCCACTGC TCTGGTTCTT CGGTATTCAT GGTGCGCTGG CGCTGACCGC ACTGGACAAC GGCATTATGA CGCCTTGGGC GCTGGAAAAT ATCGCGACCT ATCAGCAATA TGGTTCCGTC GAAGCGGCGC TGGCAGCCGG TAAGACCTTC CATATCTGGG CCAAGCCGAT GCTGGACTCC TTTATTTTCC TTGGGGGCAG TGGTGCGACT TTAGGTCTGA TCCTGGCTAT CTTTATCGCC TCTCGCCGTG CTGATTATCG TCAGGTGGCA AAACTGGCGC TGCCGTCCGG CATCTTCCAG ATTAACGAAC CGATTCTGTT TGGTCTGCCA ATTATCATGA ACCCGGTGAT GTTTATCCCG TTTGTACTGG TACAACCGAT TCTGGCGGCA ATCACCCTCG CAGCGTACTA CATGGGCATT ATTCCTCCGG TGACCAATAT TGCACCGTGG ACCATGCCAA CCGGTCTGGG AGCCTTCTTT AACACCAACG GTAGCGTCGC CGCATTGCTG GTCGCACTCT TCAACCTTGG TATCGCAACG TTAATTTATC TGCCCTTTGT TGTGGTCGCT AACAAAGCAC AAAACGCGAT TGATAAAGAA GAGAGCGAAG AAGATATCGC TAACGCCCTG AAATTTTAA
|
Protein sequence | MSNVIASLEK VLLPFAVKIG KQPHVNAIKN GFIRLMPLTL AGAMFVLINN VFLSFGEGSF FYSLGIRLDA STIETLNGLK GIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDALAAGLL SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIIIGLVVA EMFTFIVRRN WVIKLPDSVP ASVSRSFSAL IPGFIILSVM GIIAWALNTW GTNFHQIIMD TISTPLASLG SVVGWAYVIF VPLLWFFGIH GALALTALDN GIMTPWALEN IATYQQYGSV EAALAAGKTF HIWAKPMLDS FIFLGGSGAT LGLILAIFIA SRRADYRQVA KLALPSGIFQ INEPILFGLP IIMNPVMFIP FVLVQPILAA ITLAAYYMGI IPPVTNIAPW TMPTGLGAFF NTNGSVAALL VALFNLGIAT LIYLPFVVVA NKAQNAIDKE ESEEDIANAL KF
|
| |