Gene Information |
Locus tag | EcE24377A_1958 |
Symbol | chbC |
ID | 5590299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1945478 |
End bp | 1946836 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640925630 |
Product | PTS system N,N'-diacetylchitobiose-specific transporter subunit IIC |
Protein accession | YP_001463033 |
Protein GI | 157159318 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component; [TIGR00410] PTS system, lactose/cellobiose family IIC component |
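The coordinate and length fields in this table can be cross-checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the reported gene length includes one terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1945478
end_bp = 1946836
gene_length_bp = 1359
protein_length_aa = 452

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # True
print(remainder == 0)                      # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three fields agree: a 1359 bp span holds 453 codons, of which 452 encode amino acids.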
| |
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG TTATTGCATC GCTTGAAAAG GTACTCCTCC CTTTTGCAGT TAAAATAGGA AAGCAGCCAC ACGTTAATGC AATCAAAAAT GGCTTTATTC GCTTAATGCC GTTAACCCTT GCGGGGGCCA TGTTTGTATT AATTAACAAC GTTTTTCTAA GCTTTGGGGA GGGGTCATTT TTTTATTCCT TAGGTATTCG CCTAGATGCC TCAACCATTG AAACACTTAA TGGCCTGAAA GGTATTGGCG GCAACGTATA TAACGGAACA TTAGGGATAA TGTCTTTAAT GGCACCGTTC TTTATTGGCA TGGCGCTGGC AGAAGAGCGT AAAGTCGATG CGCTGGCGGC TGGGTTGTTA TCCGTTGCAG CATTTATGAC CGTCACCCCA TATAGTGTCG GTGAGGCCTA TGCGGTTGGC GCAAACTGGT TAGGTGGAGC GAATATCATC TCCGGGATTA TTATTGGCCT GGTGGTGGCA GAAATGTTTA CCTTTATTGT CCGCCGCAAT TGGGTCATTA AATTGCCCGA TAGCGTACCT GCTTCAGTAT CGCGTTCCTT CTCGGCATTA ATTCCCGGCT TTATTATTCT TTCCGTGATG GGGATTATTG CCTGGGCGTT GAATACCTGG GGCACCAACT TCCATCAGAT CATTATGGAT ACCATCTCAA CCCCACTGGC ATCGTTGGGT AGCGTGGTGG GCTGGGCCTA TGTGATCTTT GTTCCACTGC TCTGGTTCTT CGGTATTCAT GGCGCGCTGG CGCTGACCGC ACTGGACAAC GGCATTATGA CGCCGTGGGC GCTGGAAAAT ATCGCGACCT ATCAGCAATA TGGTTCCGTC GAAGCGGCGC TGGCAGCCGG TAAGACCTTC CATATCTGGG CCAAGCCGAT GCTGGACTCC TTTATTTTCC TTGGGGGCAG TGGTGCGACT TTAGGCCTGA TCCTGGCTAT CTTTATCGCC TCTCGCCGTG CTGATTATCG TCAGGTGGCA AAACTGGCGC TGCCGTCCGG CATCTTCCAG ATTAACGAAC CGATTCTGTT TGGTCTGCCA ATTATCATGA ACCCGGTGAT GTTTATCCCG TTTGTACTGG TACAACCGAT TCTGGCGGCA ATCACCCTCG CAGCGTACTA CATGGGCATC ATTCCTCCGG TGACCAATAT TGCACCGTGG ACCATGCCAA CCGGTCTGGG AGCCTTCTTT AACACCAACG GTAGCGTCGC CGCATTGCTG GTCGCACTCT TCAACCTTGG CATCGCAACG TTAATTTATC TGCCCTTTGT TGTGGTGGCT AACAAAGCAC AAAATGCGAT TGATAAAGAA GAGAGCGAAG AAGATATCGC TAACGCCCTG AAATTTTAA
|
Protein sequence | MSNVIASLEK VLLPFAVKIG KQPHVNAIKN GFIRLMPLTL AGAMFVLINN VFLSFGEGSF FYSLGIRLDA STIETLNGLK GIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDALAAGLL SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIIIGLVVA EMFTFIVRRN WVIKLPDSVP ASVSRSFSAL IPGFIILSVM GIIAWALNTW GTNFHQIIMD TISTPLASLG SVVGWAYVIF VPLLWFFGIH GALALTALDN GIMTPWALEN IATYQQYGSV EAALAAGKTF HIWAKPMLDS FIFLGGSGAT LGLILAIFIA SRRADYRQVA KLALPSGIFQ INEPILFGLP IIMNPVMFIP FVLVQPILAA ITLAAYYMGI IPPVTNIAPW TMPTGLGAFF NTNGSVAALL VALFNLGIAT LIYLPFVVVA NKAQNAIDKE ESEEDIANAL KF
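The relationship between the gene sequence and the protein sequence can be illustrated with a minimal standard-library sketch. It assumes the gene sequence is given in the coding orientation (as it appears to be here, since it begins with ATG) and that internal codons of translation table 11 map to the same amino acids as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and do not come from the database. Only the first three 10-base blocks of the record are used as input.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code
# and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAGTAATG TTATTGCATC GCTTGAAAAG"
print(translate(first_blocks))  # MSNVIASLEK
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would be one way to compare against the GC content and protein length fields in the record.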