Gene Information |
Locus tag | SNSL254_A1427 |
Symbol | chbC |
ID | 6485817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1394180 |
End bp | 1395538 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642736819 |
Product | PTS system N,N'-diacetylchitobiose-specific transporter subunit IIC |
Protein accession | YP_002040573 |
Protein GI | 194446245 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component; [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
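The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1394180, 1395538)
print(length, protein_length(length))  # 1359 452
```

Both values agree with the Gene Length and Protein Length fields listed in the record.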
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.428261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACG TTATTGCTTC ACTTGAAAAG GTACTCCTTC CTTTTGCTGT AAAAATAGGA AAGCAGCCAC ACGTTAATGC AATCAAGAAC GGCTTTATTC GTTTAATGCC ATTAACCCTT GCGGGGGCGA TGTTTGTATT AATTAATAAC GTTTTCCTGA GCTTTGGGGA GGGGGCGTTT TTTTATTCCA TGGGCATTCG GCTTGATGCC TCAACCATTG AAACTCTTAA TGGATTGAAA GGTATCGGCG GGAACGTCTA TAACGGCACA TTGGGCATTA TGTCGCTCAT GGCGCCATTT TTTATCGGCA TGGCGCTGGC GGAAGAGCGT AAGGTTGATG CGCTGGCCGC CGGACTGCTC TCCGTGGCGG CATTTATGAC CGTAACGCCG TATAGTGTCG GCGAGGCCTA TGCCGTAGGC GCTAACTGGC TGGGCGGCGC GAATATTATC TCCGGTATTA TTATTGGCCT GGTGGTCGCG GAGATGTTTA CATTTGTTGT CCGGCGTAAC TGGGTGATTA AGCTACCGGA CAGCGTACCG GCTTCGGTGT CTCGTTCATT CTCAGCATTA ATTCCCGGCT TTATTATTCT CTCTATTATG GGGATTATTG CCTGGGCGCT GTCTAATTAC GGTTCTAACT TCCATCAGAT TATTATGGAC ACTATCTCTA CGCCGCTGGC ATCGCTGGGT AGCGTGGTAG GGTGGGCATA TGTCATTTTT GTACCGCTGC TGTGGTTCTT TGGTATTCAT GGTTCGCTGG CGCTGACCGC GCTGGACAGC GGCATCATGA CGCCCTGGGC GCTGGAAAAC ATCTCTATTT ACCAACAGTA TGGCTCCGTC GATGCGGCGC TGGAAGCCGG TAAAACGTTC CATATCTGGG CGAAACCGAT GCTGGATTCT TATATCTTCC TCGGTGGTAG CGGCGCAACG CTGGGTCTGA TCATCGCTAT CTTCCTGGCA TCCCGTCGCG CGGACTATCG CCAGGTGGCA AAACTGGCGC TGCCGTCAGG TATCTTCCAG ATTAACGAAC CCATCCTGTT TGGCCTGCCG ATTATTATGA ACCCGGTGAT GTTTATCCCC TTCATTCTGG TTCAGCCGAT TCTGGCGGCG ATTACGCTGG TGGCTTACTA TTTGGGTATT ATTCCACCGA TTACCAATAT TGCGCCGTGG ACCATGCCAA CCGGGTTGGG GGCGTTCTTT AACACCAACG GCAGTGTCGC CGCGTTGCTG GTTGCGCTAT TTAACCTGGC GGTCGCTACC CTGATTTATC TCCCCTTCGT GGTGGTGGCT AACAAAGCGC AGAACGCCAT CGAGCAGGAA GAAAGCGAAG AAGATATCGC TAACGCACTG AAATTCTAA
|
Protein sequence | MSNVIASLEK VLLPFAVKIG KQPHVNAIKN GFIRLMPLTL AGAMFVLINN VFLSFGEGAF FYSMGIRLDA STIETLNGLK GIGGNVYNGT LGIMSLMAPF FIGMALAEER KVDALAAGLL SVAAFMTVTP YSVGEAYAVG ANWLGGANII SGIIIGLVVA EMFTFVVRRN WVIKLPDSVP ASVSRSFSAL IPGFIILSIM GIIAWALSNY GSNFHQIIMD TISTPLASLG SVVGWAYVIF VPLLWFFGIH GSLALTALDS GIMTPWALEN ISIYQQYGSV DAALEAGKTF HIWAKPMLDS YIFLGGSGAT LGLIIAIFLA SRRADYRQVA KLALPSGIFQ INEPILFGLP IIMNPVMFIP FILVQPILAA ITLVAYYLGI IPPITNIAPW TMPTGLGAFF NTNGSVAALL VALFNLAVAT LIYLPFVVVA NKAQNAIEQE ESEEDIANAL KF
|
| |
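As a rough illustration of how the Gene sequence and Protein sequence fields relate, the sketch below strips the 10-character block spacing, computes a GC fraction, and translates codons up to the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, so a standard codon table is used here. Alternative start codons are not handled, which is sufficient for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); the codon-to-amino-acid
# assignments are the same in translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGAGTAACG TTATTGCTTC ACTTGAAAAG"
print(translate(first_blocks))  # MSNVIASLEK
```

Applying `translate` to the full Gene sequence should reproduce the Protein sequence field, and `gc_fraction` can be compared against the listed GC content. Neither full-length result is asserted here.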