Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1634 |
Symbol | |
ID | 6143947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1624014 |
End bp | 1625336 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641616510 |
Product | PTS system lactose/cellobiose family IIC subunit |
Protein accession | YP_001743688 |
Protein GI | 170680463 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTAA TGGCCTCATT CGAACGTGGA ATGGAACGTT TTCTTGTTCC AGTTGCTATC AAGTTAAACT CACAAAAACA TGTTGCAGCG GTGAGAGATG GATTCGTTTT TACGTTTCCA ATTATCATGG CAAGCTCATT AATTATATTA ATTAACTTTG CCATATTATC GCCCGACGGC TTTATTGCCG GGCTGCTGCA TCTGAACAGC GTTTTCCCCA ACCTTGAAAA AGCACAAGCT ATTTTTACTC CGGTAATGAA TGGTTCTGTA AATATCATGT CAATTATGAT TGCTTTCCTG GTCGCCAGGA ATATGGCGAT TAGCTATGAG CAAGATGATC TTTTATGCGG ATTAACGGCA ATAGGAGCAT TTTTTATTGT ATATACGCCA TATCAGATGA TAGATGGGCA AGCATTCCTG ACGACCAAAT ATCTCGGCGC GCAGGGGTTA TTTGTTGCTG TTATCGTTGC ACTGATCACC AGTGAAATAT TTTGTCGCTT AGCTCGAAAC CCCAAAATCA CCATCACGAT GCCGGCAGCT GTACCTCCTG CGGTAGCGCG TTCATTTAAA GTTTTATTGC CAATATTTTT TGTCATGGTG TTCTTTTCCG CACTTAATTA TTGCCTGACA CTGATATCCC CGGCAGGATT AAACGACCTC ATTTACACAT TAATCCAGAC GCCGCTCAAA CATATGGGAA CGAATATCTT TGCGGTAATT ATCCTGGGGG CTGTGGGTAA TTTCCTGTGG GTGCTGGGGA TCCACGGACC TAATACCACC TCGGCAATTC GAGAAACTGT TTTTTCTGAG GCTAATCTGG AGAATCTCTC CTGGGCCGCT CAACACGGCA CTACCTGGGG CGCGCCATAT CCGATTACCT GGACTTCTAT TAATGATGCA TTCGCCAACT GCGGCGGTTC AGGTATGACG TTGGGGTTAT TGTTGGCTAT TTTTATCGCT TCTAAGCGTG CGGAATACCG TGATCTGGCA AAAATGTCAT TTATCCCCGG TATTTTCAAT ATCAATGAAC CGATAATGTT CGGCCTTCCT ATTGTACTTA ACCCCATCAT GATGGTGCCG TTTATTATGG TTCCTATTGT TAACTGTGCC ATTGGTTACT TCTTTGTTTC GATGGAAATT ATTCCACCGG TTGCTTATGC CGTGCCCTGG ACTACGCCCG GACCTTTAAT TGCTTTCCTC GGAACCGGGG GAAACTGGCT GGCGTTACTG GTGGGTTTTT TATGTTTAGG TGTGGCGACA ATGATCTATT TACCTTTTGT TATTGCCGCC AACAAGGTCA ATAACTTAAC AACTAACGGA TAA
|
Protein sequence | MGLMASFERG MERFLVPVAI KLNSQKHVAA VRDGFVFTFP IIMASSLIIL INFAILSPDG FIAGLLHLNS VFPNLEKAQA IFTPVMNGSV NIMSIMIAFL VARNMAISYE QDDLLCGLTA IGAFFIVYTP YQMIDGQAFL TTKYLGAQGL FVAVIVALIT SEIFCRLARN PKITITMPAA VPPAVARSFK VLLPIFFVMV FFSALNYCLT LISPAGLNDL IYTLIQTPLK HMGTNIFAVI ILGAVGNFLW VLGIHGPNTT SAIRETVFSE ANLENLSWAA QHGTTWGAPY PITWTSINDA FANCGGSGMT LGLLLAIFIA SKRAEYRDLA KMSFIPGIFN INEPIMFGLP IVLNPIMMVP FIMVPIVNCA IGYFFVSMEI IPPVAYAVPW TTPGPLIAFL GTGGNWLALL VGFLCLGVAT MIYLPFVIAA NKVNNLTTNG
|
| |