Gene EcSMS35_1634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1634 
Symbol 
ID6143947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1624014 
End bp1625336 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content44% 
IMG OID641616510 
ProductPTS system lactose/cellobiose family IIC subunit 
Protein accessionYP_001743688 
Protein GI170680463 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTAA TGGCCTCATT CGAACGTGGA ATGGAACGTT TTCTTGTTCC AGTTGCTATC 
AAGTTAAACT CACAAAAACA TGTTGCAGCG GTGAGAGATG GATTCGTTTT TACGTTTCCA
ATTATCATGG CAAGCTCATT AATTATATTA ATTAACTTTG CCATATTATC GCCCGACGGC
TTTATTGCCG GGCTGCTGCA TCTGAACAGC GTTTTCCCCA ACCTTGAAAA AGCACAAGCT
ATTTTTACTC CGGTAATGAA TGGTTCTGTA AATATCATGT CAATTATGAT TGCTTTCCTG
GTCGCCAGGA ATATGGCGAT TAGCTATGAG CAAGATGATC TTTTATGCGG ATTAACGGCA
ATAGGAGCAT TTTTTATTGT ATATACGCCA TATCAGATGA TAGATGGGCA AGCATTCCTG
ACGACCAAAT ATCTCGGCGC GCAGGGGTTA TTTGTTGCTG TTATCGTTGC ACTGATCACC
AGTGAAATAT TTTGTCGCTT AGCTCGAAAC CCCAAAATCA CCATCACGAT GCCGGCAGCT
GTACCTCCTG CGGTAGCGCG TTCATTTAAA GTTTTATTGC CAATATTTTT TGTCATGGTG
TTCTTTTCCG CACTTAATTA TTGCCTGACA CTGATATCCC CGGCAGGATT AAACGACCTC
ATTTACACAT TAATCCAGAC GCCGCTCAAA CATATGGGAA CGAATATCTT TGCGGTAATT
ATCCTGGGGG CTGTGGGTAA TTTCCTGTGG GTGCTGGGGA TCCACGGACC TAATACCACC
TCGGCAATTC GAGAAACTGT TTTTTCTGAG GCTAATCTGG AGAATCTCTC CTGGGCCGCT
CAACACGGCA CTACCTGGGG CGCGCCATAT CCGATTACCT GGACTTCTAT TAATGATGCA
TTCGCCAACT GCGGCGGTTC AGGTATGACG TTGGGGTTAT TGTTGGCTAT TTTTATCGCT
TCTAAGCGTG CGGAATACCG TGATCTGGCA AAAATGTCAT TTATCCCCGG TATTTTCAAT
ATCAATGAAC CGATAATGTT CGGCCTTCCT ATTGTACTTA ACCCCATCAT GATGGTGCCG
TTTATTATGG TTCCTATTGT TAACTGTGCC ATTGGTTACT TCTTTGTTTC GATGGAAATT
ATTCCACCGG TTGCTTATGC CGTGCCCTGG ACTACGCCCG GACCTTTAAT TGCTTTCCTC
GGAACCGGGG GAAACTGGCT GGCGTTACTG GTGGGTTTTT TATGTTTAGG TGTGGCGACA
ATGATCTATT TACCTTTTGT TATTGCCGCC AACAAGGTCA ATAACTTAAC AACTAACGGA
TAA
 
Protein sequence
MGLMASFERG MERFLVPVAI KLNSQKHVAA VRDGFVFTFP IIMASSLIIL INFAILSPDG 
FIAGLLHLNS VFPNLEKAQA IFTPVMNGSV NIMSIMIAFL VARNMAISYE QDDLLCGLTA
IGAFFIVYTP YQMIDGQAFL TTKYLGAQGL FVAVIVALIT SEIFCRLARN PKITITMPAA
VPPAVARSFK VLLPIFFVMV FFSALNYCLT LISPAGLNDL IYTLIQTPLK HMGTNIFAVI
ILGAVGNFLW VLGIHGPNTT SAIRETVFSE ANLENLSWAA QHGTTWGAPY PITWTSINDA
FANCGGSGMT LGLLLAIFIA SKRAEYRDLA KMSFIPGIFN INEPIMFGLP IVLNPIMMVP
FIMVPIVNCA IGYFFVSMEI IPPVAYAVPW TTPGPLIAFL GTGGNWLALL VGFLCLGVAT
MIYLPFVIAA NKVNNLTTNG