Gene EcSMS35_2699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2699 
Symbol 
ID6145582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2771161 
End bp2772159 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content55% 
IMG OID641617570 
Productputative sugar ABC transporter, permease protein 
Protein accessionYP_001744735 
Protein GI170680743 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCTT CGTCATTACC ATTGCCGCAG GGCAAGAGCG TCTCGCTCAA ACAATTTGTC 
AGTCGCCATA TTAATGAGAT CGGTTTGCTG GTGGTGATTG CCATTTTGTA TCTGGTCTTC
TCCCTGAACG CGCCGGGTTT TATCTCATTG AATAACCAGA TGAACGTGCT GCGTGATGCC
GCCACCATTG GGATCGCCGC CTGGGCGATG ACGCTGATTA TTATCTCCGG TGAAATTGAT
GTCAGCGTTG GGCCGATGGT GGCTTTTGTC TCGGTGTGCC TGGCATTTTT GCTGCAATTT
GACGTTCCGC TGGCGATTGC TTGTCTGCTG GTGTTGCTGT TAGGTGCGCT GATGGGAACG
CTCGCCGGGG TGCTGCGCGG CGTGTTTAAC GTGCCAAGTT TTGTTGCCAC GCTGGGTTTA
TGGAGCGCCC TGCGCGGAAT GGGGCTGTTT ATGACGAACG CCTTGCCAGT GCCAATTAAC
GAAAACGAAG TGCTGGACTG GCTGGGCGGA CAATTTCTCG GTGTGCCGGT ATCCGCGCTG
ATCATGATGG TGTTATTTGC GCTGTTTGTG TTCATTAGCC GCAAAACCGC CTTCGGGCGC
TCGGTTTTTG CTGTTGGCGG TAATGCCACT GCAGCGCAGT TGTGCGGTAT TAACGTTCGT
CGGGTACGCA TTCTTATTTT TACCCTTTCG GGATTATTAG CGGCGGTGAC CGGCATTTTG
TTGGCGGCGC GCCTCGGTTC CGGTAACGCA GGTGCCGCAA ACGGTCTGGA GTTTGACGTC
ATCGCCGCGG TCGTCGTCGG CGGTACGGCG CTTTCCGGTG GTCGCGGCTC GTTGTTCGGT
ACGCTGCTTG GCGTACTGGT GATTACGCTA ATCGGTAACG GTCTGGTGCT GCTCGGGATT
AACTCCTTTT TCCAGCAGGT GGTGCGCGGC GTCATCATCG TGGTGGCGGT GCTGGCGAAT
ATCTTGCTGA CCCAGCGTAG CAGTAAAGCG AAACGCTAA
 
Protein sequence
MSASSLPLPQ GKSVSLKQFV SRHINEIGLL VVIAILYLVF SLNAPGFISL NNQMNVLRDA 
ATIGIAAWAM TLIIISGEID VSVGPMVAFV SVCLAFLLQF DVPLAIACLL VLLLGALMGT
LAGVLRGVFN VPSFVATLGL WSALRGMGLF MTNALPVPIN ENEVLDWLGG QFLGVPVSAL
IMMVLFALFV FISRKTAFGR SVFAVGGNAT AAQLCGINVR RVRILIFTLS GLLAAVTGIL
LAARLGSGNA GAANGLEFDV IAAVVVGGTA LSGGRGSLFG TLLGVLVITL IGNGLVLLGI
NSFFQQVVRG VIIVVAVLAN ILLTQRSSKA KR