Gene EcSMS35_1506 details


Gene Information

Locus tag: EcSMS35_1506
Symbol: ydiM
ID: 6146007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1490578
End bp: 1491792
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 46%
IMG OID: 641616384
Product: major facilitator family transporter
Protein accession: YP_001743564
Protein GI: 170681306
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
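
The start and end coordinates, gene length, and protein length listed above agree with one another, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1490578, 1491792)
print(length_bp, protein_length(length_bp))  # prints: 1215 404
```

Both values match the Gene Length and Protein Length fields of the record.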


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.340818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.000395679
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAATC CCTATTACCC TACCGCACTG GGGTTGTATT TTAATTATCT GGTGCATGGT 
ATGGGCGTCA TTTTGATGAG CCTGAATATG GCCTCGCTGG AGACGCTTTG GCAGACTAAT
GCCGCGGGCG TCTCGATAGT TATCTCATCA CTGGGCATTG GTCGATTAAG TGTCTTGCTT
TTTGCAGGAT TATTATCCGA TCGCTTTGGT CGCCGCCCTT TTATCATGCT CGGGATGTGC
TGCTATATGG CCTTCTTTTT TGGCATCCTG CACACCAATA ACATCATTAT CGCTTATGTT
TTTGGCTTTC TGGCGGGAAT GGCAAACAGT TTTCTCGATG CAGGCACCTA TCCCAGTTTG
ATGGAAGCTT TTCCACGCTC ACCGGGTACA GCCAATATTT TAATTAAAGC ATTTGTTTCC
AGCGGACAAT TTTTATTACC GCTAATTATC AGCCTGTTAG TGTGGGCTGA ACTGTGGTTC
GGTTGGTCCT TTATGATTGC CGCAGGCATT ATGTTTATTA ACGCTCTGTT TTTATACCGT
TGTACGTTCC CACCCCATCC GGGTCGTCGC TTACCTGTCA TAAAGAAAAC CACCAGCTCT
ACGGAACATC GCTGTTCAAT TATCGATTTA GCCAGTTATA CCTTATATGG CTATATCTCA
ATGGCAACGT TTTATCTGGT TAGCCAGTGG CTGGCACAAT ACGGACAATT TGTTGCAGGC
ATGTCATACA CTATGTCGAT CAAACTACTC AGTATCTACA CCGTGGGTTC GCTGCTTTGT
GTATTTATTA CCGCTCCACT CATTCGTAAT ACCGTTCGCC CAACAACATT ACTGATGCTG
TACACCTTTA TCTCATTTAT CGCCCTGCTC ACCGTCTGCC TGCATCCCAC ATTTTATGTG
GTGATAATAT TTGCTTTCGT CATCGGTTTT ACCTCCGCAG GCGGTGTTGT ACAAATTGGC
CTGACGTTAA TGGCGGAACG TTTTCCCTAC GCTAAAGGAA AAGCAACGGG GATCTATTAC
AGTGCGGGCA GCATTGCAAC ATTTACCATT CCGTTGATTA CGGCTCATCT TTCGCAAAGA
AGTATTGCCG ATATTATGTG GTTCGATACC GCCATCGCTG CCATCGGCTT TATACTGGCA
CTGTTTATTG GCTTACGCAG TCGCAAAGAA ACGCGGCATC ACTCGCTAAA GGAAAATGTC
GCTCCGGGTG GGTAA
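
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before measuring its length or GC content. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch counts only unambiguous G and C bases.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGAAAAATC CCTATTACCC TACCGCACTG GGGTTGTATT TTAATTATCT GGTGCATGGT"
print(len(clean_sequence(first_line)))        # six blocks of ten bases
print(f"{gc_fraction(first_line):.0%}")       # GC content of this line only
```

Passing the full gene sequence to the same functions gives values that can be compared with the Gene Length and GC content fields of the record.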
 
Protein sequence
MKNPYYPTAL GLYFNYLVHG MGVILMSLNM ASLETLWQTN AAGVSIVISS LGIGRLSVLL 
FAGLLSDRFG RRPFIMLGMC CYMAFFFGIL HTNNIIIAYV FGFLAGMANS FLDAGTYPSL
MEAFPRSPGT ANILIKAFVS SGQFLLPLII SLLVWAELWF GWSFMIAAGI MFINALFLYR
CTFPPHPGRR LPVIKKTTSS TEHRCSIIDL ASYTLYGYIS MATFYLVSQW LAQYGQFVAG
MSYTMSIKLL SIYTVGSLLC VFITAPLIRN TVRPTTLLML YTFISFIALL TVCLHPTFYV
VIIFAFVIGF TSAGGVVQIG LTLMAERFPY AKGKATGIYY SAGSIATFTI PLITAHLSQR
SIADIMWFDT AIAAIGFILA LFIGLRSRKE TRHHSLKENV APGG
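
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first and last lines of the gene sequence. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons (the first codon here is ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids of the
# standard code (table 11 uses the same assignments; it differs in starts).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a blocked DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAAAATC CCTATTACCC TACCGCACTG"))  # prints: MKNPYYPTAL
print(translate("GCTCCGGGTG GGTAA"))                  # prints: APGG
```

The two outputs match the first ten and last four residues of the protein sequence above.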