Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4877 |
Symbol | |
ID | 6143466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4992126 |
End bp | 4993304 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619681 |
Product | major facilitator transporter |
Protein accession | YP_001746788 |
Protein GI | 170680267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.429993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCGA CATCGCATCC CGTAGAACGT TTTTCTTTCA GCACCGCGTT ATTCGGGATG CTGGTTCTGA CCTTAGGTAT GGGTTTAGGC CGCTTTCTCT ATACGCCGAT GCTGCCAGTC ATGCTGGCGG AAGGCGAATT TTCCTTTAGC GAACTCTCAT GGATCGCCAG TGGTAACTAT GCCGGGTATC TGGCGGGGAG CCTGCTGTTT TCATTCGGCG CGTTTCATTT ACCCTCACGC CTGCGCCCGT TCTTGTTAGC TTCCGCCCTC GCAACCGGAT TATTAATCCT CGCGATGGCG TGGCTGCCGC CGTTTCTTCT GGTTTTCATC ATTCGCTTTC TGGCGGGGGT CGCCAGCGCC GGGATGTTGA TTTTCGGCTC AACGCTCATC ATGCAGCATA CTCGTCATCC CTTTGTCCTT GCGGCGCTAT TTTCTGGTGT TGGCGTCGGC ATCGCTCTGG GCAATGAATA TGTGCTGGCA GGCCTGCATT TTGCCCTCTC TTCACAAACG TTGTGGCAAG GTGCCGGAGC ACTTTCTGCC ATTATATTGC TTGCTCTGGC GCTGCTCATC CCGTCGAATA AACACGTTAT CCCGCCAGCG CCATTGGCAA AAATCGCGCA ACAACCCATG AGCTGGTGGT TACTGGCGAT TCTGTATGGT CTGGCGGGTT TTGGTTATAT CATCGTCGCC ACCTACCTGC CGCTCATGGC GAAAGACGCG GGCCAGCCTG TGCTGACGGC TCACCTCTGG ACACTGGTCG GCTTGTCGAT TGTCCCAGGT TGCTTTGGCT GGCTGTGGGC AGCCAAACGG TGGGGAGCAT TACCTTGCCT GACCGCGAAT TTGCTGGTGC AGGCGATCTG CGTGCTGTTA ACCCTCGCCA GCAGCTCTCC TTTATTACTC ATCATCAGCA GTATTGGTTT TGGCGGCACC TTTATGGGAA CGACCTCGCT GGTGATGACC ATCGCCCGCC AGCTTAGCGT GCCGGGAAAT CTTAACCTTT TGGGCTTTGT GACACTCATT TATGGTATCG GGCAAATTCT TGGCCCGGCG CTGACCAGTA TGCTCGGCAA CGGAACGTCG GCCCTCGCCA GTGCCACGCT CTGCGGCGCG GCGGCGCTAT TTATCGCAGC ATTAATCTGC GGGATGCAAA TATTCAAATT GCATACGAAT GATTCTTAA
|
Protein sequence | MNSTSHPVER FSFSTALFGM LVLTLGMGLG RFLYTPMLPV MLAEGEFSFS ELSWIASGNY AGYLAGSLLF SFGAFHLPSR LRPFLLASAL ATGLLILAMA WLPPFLLVFI IRFLAGVASA GMLIFGSTLI MQHTRHPFVL AALFSGVGVG IALGNEYVLA GLHFALSSQT LWQGAGALSA IILLALALLI PSNKHVIPPA PLAKIAQQPM SWWLLAILYG LAGFGYIIVA TYLPLMAKDA GQPVLTAHLW TLVGLSIVPG CFGWLWAAKR WGALPCLTAN LLVQAICVLL TLASSSPLLL IISSIGFGGT FMGTTSLVMT IARQLSVPGN LNLLGFVTLI YGIGQILGPA LTSMLGNGTS ALASATLCGA AALFIAALIC GMQIFKLHTN DS
|
| |